Search Results - "Shaham, Uri"
-
1
Understanding adversarial training: Increasing local stability of supervised models through robust optimization
Published in Neurocomputing (Amsterdam) (13-09-2018)“…We show that adversarial training of supervised learning models is in fact a robust optimization procedure. To do this, we establish a general framework for…”
Get full text
Journal Article -
2
DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network
Published in BMC medical research methodology (26-02-2018)“…Medical practitioners use survival models to explore and understand the relationships between patients' covariates (e.g. clinical and genetic features) and the…”
Get full text
Journal Article -
3
Learning by coincidence: Siamese networks and common variable learning
Published in Pattern recognition (01-02-2018)“…•Siamese networks are proposed for unsupervised learning of a common source of variability in multimodal data.•The common source of variability is identified…”
Get full text
Journal Article -
4
Efficient Long-Text Understanding with Short-Text Models
Published in Transactions of the Association for Computational Linguistics (22-03-2023)“…Transformer-based pretrained language models (LMs) are ubiquitous across natural language understanding, but cannot be applied to long sequences such as…”
Get full text
Journal Article -
5
Removal of batch effects using distribution-matching residual networks
Published in Bioinformatics (Oxford, England) (15-08-2017)“…Sources of variability in experimentally derived data include measurement error in addition to the physical phenomena of interest. This measurement error is a…”
Get full text
Journal Article -
6
Gating mass cytometry data by deep learning
Published in Bioinformatics (Oxford, England) (01-11-2017)“…Mass cytometry or CyTOF is an emerging technology for high-dimensional multiparameter single cell analysis that overcomes many limitations of…”
Get full text
Journal Article -
7
Deep unsupervised feature selection by discarding nuisance and correlated features
Published in Neural networks (01-08-2022)“…Modern datasets often contain large subsets of correlated features and nuisance features, which are not or loosely related to the main underlying structures of…”
Get full text
Journal Article -
8
Algorithms, Applications and Theoretical Properties of Deep Neural Networks
Published 2017“…Various theoretical and practical aspects of deep learning are discussed. We present approximation bounds for deep neural networks and their dependence on the…”
Get full text
Dissertation -
9
SpecRaGE: Robust and Generalizable Multi-view Spectral Representation Learning
Published 04-11-2024“…Multi-view representation learning (MvRL) has garnered substantial attention in recent years, driven by the increasing demand for applications that can…”
Get full text
Journal Article -
10
G-SPARC: SPectral ARchitectures tackling the Cold-start problem in Graph learning
Published 03-11-2024“…Graphs play a central role in modeling complex relationships across various domains. Most graph learning methods rely heavily on neighborhood information,…”
Get full text
Journal Article -
11
What Do You Get When You Cross Beam Search with Nucleus Sampling?
Published 20-07-2021“…We combine beam search with the probabilistic pruning technique of nucleus sampling to create two deterministic nucleus search algorithms for natural language…”
Get full text
Journal Article -
12
Sparse Binarization for Fast Keyword Spotting
Published 09-06-2024“…With the increasing prevalence of voice-activated devices and applications, keyword spotting (KWS) models enable users to interact with technology hands-free,…”
Get full text
Journal Article -
13
Neural Machine Translation without Embeddings
Published 21-08-2020“…Many NLP models operate over sequences of subword tokens produced by hand-crafted tokenization rules and heuristic subword induction algorithms. A simple…”
Get full text
Journal Article -
14
Efficient Long-Text Understanding with Short-Text Models
Published 01-08-2022“…Transformer-based pretrained language models (LMs) are ubiquitous across natural language understanding, but cannot be applied to long sequences such as…”
Get full text
Journal Article -
15
ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Published 23-05-2023“…We introduce ZeroSCROLLS, a zero-shot benchmark for natural language understanding over long texts, which contains only test and small validation sets, without…”
Get full text
Journal Article -
16
Causes and Cures for Interference in Multilingual Translation
Published 14-12-2022“…Multilingual machine translation models can benefit from synergy between different language pairs, but also suffer from interference. While there is a growing…”
Get full text
Journal Article -
17
Deep Ordinal Regression using Optimal Transport Loss and Unimodal Output Probabilities
Published 15-11-2020“…It is often desired that ordinal regression models yield unimodal predictions. However, in many recent works this characteristic is either absent, or…”
Get full text
Journal Article -
18
Instruction Induction: From Few Examples to Natural Language Task Descriptions
Published 22-05-2022“…Large language models are able to perform a task by conditioning on a few input-output demonstrations - a paradigm known as in-context learning. We show that…”
Get full text
Journal Article -
19
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Published 03-01-2024“…As instruction-tuned large language models (LLMs) gain global adoption, their ability to follow instructions in multiple languages becomes increasingly…”
Get full text
Journal Article -
20
CoverBench: A Challenging Benchmark for Complex Claim Verification
Published 06-08-2024“…There is a growing line of research on verifying the correctness of language models' outputs. At the same time, LMs are being used to tackle complex queries…”
Get full text
Journal Article