Search Results - "Mitliagkas, Ioannis"
-
1
Joint Power and Admission Control for Ad-Hoc and Cognitive Underlay Networks: Convex Approximation and Distributed Implementation
Published in IEEE transactions on wireless communications (01-12-2011)“…Power control is important in interference-limited cellular, ad-hoc, and cognitive underlay networks, when the objective is to ensure a certain quality of…”
Get full text
Journal Article -
2
Asynchrony begets momentum, with an application to deep learning
Published in 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton) (01-09-2016)“…Asynchronous methods are widely used in deep learning, but have limited theoretical justification when applied to non-convex problems. We show that running…”
Get full text
Conference Proceeding -
3
Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
Published 07-02-2024“…Understanding the mechanisms through which neural networks extract statistics from input-label pairs through feature learning is one of the most important…”
Get full text
Journal Article -
4
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Published 16-07-2023“…In out-of-distribution (OOD) generalization tasks, fine-tuning pre-trained models has become a prevalent strategy. Different from most prior work that has…”
Get full text
Journal Article -
5
Performative Prediction with Neural Networks
Published 13-04-2023“…Performative prediction is a framework for learning models that influence the data they intend to predict. We focus on finding classifiers that are…”
Get full text
Journal Article -
6
Generating Tabular Data Using Heterogeneous Sequential Feature Forest Flow Matching
Published 20-10-2024“…Privacy and regulatory constraints make data generation vital to advancing machine learning without relying on real-world datasets. A leading approach for…”
Get full text
Journal Article -
7
Compositional Risk Minimization
Published 08-10-2024“…In this work, we tackle a challenging and extreme form of subpopulation shift, which is termed compositional shift. Under compositional shifts, some…”
Get full text
Journal Article -
8
Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptation
Published 22-06-2022“…It is common in computer vision to be confronted with domain shift: images which have the same class but different acquisition conditions. In domain adaptation…”
Get full text
Journal Article -
9
Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation
Published 05-07-2023“…We tackle the problems of latent variables identification and ``out-of-support'' image generation in representation learning. We show that both are possible…”
Get full text
Journal Article -
10
No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths
Published 20-06-2023“…Understanding the optimization dynamics of neural networks is necessary for closing the gap between theory and practice. Stochastic first-order optimization…”
Get full text
Journal Article -
11
Convergence Analysis and Implicit Regularization of Feedback Alignment for Deep Linear Networks
Published 20-10-2021“…We theoretically analyze the Feedback Alignment (FA) algorithm, an efficient alternative to backpropagation for training neural networks. We provide…”
Get full text
Journal Article -
12
Solving Hidden Monotone Variational Inequalities with Surrogate Losses
Published 07-11-2024“…Deep learning has proven to be effective in a wide variety of loss minimization problems. However, many applications of interest, like minimizing projected…”
Get full text
Journal Article -
13
Gradient penalty from a maximum margin perspective
Published 15-10-2019“…A popular heuristic for improved performance in Generative adversarial networks (GANs) is to use some form of gradient penalty on the discriminator. This…”
Get full text
Journal Article -
14
Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation
Published 03-11-2022“…We study the problem of model selection in causal inference, specifically for conditional average treatment effect (CATE) estimation. Unlike machine learning,…”
Get full text
Journal Article -
15
A Reproducible and Realistic Evaluation of Partial Domain Adaptation Methods
Published 03-10-2022“…Unsupervised Domain Adaptation (UDA) aims at classifying unlabeled target images leveraging source labeled ones. In this work, we consider the Partial Domain…”
Get full text
Journal Article -
16
Improving Gibbs Sampler Scan Quality with DoGS
Published 18-07-2017“…The pairwise influence matrix of Dobrushin has long been used as an analytical tool to bound the rate of convergence of Gibbs sampling. In this work, we use…”
Get full text
Journal Article -
17
YellowFin and the Art of Momentum Tuning
Published 12-06-2017“…Hyperparameter tuning is one of the most time-consuming workloads in deep learning. State-of-the-art optimizers, such as AdaGrad, RMSProp and Adam, reduce this…”
Get full text
Journal Article -
18
Towards efficient representation identification in supervised learning
Published 10-04-2022“…Humans have a remarkable ability to disentangle complex sensory inputs (e.g., image, text) into simple factors of variation (e.g., shape, color) without much…”
Get full text
Journal Article -
19
Understanding Adam Requires Better Rotation Dependent Assumptions
Published 25-10-2024“…Despite its widespread adoption, Adam's advantage over Stochastic Gradient Descent (SGD) lacks a comprehensive theoretical explanation. This paper investigates…”
Get full text
Journal Article -
20
Gradient Descent Is Optimal Under Lower Restricted Secant Inequality And Upper Error Bound
Published 01-03-2022“…Advances in Neural Information Processing Systems 35 (2022): 24893-24904 The study of first-order optimization is sensitive to the assumptions made on the…”
Get full text
Journal Article