Search Results - "Chavan, Arnav"

  • Showing 1 - 20 results of 20
Refine Results
  1. 1

    Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space by Chavan, Arnav, Shen, Zhiqiang, Liu, Zhuang, Liu, Zechun, Cheng, Kwang-Ting, Xing, Eric

    “…This paper explores the feasibility of finding an optimal sub-model from a vision transformer and introduces a pure vision transformer slimming (ViT-Slim)…”
    Get full text
    Conference Proceeding
  2. 2
  3. 3

    Rescaling CNN Through Learnable Repetition of Network Parameters by Chavan, Arnav, Bamba, Udbhav, Tiwari, Rishabh, Gupta, Deepak

    “…Deeper and wider CNNs are known to provide improved performance for deep learning tasks. However, most such networks have poor performance gain per parameter…”
    Get full text
    Conference Proceeding
  4. 4

    Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning by Chavan, Arnav, Tiwari, Rishabh, Bamba, Udbhav, Gupta, Deepak K.

    “…Gradient based meta-learning methods are prone to overfit on the meta-training set, and this behaviour is more prominent with large and complex networks…”
    Get full text
    Conference Proceeding
  5. 5

    On Designing Light-Weight Object Trackers Through Network Pruning: Use CNNS or Transformers? by Aggarwal, Saksham, Gupta, Taneesh, Sahu, Pawan K., Chavan, Arnav, Tiwari, Rishabh, Prasad, Dilip K., Gupta, Deepak K.

    “…Object trackers deployed on low-power devices need to be light-weight, however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy…”
    Get full text
    Conference Proceeding
  6. 6

    Surgical Feature-Space Decomposition of LLMs: Why, When and How? by Chavan, Arnav, Lele, Nahush, Gupta, Deepak

    Published 17-05-2024
    “…Low-rank approximations, of the weight and feature space can enhance the performance of deep learning models, whether in terms of improving generalization or…”
    Get full text
    Journal Article
  7. 7

    Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models by Chavan, Arnav, Lele, Nahush, Gupta, Deepak

    Published 12-12-2023
    “…Due to the substantial scale of Large Language Models (LLMs), the direct application of conventional compression methodologies proves impractical. The…”
    Get full text
    Journal Article
  8. 8

    Beyond Uniform Scaling: Exploring Depth Heterogeneity in Neural Architectures by T, Akash Guna R, Chavan, Arnav, Gupta, Deepak

    Published 19-02-2024
    “…Conventional scaling of neural networks typically involves designing a base network and growing different dimensions like width, depth, etc. of the same by…”
    Get full text
    Journal Article
  9. 9

    Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward by Chavan, Arnav, Magazine, Raghav, Kushwaha, Shubham, Debbah, Mérouane, Gupta, Deepak

    Published 02-02-2024
    “…Despite the impressive performance of LLMs, their widespread adoption faces challenges due to substantial computational and memory requirements during…”
    Get full text
    Journal Article
  10. 10

    One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning by Chavan, Arnav, Liu, Zhuang, Gupta, Deepak, Xing, Eric, Shen, Zhiqiang

    Published 13-06-2023
    “…We present Generalized LoRA (GLoRA), an advanced approach for universal parameter-efficient fine-tuning tasks. Enhancing Low-Rank Adaptation (LoRA), GLoRA…”
    Get full text
    Journal Article
  11. 11

    Patch Gradient Descent: Training Neural Networks on Very Large Images by Gupta, Deepak K, Mago, Gowreesh, Chavan, Arnav, Prasad, Dilip K

    Published 31-01-2023
    “…Traditional CNN models are trained and tested on relatively low resolution images (<300 px), and cannot be directly operated on large-scale images due to…”
    Get full text
    Journal Article
  12. 12

    Transfer Learning Gaussian Anomaly Detection by Fine-tuning Representations by Rippel, Oliver, Chavan, Arnav, Lei, Chucai, Merhof, Dorit

    Published 13-06-2022
    “…Current state-of-the-art anomaly detection (AD) methods exploit the powerful representations yielded by large-scale ImageNet training. However, catastrophic…”
    Get full text
    Journal Article
  13. 13

    Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning by Chavan, Arnav, Tiwari, Rishabh, Bamba, Udbhav, Gupta, Deepak K

    Published 03-06-2022
    “…Gradient based meta-learning methods are prone to overfit on the meta-training set, and this behaviour is more prominent with large and complex networks…”
    Get full text
    Journal Article
  14. 14

    ChipNet: Budget-Aware Pruning with Heaviside Continuous Approximations by Tiwari, Rishabh, Bamba, Udbhav, Chavan, Arnav, Gupta, Deepak K

    Published 14-02-2021
    “…Structured pruning methods are among the effective strategies for extracting small resource-efficient convolutional neural networks from their dense…”
    Get full text
    Journal Article
  15. 15

    Rescaling CNN through Learnable Repetition of Network Parameters by Chavan, Arnav, Bamba, Udbhav, Tiwari, Rishabh, Gupta, Deepak

    Published 14-01-2021
    “…Deeper and wider CNNs are known to provide improved performance for deep learning tasks. However, most such networks have poor performance gain per parameter…”
    Get full text
    Journal Article
  16. 16

    On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers? by Aggarwal, Saksham, Gupta, Taneesh, Sahu, Pawan Kumar, Chavan, Arnav, Tiwari, Rishabh, Prasad, Dilip K, Gupta, Deepak K

    Published 24-11-2022
    “…Object trackers deployed on low-power devices need to be light-weight, however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy…”
    Get full text
    Journal Article
  17. 17

    Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space by Chavan, Arnav, Shen, Zhiqiang, Liu, Zhuang, Liu, Zechun, Cheng, Kwang-Ting, Xing, Eric

    Published 03-01-2022
    “…This paper explores the feasibility of finding an optimal sub-model from a vision transformer and introduces a pure vision transformer slimming (ViT-Slim)…”
    Get full text
    Journal Article
  18. 18

    Multi-Plateau Ensemble for Endoscopic Artefact Segmentation and Detection by Jadhav, Suyog, Bamba, Udbhav, Chavan, Arnav, Tiwari, Rishabh, Raj, Aryan

    Published 23-03-2020
    “…http://ceur-ws.org/Vol-2595/endoCV2020_paper_id_20.pdf Endoscopic artefact detection challenge consists of 1) Artefact detection, 2) Semantic segmentation, and…”
    Get full text
    Journal Article
  19. 19
  20. 20