Search Results - "Shagan Sah"
-
1
Understanding temporal structure for video captioning
Published in Pattern analysis and applications : PAA (01-02-2020)“…Recent research in convolutional and recurrent neural networks has fueled incredible advances in video understanding. We propose a video captioning framework…”
Journal Article -
2
Semantic Text Summarization of Long Videos
Published in 2017 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-03-2017)“…Long videos captured by consumers are typically tied to some of the most important moments of their lives, yet ironically are often the least frequently…”
Conference Proceeding -
3
Robust Spatial Filtering With Graph Convolutional Neural Networks
Published in IEEE journal of selected topics in signal processing (01-09-2017)“…Convolutional neural networks (CNNs) have recently led to incredible breakthroughs on a variety of pattern recognition problems. Banks of finite-impulse…”
Journal Article -
4
Show, Translate and Tell
Published in 2019 IEEE International Conference on Image Processing (ICIP) (01-09-2019)“…Humans have an incredible ability to process and understand information from multiple sources such as images, video, text, and speech. Recent success of deep…”
Conference Proceeding -
5
Key frame extraction for salient activity recognition
Published in 2016 23rd International Conference on Pattern Recognition (ICPR) (01-12-2016)“…Surveillance cameras have become big business, with most metropolitan cities spending millions of dollars to watch residents, both from street corners, public…”
Conference Proceeding -
6
Towards 3D convolutional neural networks with meshes
Published in 2017 IEEE International Conference on Image Processing (ICIP) (01-09-2017)“…Voxels are an effective approach to 3D mesh and point cloud classification because they build upon mature Convolutional Neural Network concepts. We show…”
Conference Proceeding -
7
Multi-Modal Deep Learning to Understand Vision and Language
Published 01-01-2018“…Developing intelligent agents that can perceive and understand the rich visual world around us has been a long-standing goal in the field of artificial…”
Dissertation -
8
Multi Stage Common Vector Space for Multimodal Embeddings
Published in 2019 IEEE Applied Imagery Pattern Recognition Workshop (AIPR) (01-10-2019)“…Deep learning frameworks have proven to be very effective at tasks like classification, segmentation, detection, and translation. Before being processed by a…”
Conference Proceeding -
9
Image description through fusion based recurrent multi-modal learning
Published in 2016 IEEE International Conference on Image Processing (ICIP) (01-09-2016)“…Current research in computer vision and machine learning has demonstrated some great abilities at detecting and recognizing objects in natural images. The…”
Conference Proceeding -
10
Semantically Invariant Text-to-Image Generation
Published in 2018 25th IEEE International Conference on Image Processing (ICIP) (01-10-2018)“…Image captioning has demonstrated models that are capable of generating plausible text given input images or videos. Further, recent work in image generation…”
Conference Proceeding -
11
Multimodal Reconstruction Using Vector Representation
Published in 2018 25th IEEE International Conference on Image Processing (ICIP) (01-10-2018)“…Recent work has demonstrated that neural embedding from multiple modalities can be utilized to focus the results of generative adversarial networks. However,…”
Conference Proceeding -
12
Batch-normalized recurrent highway networks
Published in 2017 IEEE International Conference on Image Processing (ICIP) (01-09-2017)“…Gradient control plays an important role in feed-forward networks applied to various computer vision tasks. Previous work has shown that Recurrent Highway…”
Conference Proceeding -
13
Vector Learning for Cross Domain Representations
Published in 2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR) (01-10-2017)“…Recently, generative adversarial networks have gained a lot of popularity for image generation tasks. However, such models are associated with complex learning…”
Conference Proceeding -
14
Adaptive hierarchical classification networks
Published in 2016 23rd International Conference on Pattern Recognition (ICPR) (01-12-2016)“…Hierarchical decomposition enables increased number of classes in a classification problem. Class similarities guide the creation of a family of coarse to fine…”
Conference Proceeding -
15
A multi-temporal fusion-based approach for land cover mapping in support of nuclear incident response
Published 01-01-2013“…An increasingly important application of remote sensing is to provide decision support during emergency response and disaster management efforts. Land cover…”
Dissertation -
16
General-Purpose Deep Point Cloud Feature Extractor
Published in 2018 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-03-2018)“…Depth sensors used in autonomous driving and gaming systems often report back 3D point clouds. The lack of structure from these sensors does not allow these…”
Conference Proceeding -
17
Show, Translate and Tell
Published 14-03-2019“…Humans have an incredible ability to process and understand information from multiple sources such as images, video, text, and speech. Recent success of deep…”
Journal Article -
18
Adaptive Hierarchical Decomposition of Large Deep Networks
Published 17-07-2020“…Deep learning has recently demonstrated its ability to rival the human brain for visual object recognition. As datasets get larger, a natural question to ask…”
Journal Article -
19
Semantic sentence embeddings for paraphrasing and text summarization
Published in 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) (01-11-2017)“…This paper introduces a sentence to vector encoding framework suitable for advanced natural language processing. Our latent representation is shown to encode…”
Conference Proceeding -
20
Multistream hierarchical boundary network for video captioning
Published in 2017 IEEE Western New York Image and Signal Processing Workshop (WNYISPW) (01-11-2017)“…Video understanding has become increasingly important as surveillance, social, and informational videos weave themselves into our everyday lives. Video…”
Conference Proceeding