Search Results - "Bhattacherjee, Souvik"
-
1
RStore: A Distributed Multi-Version Document Store
Published in 2018 IEEE 34th International Conference on Data Engineering (ICDE) (01-04-2018)“…We address the problem of compactly storing a large number of versions (snapshots) of a collection of keyed documents or records in a distributed environment,…”
Get full text
Conference Proceeding -
2
Efficient Layouts and Algorithms for Managing Versioned Datasets
Published 01-01-2018“…Version Control Systems were primarily designed to keep track of and provide control over changes to source code and have since provided an excellent way to…”
Get full text
Dissertation -
3
Predictive Caching Framework for Mobile Wireless Networks
Published in 2015 16th IEEE International Conference on Mobile Data Management (01-06-2015)“…With increasing popularity of Netflix, Yahoo! Video, etc., interactive multimedia services such as video-on-demand (VoD) provide an interesting and rich field…”
Get full text
Conference Proceeding -
4
RStore: A Distributed Multi-version Document Store
Published 21-02-2018“…We address the problem of compactly storing a large number of versions (snapshots) of a collection of keyed documents or records in a distributed environment,…”
Get full text
Journal Article -
5
L-Store: A Real-time OLTP and OLAP System
Published 15-01-2016“…Arguably data is the new natural resource in the enterprise world with an unprecedented degree of proliferation. But to derive real-time actionable insights…”
Get full text
Journal Article -
6
Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff
Published 19-05-2015“…The relative ease of collaborative data science and analysis has led to a proliferation of many thousands or millions of $versions$ of the same datasets in…”
Get full text
Journal Article -
7
Multidimensional Balanced Allocation for Multiple Choice & (1 + Beta) Processes
Published 03-11-2011“…Allocation of balls into bins is a well studied abstraction for load balancing problems.The literature hosts numerous results for sequential(single…”
Get full text
Journal Article -
8
Towards "Intelligent Compression" in Streams: A Biased Reservoir Sampling based Bloom Filter Approach
Published 03-11-2011“…With the explosion of information stored world-wide,data intensive computing has become a central area of research.Efficient management and processing of this…”
Get full text
Journal Article -
9
Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries
Published 03-11-2011“…Balanced allocation of online balls-into-bins has long been an active area of research for efficient load balancing and hashing applications.There exists a…”
Get full text
Journal Article -
10
Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams
Published 17-12-2012“…Applications involving telecommunication call data records, web pages, online transactions, medical records, stock markets, climate warning systems, etc.,…”
Get full text
Journal Article -
11
DataHub: Collaborative Data Science & Dataset Version Management at Scale
Published 02-09-2014“…Relational databases have limited support for data collaboration, where teams collaboratively curate and analyze large datasets. Inspired by software version…”
Get full text
Journal Article