Data.ai
Overview
Foundational and applied work in data management and analytics: scalable databases and pipeline automation, data integration and cleaning, big-data processing, and tools for exploratory and statistical analysis.
Location
KR1
Principal Investigators
Recent Publications
-
Efficient Dataframe Systems: Lazy Fat Pandas on a Diet
Bhushan Pal Singh, Priyesh Kumar, Chiranmoy Bhattacharya, S Sudarshan
arXiv preprint arXiv:2501.08207 (2025)
-
Scheduling of intermittent query processing
Saranya Chandrasekaran, S Sudarshan
International Database Engineered Applications Symposium (2024)
-
PACMMOD Volume 2 Issue 3
Divyakant Agrawal, Alexandra Meliou, S Sudarshan
Proceedings of the ACM on Management of Data 2 (3) (2024)
-
Data Generation for Testing Complex Queries
Sunanda Somwase, Parismita Das, S Sudarshan
arXiv preprint arXiv:2409.18821 (2024)
-
Text-to-SQL Calibration: No Need to Ask--Just Rescale Model Probabilities
Ashwin Ramachandran, Sunita Sarawagi
arXiv preprint arXiv:2411.16742 (2024)
-
Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class
Annie D'souza, Sunita Sarawagi
arXiv preprint arXiv:2412.15657 (2024)
-
Shapley Values for Explanation in Two-sided Matching Applications.
Suraj Shetiya, Ian P Swift, Abolfazl Asudeh, Gautam Das
EDBT (2024)
-
Scheduling of Intermittent Query Processing.
C Saranya, S Sudarshan
CoRR (2023)
-
Modern AI for Analyzing Large Structured Databases: Opportunities and Challenges
Sunita Sarawagi
2023 IEEE 30th International Conference on High Performance Computing (2023)
-
CRUSH4SQL: Collective retrieval using schema hallucination for Text2SQL
Mayank Kothyari, Dhruva Dhingra, Sunita Sarawagi, Soumen Chakrabarti
EMNLP 2023 (Main) (2023)