Overview    Publications    Projects    Team    Services    Teaching   

Current Projects

My research projects are mainly on data systems and involve collaborations with other professors at UTCS:

  • A Scalable and Efficient Distributed Database over CXL memory (with Prof. Witchel)
  • A Multi-Version Database with Protocols and Storages Co-Design
  • Automatic Data Layout Designs for Knowledge Databases
  • A Unified Execution Engine for Large-Scale Multi-Modal Data Analysis
  • An Optimization Framework for Data-Intensive User-Defined Programs (with Prof. Dillig)

Past Projects

My past research in PhD and Postdoc was focused on supporting user-centered data analytical applications at scale by reshaping modern data analytical stacks. The major projects I worked on include:

  • FormS: a Python library that efficiently translates spreadsheet formulas to SQL queries
  • Smash: a string distance metric that captures acronyms, abbreviations, and typos together.
  • Transactional Panorama: a conceptual framework for user perception in analytical visual interfaces
  • Taco: efficient and compact spreadsheet formula graphs
  • Modin: a scalable dataframe system
  • Lux: a visualization recommendation library for data scientists to perform easy data exploration in dataframe workflow
  • CrocodileDB: a new database architecture that exploits time slackness to enable new resource-efficient query execution (video)
  • ACC: a high-performance main-memory database that adaptively choosees and mixes multiple concurrency control protocols