My past research in PhD and Postdoc was focused on supporting user-centered data analytical applications at scale by reshaping modern data analytical stacks. The major projects I worked on include:
- FormS: a Python library that efficiently translates spreadsheet formulas to SQL queries
- Smash: a string distance metric that captures acronyms, abbreviations, and typos together.
- Transactional Panorama: a conceptual framework for user perception in analytical
visual interfaces
- Taco: efficient and compact spreadsheet formula graphs
- Modin: a scalable dataframe system
- Lux:
a visualization recommendation library for data scientists to perform easy data exploration in dataframe workflow
- CrocodileDB:
a new database architecture that exploits time slackness to enable new resource-efficient query execution (video)
- ACC:
a high-performance main-memory database that adaptively choosees and mixes multiple concurrency control protocols