Industrial Experience

2018.May-2018.Aug [preprint]

Google Research, New York, USA,
Group: Structured Data Group
Mentor: Jialu Liu, Flip Korn.      Manager: Cong Yu.

Contextual Fact Ranking for Table Synthesis and Compression

  • Design deep learning models to rank facts based on the descriptive title/query

    • pointwise and pairwise model

    • enhance model by incorporating collection mapping information

  • Implement and evaluate various models in Tensorflow

2017.May-2017.Aug [AAAI paper] [Slides ppt] [Slides pdf]

Microsoft Research, Redmond, WA, USA,
Group: Data Management, Exploration & Mining(DMX) Group
Mentor: Bolin Ding, Chi Wang.      Manager: Surajit Chaudhuri.

Auto Configuration Selection for Machine Learning in Large Datasets

  • Propose Confidence Interval (CI)-based framework

    • extrapolate over training set size and derive the estimator

    • design scheduling strategy among candidate configurations

  • build a layer on top of Microsoft ML toolkit and Scikit-Learn in Python

2015.May-2015.Aug [SIGMOD paper]

Microsoft Research, Redmond, WA, USA,
Group: Data Management, Exploration & Mining(DMX) Group
Mentor: Bolin Ding.      Manager: Surajit Chaudhuri.

Approximate Query Processing for Aggregates

  • Design sophisticated sampling and indexing strategy

  • Achieve fast query processing with accuracy guarantee

2013.Jun-2013.Sep

Huawei Noah’s Ark Research Lab, Hong Kong,
Group: Large-scale Data Mining and Machine Learning
Mentor: Mingxuan Yuan.      Manager: Qiang Yang.

Spatial-temporal Data Analysis and Mining

  • Infer social strength for online social network based on physical trajectories

  • Conduct experiment on MBB(mobile broadband) data