
Jia Zou

Biography: 

Jia Zou has been a tenure-track Assistant Professor in the School of Computing, Informatics, and Decision Systems Engineering at Arizona State University, Tempe, since summer 2019. She also directs the CACTUS data-intensive systems lab, founded in summer 2020. Previously, she was a Research Scientist in the Department of Computer Science at Rice University, Houston, TX, and before that a researcher at IBM Research - China. She received her Ph.D. in Computer Science from Tsinghua University, China.

More project information is here.

 

Research Interests

1. Automatic Integration of Fast-Evolving Data Sources for Artificial Intelligence or Big Data analytics applications

2. Applying Deep Learning to Database Systems

3. Database Systems for Big Data Analytics and Machine Learning

 

[New!!!] We are now recruiting highly motivated graduate and undergraduate students (including one Ph.D. student starting in Spring 2021 or Fall 2021). If you are interested in applying deep learning techniques to Big Data management and database systems, please send your CV to jia.zou@asu.edu.

Don't worry about the Ph.D. application deadline; we are still hiring!

 

Education: 
  • Ph.D., Department of Computer Science, Tsinghua University, China

 

 

Research Group: 

Ph.D. Students

Lixi Zhou (lixi.zhou@asu.edu)

Master's Students

Pratik Barhate (pbarhate@asu.edu)

Amitabh Das (adas59@asu.edu)

Valay Dave (vddave@asu.edu)

Zijie Wang (zijiewang@asu.edu)

Publications: 


(Supervised students are marked with *)

 

2020

Jia Zou, Pratik Barhate*, Amitabh Das*, Arun Iyengar, Binhang Yuan, Dimitrije Jankov, and Chris Jermaine. "Lachesis: Automated Generation of Persistent Partitionings for UDF-Centric Analytics." arXiv:2006.16529 [cs.DB] (pre-submission) (14 pages)

Zijie Wang*, Lixi Zhou*, Jia Zou. "Integration of Fast-Evolving Data Sources Using A Deep Learning Approach." SFDI 2020, workshop co-located with VLDB 2020 (Accepted) (14 pages)

Jia Zou, Ming Zhao, Juwei Shi and Chen Wang. "WATSON: A Workflow-based Data Storage Optimizer for Analytics." MSST 2020 (Accepted) (14 pages)

Jia Zou, Arun Iyengar, and Chris Jermaine. "Architecture of a distributed storage that combines file system, memory and computation in a single layer." The VLDB Journal (2020): 1-25. [PDF] (Accepted) (25 pages)

 

2019 and before

Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine, Zekai J. Gao. Declarative recursive computation on an RDBMS, or, why you should use a database for distributed machine learning, VLDB 2019, PVLDB Volume 12 Issue 7. [14 pages] (PDF) (Honorable Mention, VLDB 2019 Best Paper Award runner-up, 2020 SIGMOD Research Highlight Award)

Jia Zou, Arun Iyengar, Chris Jermaine, Pangea: Monolithic Distributed Storage for Data Analytics, VLDB 2019, PVLDB Volume 12 Issue 6. [14 pages] (PDF)

Jia Zou, R Matthew Barnett, Tania Lorido-Botran, Shangyu Luo, Carlos Monroy, Sourav Sikdar, Kia Teymourian, Binhang Yuan, Chris Jermaine, PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development, SIGMOD 2018. [16 pages] (PDF)

Jia Zou, Juwei Shi, Tongping Liu, Zhao Cao, Chen Wang, Foreseer: Workload-aware Data Storage for MapReduce, ICDCS 2015. [2 pages]

Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi, Limei Jiao, Jia Zou, Chen Wang, Schema Management for Document Stores, VLDB 2015, PVLDB Volume 8 Issue 9. [12 pages]

Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, Chen Wang, MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs, VLDB 2014, PVLDB Volume 7 Issue 13. [12 pages]

Jia Zou, Gong Su, Arun Iyengar, Yu Yuan, Yi Ge, Design and Analysis of a Distributed Multi-leg Stock Trading System, ICDCS 2011. [12 pages]

Jia Zou, Jing Xiao, Rui Hou, Yanqi Wang, Frequent Instruction Sequential Pattern Mining in Hardware Sample Data, ICDM 2010. [6 pages]

Jia Zou, Zhiyong Liang, Yiqi Dai, Scalability Evaluation and Optimization of Multi-core SIP Proxy Server, ICPP 2008. [8 pages]

Jianguo Hao, Jia Zou, Yiqi Dai, A real-time payment scheme for SIP service based on hash chain, ICEBE 2008. [8 pages]

Jia Zou, Wei Xue, Zhiyong Liang, Yixin Zhao, Bo Yang and Ling Shao, SIP Parsing Offload: Design and Performance, GLOBECOM 2007. [6 pages]

Jia Zou, Yiqi Dai, Motivating and Modeling SIP Offload, ICCCN 2007. [6 pages]

 

Granted Patents

1. with Juwei Shi, Chen Wang, et al. Method and Apparatus for Generating Schema of Non-Relational Database. US Patent 10002142 B2, 2018

2. with Li Li, Juwei Shi, et al. Resource management in MapReduce architecture and architectural system. US Patent 9582334 B2, 2017

3. with Zhao Cao, Juwei Shi, et al. Scheduling and execution of tasks based on resource availability. US Patent 9495206 B2, 2016

4. with Heng Cao, Juwei Shi, et al. Determining location of a user of a mobile device. US Patent 9374800 B2, 2016

5. with Xiaotao Chang, Fei Chen, et al. Method and system for allocating FPGA resources. US Patent 9389915 B2, 2016

6. with Kun Wang, Tianyi Wang, et al. Data processing method, data query method in a database, and corresponding device. US Patent 9471612 B2, 2016

7. with Bo Yang, Juwei Shi, et al. Method and apparatus for processing database data in distributed database system. US Patent 10140351 B2, 2016

8. with Arun Iyengar, Su Gong, et al. Methods and systems for highly available coordinated transaction processing. US Patent 9146944 B2, 2015

9. with Stephen Heisig, Yanqi Wang, et al. Computer system performance analysis. US Patent 8639697 B2, 2014

10. with Arun Iyengar, Su Gong, et al. Systems and methods for multi-leg transaction processing. US Patent 8601479 B2, 2013

Fall 2020
Course Number    Course Title
CSE 412          Database Management
CSE 580          Practicum
CEN 580          Practicum
CSE 590          Reading and Conference
CEN 590          Reading and Conference
CEN 599          Thesis
CSE 599          Thesis
CEN 792          Research

Summer 2020
Course Number    Course Title
CEN 792          Research

Spring 2020
Course Number    Course Title
CSE 598          Special Topics

Fall 2019
Course Number    Course Title
CSE 205          Object-Oriented Program & Data
CSE 580          Practicum
CSE 590          Reading and Conference
CEN 590          Reading and Conference
CEN 599          Thesis

Honors / Awards: 

2020 SIGMOD Research Highlight Award

VLDB 2019 Best Paper Runner-up, Honorable Mention Award

Service: 
  • Program Vice Chair of IEEE Big Data 2020.

  • Organizing Committee Member of IEEE Service Hackathon 2020.

  • Program Committee Member of IEEE SmartDataServices 2020.

  • Program Committee Member of CIKM 2019.

  • Program Committee Member of IEEE Cluster 2018, 2019.

  • Program Committee Member of HotData I, the First International Workshop on Hot Topics in Big Data and Networking, in conjunction with ICCCN 2014.

  • Reviewer for IEEE Transactions on Parallel and Distributed Systems (TPDS), IEEE Transactions on Knowledge and Data Engineering (TKDE), The Journal of Supercomputing, etc. (https://publons.com/researcher/1466438/jia-zou/)

  • Co-lead of IBM GTO Topic (subtopic: Future of Data Management) 2011.

  • Technical Assistant to Director of IBM Research-China, 2011.

Industry Positions: 

Researcher, IBM Research - China, 2008-2014