Qiang Zhu


Welcome to my homepage!

My name is Qiang Zhu. I am a Senior Data Scientist at LinkedIn. I am now was a Data Scientist at StumbleUpon. I am was a fourth year PhD student in the Department of Computer Science and Engineering at University of California, Riverside. My advisor is Dr. Eamonn Keogh. Before joining UCR, I received my BS degree in Computer Science from Xiamen University, China in 2007.

My research interests include Data Mining, Pattern Recognition and Information Retrieval. Currently I'm particularly interested in Similarity Search, Motif Discovery and Clustering for large-scale time series and shapes/images datasets.  

You can view my publications here. And selected projects I have done during my graduate school years.

My current research is being funded by the UCR Computational Anthropology Project.
(NSF 0803410)
  • (Dec 12, 2011) Today is another important day to me. I finally leave the ivory tower, after twenty years, and enter the real world from "stumbling" :) StumbleUpon is a recommendation engine which helps people explore new and interesting things from every corner of the Web. Currently we are serving more than 1 billion "stumbles" to 20 million "stumblers" monthly. As a Data Scientist, my job is to play with 30 billion user experience data records to discover interesting patterns and improve our recommendation strategies.

  • (Oct 31, 2011) I successfully defended my Ph.D. Dissertation "Data Mining Historical Manuscripts and Culture Artifacts" today! It is great to be done:) Many thanks to my committee members: my advisor Dr. Eamonn Keogh, Dr. Stefano Lonardi and Dr. Christian Shelton.

  • (Sep 19, 2011) We are pleased to announce a major expansion of the UCR Time Series Classification/Clustering datasets, the world's largest collection of time series datasets.
    Please reference the datasets in your paper as Keogh, E., Zhu, Q., Hu, B., Hao. Y., Xi, X., Wei, L. & Ratanamahatana, C. A. (2011). The UCR Time Series Classification/Clustering Homepage:www.cs.ucr.edu/~eamonn/time_series_data/

  • (Apr 2, 2011) Journal version of "Mother Fugger" will appear in the Knowledge and Information Systems (KAIS) soon, in which we provide more case studies and propose a novel framework helping improve the performance of image search engine like Google images.

  • (Nov 23, 2010) My paper "Mother Fugger: Mining Historical Manuscripts with Local Color Patches" [pdf] has been selected as one of the best papers in ICDM 2010.

  • (Sep 13, 2010) I got two papers accepted (among 72 regular papers) in ICDM 2010: "Mother Fugger: Mining Historical Manuscripts with Local Color Patches" with my advisor Dr. Eamonn Keogh, and "Fast and Flexible Multivariate Time Series Subsequence Search" with my fellows in NASA Ames Research Center.

  • (Aug 19, 2010) The Data Mining and Knowledge Discovery Journal (DMKD) accepts my paper "An Efficient and Effective Similarity Measure to Enable Data Mining of Petroglyphs" today. This is a vastly extended version of our SIGKDD 2009 paper.

  • (Apr 29, 2010) Mitsubishi Electric Research Laboratories (MERL) offered me an internship position today. I will work there from June to September.

  • (Apr 6, 2010) It is not always good news.. PricewaterhouseCoopers just notified me that they decided to close their Center for Advanced Research, and therefore have to withdraw my offer. I am in the market again...

  • (Mar 16, 2010) NASA Ames Research Center is in the process of open sourcing the code of Multivariate Time-Series Search project I participated in last summer.

  • (Mar 4, 2010) The 9th International Symposium on Intelligent Data Analysis (IDA 2010) accepts my paper "Using CAPTCHAs to Index Cultural Artifacts" [pdf] (with my advisor Eamonn Keogh) for the full oral presentation. In this paper, we proposed the first real-valued-response CAPTCHA (Let me know if it is not:-)).

  • (Mar 2, 2010) PricewaterhouseCoopers has offered me a summer internship in their Center for Advanced Research(CAR) in the San Jose, CA office.

  • Looking for a 2010 summer internship. Resume available on request.

  • (Aug 14, 2009) Today I advance to a PhD candidate.

  • (May 8, 2009) I won a Student Travel Award from KDD'09, which will be held in Paris this summer.

  • (Apr 17, 2009) I will be working as a summer intern at Intelligent Systems Division of NASA Ames Research Center this year, on a NASA-Google project of Multivariate Time-Series Search.

  • (Apr 10, 2009) My paper "Augmenting the Generalized Hough Transform to Enable the Mining of Petroglyphs"[pdf] (with Xiaoyue Wang, Eamonn Keogh and Sang-Hee Lee) has been accepted for inclusion in the proceedings and a regular presentation at SIGKDD'09.

  • Currently I'm looking for a summer 09 internship. Resume available on request.


Room 368, Engineering Building Unit 2
University of California, Riverside
CA 92521, USA
Email: a@b.c {a=qzhu; b=cs; c=ucr.edu}
We data mining practitioners should pay more attention on theoretic works to stop producing meaningless results.