Yanping ChenPh.D Candidate in Computer Science and Engineering Center for Research in Data Mining and Machine Learning University of Callifornia, Riverside Email: ychen053 [at] cs.ucr.edu |
||
[Grant Awarded | Research | Publications | Professional Activities | Awards and Honors | Links]
About Me | |
|
Grant Awarded | |
|
|
Our project, titled "Using Data to Understand Insect-Vectored Diseases", produces real-time information that can be used to plan effective suppression programs to combat problems such as malaria, by taking advantage of "The unreasonable effectiveness of data", obtained using our self-designed inexpensive sensors. |
Research | |
|
|
With an optical sensor, we can record on the order of millions of the “sound” of insect flights. The enormous amounts of data we collected allow us to take advantage of “The unreasonable effectiveness of data”. By exploring this huge dataset, we proposed a simple, but very accurate and robust classifier for insect classification. |
|
top) An audio snippet of a female Cx. stigmatosoma pursued by a male. bottom left) An audio snippet of a common house fly. bottom right) If we convert these sound snippets into periodograms, we can cluster and classify the insects. |
|
|
|
Direct application of off-the-shelf semi-supervised learning algorithms do not typically work well for time series. We explained why they typically fail, and we introduced a very simple but very effective fix. The fix requires only a single line of code, but it dramatically improves the performance of semi-supervised learning in time series problems. |
|
(a). Labeled dataset P and unlabeled dataset U; (b). Both ED/DTW failed to select the right object (U2) from U to add to P; (c) The proposed DTW-D succeeded to select U2; |
|
|
|
We proposed a framework for continuously discovering/learning patterns from time series data streams, with no/very little prior knowledge of the patterns to be learned. The data stream can be real-valued and never-ending. Our framework is very general and flexible. It is also scalable and robust to significant noise. |
|
(a). A pattern detected from a time series data stream, which was converted from an activity video. An oracle labeled it pushing and this pattern was added to the dictionary. (b) Another example of pushing was detected 9.6 minutes after it was discovered. |
Publications | |
|
Professional Activities | |
|