NSF III-Core-Small: MoveMine: Mining Sophisticated Patterns and Actionable Knowledge from Massive Moving Object Data

National Science Foundation Award Number: NSF IIS 10-17362 (09/01/2010-08/31/2015)

 Award Abstract Link @ NSF

 

 

Contact Information

 

Jiawei Han,  PI
Department of Computer Science
University of Illinois, Urbana-Champaign
1304 West Springfield Ave., Urbana, Illinois 61801 U.S.A.
Office: (217) 333-6903,   Fax: (217) 265-6494

E-mail: hanj at cs.uiuc.edu, URL: http://www.cs.uiuc.edu/~hanj

 

List of Supported Students and Staff

 

§  Zhenhui Li, Ph.D. student, Department of Computer Science, University of Illinois at Urbana-Champaign (duration working on this project: 2010-2012)

§  Lu An Tang, Ph.D. student, Department of Computer Science, University of Illinois at Urbana-Champaign  (duration working on this project: 2010-2013)

§  Manish Gupta, Ph.D. student, Department of Computer Science, University of Illinois at Urbana-Champaign (duration working on this project: 2010-2013)

§  Jingjing Wang, Ph.D. student, Department of Computer Science, University of Illinois at Urbana-Champaign (duration working on this project: 2012-present)

§  Chao Zhang, Ph.D. student, Department of Computer Science, University of Illinois at Urbana-Champaign (duration working on this project: 2013-present)

Project Award Information

§  Award Number: NSF IIS 10-17362

§  Duration: 09/01/2010-08/31/2013

§  Title: NSF III-Core-Small: MoveMine: Mining Sophisticated Patterns and Actionable Knowledge from Massive Moving Object Data

§  Keywords:  Moving object data mining; multidimensional data analysis; pattern discovery; spatiotemporal data analysis; traffic mining; efficiency and scalability

Project Summary

This research project investigates principles and methods for uncovering sophisticated patterns and actionable knowledge from massive moving object data.  Thanks to the rapid progress and broad adoption of sensor, GPS, wireless network, and other advanced technologies, moving object data have been accumulating at an unprecedented scale.  However, moving object data can be dynamic, sparse, scattered, and noisy, and the patterns and knowledge to be mined can be deeply hidden, sophisticated, and subtle.  The MoveMine project investigates effective and scalable methods for mining various kinds of complex patterns from dynamic and noisy moving object data, finding multiple interleaved periodic patterns, and performing in-depth multidimensional analysis of moving object data.  It integrates and extends multidisciplinary approaches derived from spatiotemporal data analysis, data mining, pattern recognition, statistics, and machine learning.  The study takes bird and animal movement data and traffic data as its major data sources.  However, the developed methods can be applied to the analysis of many other kinds of moving object data for environmental study, traffic control, law enforcement, and homeland security.  The study also addresses privacy and security protection while developing powerful pattern and knowledge discovery mechanisms.  The research results are to be published in various research and application forums and integrated into the educational programs at UIUC.  The progress of the project and the research results are also disseminated via the project Web site (http://www.cs.uiuc.edu/homes/hanj/projs/movemine.htm).

Publications and Products: (Note: major publications related to this project are in bold font)

Note:  Please search and download all the papers in PDF, if available, at our group’s publication website by following the link: Selected research publications.

Books (authored or edited)

 

1.      Jiawei Han, Micheline Kamber, and Jian Pei, Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011.

2.      Ashok N. Srivastava and Jiawei Han (eds.), Machine Learning and Knowledge Discovery for Engineering Systems Health Management: Detection, Diagnostics, and Prognostics, Chapman & Hall, 2011.

3.      David Lo, Siau-Cheng Khoo, Jiawei Han, and Chao Liu (eds.), Mining Software Specifications: Methodologies and Applications, Taylor & Francis, 2011.

4.      Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010 (586 + xxiii pages).

5.      Manish Gupta, Jing Gao, Charu Aggarwal, and Jiawei Han, Outlier Detection for Temporal Data, Morgan & Claypool Publishers, 2014.

 

Journal articles

 

1.      Chao Zhang, Jiawei Han, Lidan Shou, Jiajun Lu, and Thomas F. La Porta, "Splitter: Mining Fine Grained Sequential Patterns in Semantic Trajectories", PVLDB 7(9): 769-780, 2014 (Also, Proc. 2014 Int. Conf. on Very Large Data Bases (VLDB'14), Hangzhou, China, Sept. 2014.)

2.      Manish Gupta, Jing Gao, Charu C. Aggarwal, and Jiawei Han, "Outlier Detection for Temporal Data: A Survey", accepted by IEEE Trans. on Knowledge and Data Engineering (to appear), 2014.

3.      Tim Weninger, Thomas J. Johnston, and Jiawei Han, “The Parallel Path Framework for Entity Discovery on the Web", ACM Transactions on the Web, accepted March 2013.

4.      Mohammad Maifi Hasan Khan, Tarek Abdelzaher, Hieu K. Le, Hossein Ahmadi, and Jiawei Han, “Troubleshooting interactive complexity bugs in wireless sensor networks using data mining techniques”, ACM Transactions on Sensor Networks, 2013.

5.      Yizhou Sun, Brandon Norick, Jiawei Han, Xifeng Yan, Philip S. Yu, and Xiao Yu, "PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks", ACM Transactions on Knowledge Discovery from Data (TKDD), 2013.

6.      Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Joshi, and Jiawei Han, “Reinforced Similarity Integration in Image-Rich Information Networks", IEEE Transactions on Knowledge and Data Engineering (TKDE), 25(2):448-460, 2013.

7.      Yizhou Sun and Jiawei Han, “Mining Heterogeneous Information Networks: A Structural Analysis Approach", SIGKDD Explorations, 14(2):20-28, 2012.

8.      Duo Zhang, Chengxiang Zhai and Jiawei Han, “MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells and Its Applications", Statistical Analysis and Data Mining, 2012.

9.      Lu-An Tang, Yu Zheng, Jing Yuan, Jiawei Han, Alice Leung, Wen-Chih Peng, Thomas La Porta, “A Framework of Traveling Companion Discovery on Trajectory Data Streams", ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2012 (to appear)

10.  Jianbin Huang, Heli Sun, Qinbao Song, Hongbo Deng, and Jiawei Han, “Revealing Density-Based Clustering Structure from the Core-Connected Tree of a Network", IEEE Transactions on Knowledge and Data Engineering, accepted in Apr. 2012.

11.  Mohammad M. Masud, Qing Chen, Latifur Khan, Charu C. Aggarwal, Jing Gao, Jiawei Han, Ashok Srivastava and Nikunj C. Oza, “Classification and Adaptive Novel Class Detection of Feature-Evolving Data Streams", IEEE Transactions on Knowledge and Data Engineering, accepted in Apr. 2012.

12.  Lu-An Tang, Xiao Yu, Sangkyum Kim, Quanquan Gu, Jiawei Han, Alice Leung, Thomas La Porta, “Trustworthiness Analysis of Sensor Data in Cyber-Physical Systems”, accepted by Special Issue on Data Warehousing and Knowledge Discovery from Sensors and Streams, Journal of Computer and System Sciences (JCSS), April 2012.

13.  Lu-An Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Wen-Chih Peng, Yizhou Sun, Alice Leung, Thomas La Porta, “Multidimensional Sensor Data Analysis in Cyber-Physical Systems: An Atypical Cube Approach”, International Journal of Distributed Sensor Networks, Vol. 2012, 2012

14.  Zhijun Yin, Liangliang Cao, Quanquan Gu, and Jiawei Han, “A Probabilistic Model of Community-Based Latent Topic Analysis”, ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2012.

15.  Zhenhui Li, Jiawei Han, Bolin Ding, and Roland Kays, “Mining Periodic Behaviors of Object Movements for Animal and Biological Sustainability Studies”, Data Mining and Knowledge Discovery, 24(2):355-386, 2012.

16.  Lu Liu, Feida Zhu, Meng Jiang, Jiawei Han, Lifeng Sun, and Shiqiang Yang, “Mining diversity on social media networks”, Multimedia Tools and Applications, 56(1): 179-205 (2012)

18.  Mohammad Mehedy Masud, Clay Woolam, Jing Gao, Latifur Khan, Jiawei Han, Kevin W. Hamlen, and Nikunj C. Oza, “Facing the Reality of Data Stream Classification: Coping with Scarcity of Labeled Data”, Knowledge and Information Systems (KAIS), accepted June 2011.

19.  Liangliang Cao, Xin Jin, Zhijun Yin, Andrey Del Pozo, Jiebo Luo, Jiawei Han, Thomas S. Huang, “RankCompete: Simultaneous Ranking and Clustering of Information Networks”, Neurocomputing (special issue on Learning from Social Media Network), conditionally accepted 5/22/11.

20.  Lijun Zhang, Chun Chen, Jiajun Bu, Deng Cai, and Jiawei Han, “Locally Discriminative Co-Clustering”, IEEE Transactions on Knowledge and Data Engineering (TKDE), accepted 2/15/11.

21.  Bolin Ding, Bo Zhao, Cindy Xide Lin, Jiawei Han, Chengxiang Zhai, Ashok Srivastava, Nikunj C. Oza, “Efficient Keyword-Based Search for Top-K Cells in Text Cube", IEEE Transactions on Knowledge and Data Engineering (TKDE) (Special Issue: Keyword Search on Structured Data), accepted, Dec. 2010.

22.  Jie Yu, Xin Jin, Jiawei Han, Jiebo Luo, “Collection-based Sparse Label Propagation and Its Application on Social Group Suggestion from Photos", ACM Transactions on Intelligent Systems and Technology (TIST), 2(2):12, 2011

23.  Zhenhui Li, Jiawei Han, Ming Ji, Lu-An Tang, Yintao Yu, Bolin Ding, Jae-Gil Lee, and Roland Kays, “MoveMine: Mining Moving Object Data for Discovery of Animal Movement Patterns", ACM Transactions on Intelligent Systems and Technology (ACM TIST) (Special Issue on Computational Sustainability), 2(4):37, 2011.

25.  Mohammad M. Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham, “Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints", IEEE Transactions on Knowledge and Data Engineering, accepted Feb. 2010.

26.  Jae-Gil Lee, Jiawei Han, Xiaolei Li, and Hong Cheng, “Mining Discriminative Patterns for Classifying Trajectories on Road Networks", IEEE Transactions on Knowledge and Data Engineering, 23(5):713-725, 2011.

27.  Xin Jin, Sangkyum Kim, Jiawei Han, Liangliang Cao, and Zhijun Yin, “A General Framework for Efficient Clustering of Large Datasets Based on Activity Detection", Statistical Analysis and Data Mining, 4(1): 11-29, 2011.

28.  Hongyan Liu, Yuan Lin, and Jiawei Han, “Methods for Mining Frequent Items in Data Streams: An Overview", Knowledge and Information Systems, 26(1): 1-30, 2011.

29.  Deng Cai, Xiaofei He, and Jiawei Han, “Speed up Kernel Discriminant Analysis", VLDB Journal, 20(1): 21-33, 2011.

30.  Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei Han, Donato Malerba, “Unexpected Results in Automatic List Extraction on the Web”, SIGKDD Explorations, 12(2): 26-30, 2010.

31.  Zhenhui Li, Bolin Ding, Jiawei Han, and Roland Kays, “Swarm: Mining Relaxed Temporal Moving Object Clusters", PVLDB 3(1): 723-734, 2010. (Also, Proc. 2010 Int. Conf. on Very Large Data Bases (VLDB'10), Singapore, Sept. 2010.)

32.  Peixiang Zhao and Jiawei Han, “On Graph Query Optimization in Large Networks", PVLDB 3(1): 340-351, 2010. (Also, Proc. 2010 Int. Conf. on Very Large Data Bases (VLDB'10), Singapore, Sept. 2010.)

33.  Marisa Thoma, Hong Cheng, Arthur Gretton, Jiawei Han, Hans-Peter Kriegel, Alexander J. Smola, Le Song, Philip S. Yu, Xifeng Yan and Karsten M. Borgwardt, “Discriminative Frequent Subgraph Mining with Optimality Guarantees", Statistical Analysis and Data Mining, 3(5):302-318, 2010.

34.  Manish Gupta, Rui Li, Zhijun Yin, and Jiawei Han, “Survey on Social Tagging Techniques", SIGKDD Explorations, 12(1):58-72, 2010.

35.  Hector Gonzalez, Jiawei Han, Hong Cheng, Xiaolei Li, Diego Klabjan, and Tianyi Wu, “Modeling Massive RFID Datasets: A Gateway-Based Movement-Graph Approach", IEEE Transactions on Knowledge and Data Engineering, 22(1):90-104, 2010.

36.  Tianyi Wu, Yuguo Chen, and Jiawei Han, “Re-Examination of Interestingness Measures in Pattern Mining: A Unified Framework”, Data Mining and Knowledge Discovery, 21(3):371-397, 2010.

 

Book Chapters

 

1.      Zhenhui Li and Jiawei Han, "Mining Periodicity from Dynamic and Incomplete Spatiotemporal Data", in Wesley W. Chu (ed.), Data Mining and Knowledge Discovery for Big Data, pp. 41-82, Springer, 2014.

2.      Manish Gupta, Rui Li, Zhijun Yin, and Jiawei Han, “An Overview of Social Tagging Techniques", in Charu C. Aggarwal (ed.), Social Network Data Analysis, pp. 447-498, Springer, 2011.

3.      Xiaoxin Yin, Jiawei Han, and Philip S. Yu, “Scalable Link-Based Similarity Computation and Clustering", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 45-72.

4.      Hong Cheng, Xifeng Yan and Jiawei Han, “Discriminative Frequent Pattern-Based Graph Classification", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 237-264.

5.      Xiaoxin Yin, Jiawei Han, and Philip S. Yu, “Veracity Analysis and Object Distinction", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 283-306.

6.      Chen Chen, Feida Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu, and Raghu Ramakrishnan, “InfoNetOLAP: OLAP and Mining of Information Networks", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 411-438.

7.      Yizhou Sun and Jiawei Han, “Integrating Clustering and Ranking for Heterogeneous Information Network Analysis", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 439-474.

8.      Chen Chen, Cindy Xide Lin, Matt Fredrikson, Mihai Christodorescu, Xifeng Yan, and Jiawei Han, “Mining Large Information Network by Graph Summarization", in Philip S. Yu, Jiawei Han and Christos Faloutsos (eds.), Link Mining: Models, Algorithms and Applications, Springer, 2010, pp. 475-504.

9.      Tarek Abdelzaher, Mohammad Khan, Hieu Le, Hossein Ahmadi, and Jiawei Han, “Data Mining for Diagnostic Debugging in Sensor Networks: Preliminary Evidence and Lessons Learned", in Alfredo Cuzzocrea (ed.), Intelligent Techniques for Warehousing and Mining Sensor Network Data, IGI Global, 2010.

10.  Haixun Wang, Philip S. Yu, Jiawei Han, “Mining Concept-Drifting Data Streams", in Oded Maimon and Lior Rokach (Eds.), Data Mining and Knowledge Discovery Handbook, 2nd ed., Springer 2010 pp. 789-802.

 

Refereed Conference Publications

 

1.      Chi Wang, Xueqing Liu, Yanglei Song, and Jiawei Han, "Scalable Moment-based Inference for Latent Dirichlet Allocation", Proc. of 2014 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'14), Nancy, France, Sept. 2014.

2.      Xiang Ren, Jialu Liu, Xiao Yu, Urvashi Khandelwal, Quanquan Gu, Lidan Wang, and Jiawei Han, "ClusCite: Effective Citation Recommendation by Information Network-Based Clustering," Proc. 2014 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'14), New York, NY, Aug. 2014.

3.      Quanquan Gu, Tong Zhang, Jiawei Han, "Batch-Mode Active Learning via Error Bound Minimization", Proc. of 2014 Conf. on Uncertainty in Artificial Intelligence (UAI), Quebec City, Quebec, Canada, July 2014.

4.      Chi Wang, Xueqing Liu, Yanglei Song, Jiawei Han, "Scalable Moment-Based Learning for Topic Model", Proc. 2014 ICML Workshop on Method of Moments and Spectral Learning (ICML-MMSL'14), Beijing, China, June 2014.

5.      Wei Shen, Jiawei Han, and Jianyong Wang, "A Probabilistic Model for Linking Named Entities in Web Text with Heterogeneous Information Networks", Proc. of 2014 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'14), Snowbird, UT, June 2014.

6.      Fangbo Tao, Jiawei Han, Heng Ji, George Brova, Chi Wang, Brandon Norick, Ahmed El-Kishky, Jialu Liu, Xiang Ren, Yizhou Sun, "NewsNetExplorer: Automatic Construction and Exploration of News Information Networks", (system demo), Proc. of 2014 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'14), Snowbird, UT, June 2014.

7.      Xiao Yu, Xiang Ren, Yizhou Sun, Quanquan Gu, Bradley Sturt, Urvashi Khandelwal, Brandon Norick, and Jiawei Han, "Personalized Entity Recommendation: A Heterogeneous Information Network Approach", Proc. 2014 ACM Int. Conf. on Web Search and Data Mining (WSDM'14), New York City, NY, Feb. 2014.

8.      Xiao Yu, Hao Ma, Bo-June (Paul) Hsu and Jiawei Han, "On Building Entity Recommender Systems Using User Click Log and Freebase Knowledge", Proc. 2014 ACM Int. Conf. on Web Search and Data Mining (WSDM'14), New York City, NY, Feb. 2014. 

9.      Xiang Ren, Yujing Wang, Xiao Yu, Jun Yan, Zheng Chen, and Jiawei Han, "Heterogeneous Graph-Based Intent Learning with Queries, Web Pages and Wikipedia Concepts", Proc. 2014 ACM Int. Conf. on Web Search and Data Mining (WSDM'14), New York City, NY, Feb. 2014.

10.  Marina Danilevsky, Chi Wang, Nihit Desai, Xiang Ren, Jingyi Guo, and Jiawei Han, "Automatic Construction and Ranking of Topical Keyphrases on Collections of Short Documents", Proc. of 2014 SIAM Int. Conf. on Data Mining (SDM'14), Philadelphia, PA, April 2014.

11.  Manish Gupta, Arun Mallya, Subhro Roy, Jason H. D. Cho, and Jiawei Han, "Local Learning for Mining Outlier Subgraphs from Network Datasets", Proc. of 2014 SIAM Int. Conf. on Data Mining (SDM'14), Philadelphia, PA, April 2014.

12.  Manish Gupta, Jing Gao, Xifeng Yan, Hasan Cam, and Jiawei Han, "Top-K Interesting Subgraph Discovery in Information Networks", Proc. 2014 IEEE Int. Conf. on Data Engineering (ICDE'14), Chicago, IL, Mar. 2014.

14.  Hyungsul Kim, Xiang Ren, Yizhou Sun, Chi Wang, and Jiawei Han, "Semantic Frame-Based Document Representation for Comparable Corpora", Proc. 2013 IEEE Int. Conf. on Data Mining (ICDM'13), Austin, TX, Dec. 2013, pp. 350-359.

15.  Chi Wang, Marina Danilevsky, Jialu Liu, Nihit Desai, Heng Ji, and Jiawei Han, "Constructing Topical Hierarchies in Heterogeneous Information Networks", Proc. 2013 IEEE Int. Conf. on Data Mining (ICDM'13), Austin, TX, Dec. 2013, pp. 767-776.

16.  Scott Deeann Chen, Ying-Yu Chen, Jiawei Han, and Pierre Moulin, "A Feature-Enhanced Ranking-Based Classifier for Multimodal Data and Heterogeneous Information Networks", Proc. 2013 IEEE Int. Conf. on Data Mining (ICDM'13), Austin, TX, Dec. 2013, pp. 997-1002.

17.  Chi Wang, Xiao Yu, Yanen Li, Chengxiang Zhai, and Jiawei Han, "Content Coverage Maximization on Word Networks for Hierarchical Topic Summarization", Proc. of 2013 Int. Conf. on Information and Knowledge Management (CIKM'13), San Francisco, CA, Oct. 2013, pp. 249-258.

18.  Manish Gupta, Jing Gao, Xifeng Yan, Hasan Cam, and Jiawei Han, “On Detecting Association-Based Clique Outliers in Heterogeneous Information Networks", Proc. of 2013 IEEE/ACM Int. Conf. on Social Networks Analysis and Mining (ASONAM'13), Niagara Falls, Canada, Aug. 2013

19.  Tim Weninger, Xihao Avi Zhu, and Jiawei Han, “An Exploration of Discussion Threads in Social News Sites: A Case Study of the Reddit Community", Proc. of 2013 IEEE/ACM Int. Conf. on Social Networks Analysis and Mining (ASONAM'13), Niagara Falls, Canada, Aug. 2013

20.  Quanquan Gu, Charu Aggarwal, Jialu Liu, and Jiawei Han, “Selective Sampling on Graphs for Classification", Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

21.  Lu-An Tang, Xiao Yu, Quanquan Gu, Jiawei Han, Alice Leung, and Thomas La Porta, “Mining Lines in the Sand: On Trajectory Discovery From Untrustworthy Data in Cyber-Physical System", Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

22.  Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, and Jiawei Han, “A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy", Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

23.  Yang Li (UCSB), Chi Wang (UIUC), Fangqiu Han (UCSB), Jiawei Han (UIUC), Dan Roth (UIUC), Xifeng Yan (UCSB), “Mining Evidences for Named Entity Disambiguation", Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

24.  Marina Danilevsky, Chi Wang, Fangbo Tao, Son Nguyen, Gong Chen, Nihit Desai, and Jiawei Han, “AMETHYST: A System for Mining and Exploring Topical Hierarchies in Information Networks", (system demo) Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

25.  Fangbo Tao, Kin Hou Lei, Jiawei Han, ChengXiang Zhai, Xiao Cheng, Marina Danilevsky, Nihit Desai, Bolin Ding, Jing Ge, Heng Ji, Rucha Kanade, Anne Kao, Qi Li, Yanen Li, Cindy Xide Lin, Jialu Liu, Nikunj Oza, Ashok Srivastava, Rod Tjoelker, Chi Wang, Duo Zhang, and Bo Zhao, “EventCube: Multi-Dimensional Search and Mining of Structured and Text Data”, (system demo) Proc. of 2013 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug. 2013.

26.  Hongzhao Huang, Zhen Wen, Dian Yu, Heng Ji, Yizhou Sun, Jiawei Han and He Li, “Resolving Entity Morphs in Censored Data", Proc. of 2013 Annual Meeting of the Association for Computational Linguistics (ACL'13), Sofia, Bulgaria, Aug. 2013

27.  Jialu Liu, Chi Wang, Marina Danilevsky, and Jiawei Han, “Large-Scale Spectral Clustering on Graphs”, Proc. of 2013 Int. Joint Conf. on Artificial Intelligence (IJCAI'13), Beijing, China, August 2013.

28.  Guo-Jun Qi, Charu C. Aggarwal, Jiawei Han, and Thomas Huang, “Mining Collective Intelligence in Groups”, Proc. of 2013 Int. Conf. on World Wide Web (WWW'13), Rio de Janeiro, Brazil, May 2013, pp. 1041-1052.

29.  Quanquan Gu and Jiawei Han, “Clustered Support Vector Machine”, Proc. 2013 Int. Conf. on Artificial Intelligence and Statistics (AISTATS'13), Scottsdale, AZ, Apr. 2013.

30.  Quanquan Gu, Charu Aggarwal and Jiawei Han, “Unsupervised Link Selection in Networks”, Proc. 2013 Int. Conf. on Artificial Intelligence and Statistics (AISTATS'13), Scottsdale, AZ, Apr. 2013.

31.  Hongbo Deng, Jiawei Han, Hao Li, Heng Ji, Hongning Wang, and Yue Lu, “Exploring and Inferring User-User Pseudo-Friendship for Sentiment Analysis with Heterogeneous Networks”, Proc. of 2013 SIAM Data Mining Conf. (SDM'13), Austin, TX, May 2013.

32.  Jialu Liu, Chi Wang, Jing Gao, and Jiawei Han, “Multi-View Clustering via Joint Nonnegative Matrix Factorization", Proc. of 2013 SIAM Data Mining Conf. (SDM'13), Austin, TX, May 2013.

33.  Chi Wang, Hongning Wang, Jialu Liu, Ming Ji, Lu Su, Yuguo Chen, Jiawei Han, “On the Detectability of Node Grouping in Networks", Proc. of 2013 SIAM Data Mining Conf. (SDM'13), Austin, TX, May 2013.

34.  Ling Chen, Xue Li, and Jiawei Han, “MedRank: Discovering Influential Medical Treatments from Literature by Information Network Analysis", Proc. 2013 Australasian Database Conf. (ADC'13), Adelaide, South Australia, Jan. 2013.

35.  Zhenhui Li, Jingjing Wang, and Jiawei Han, "Mining Periodicity for Sparse and Incomplete Event Data", Proc. of 2012 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'12), Beijing, China, Aug. 2012

36.  Jingjing Wang and Bhaskar Prabhala, "Periodicity Based Next Place Prediction", Proc. of Workshop on Mobile Data Challenge by Nokia, Newcastle, UK, June 2012

37.  Liangliang Cao, John Smith, Zhen Wen, Zhijun Yin, Xin Jin, and Jiawei Han, "BlueFinder: Estimate Where a Beach Photo Was Taken" (poster paper), Proc. of 2012 Int. Conf. on World Wide Web (WWW'12), Lyon, France, Apr. 2012.

38.  Manish Gupta, Peixiang Zhao, and Jiawei Han, "Evaluating Event Credibility on Twitter", Proc. 2012 SIAM Int. Conf. on Data Mining (SDM'12), Anaheim, CA, April 2012.

39.  Quanquan Gu, Marina Danilevsky, Zhenhui Li, and Jiawei Han, “Locality Preserving Feature Learning”, Proc. 2012 Int. Conf. on Artificial Intelligence and Statistics (AISTATS'12), La Palma, Canary Islands, April 2012.

40.  Lu-An Tang, Yu Zheng, Jing Yuan, Jiawei Han, Alice Leung, Chih-Chieh Hung, and Wen-Chih Peng, "On Discovery of Traveling Companions from Streaming Trajectories", Proc. 2012 IEEE Int. Conf. on Data Engineering (ICDE'12), Arlington, VA, Apr. 2012.

41.  Lu-An Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Yizhou Sun, Wen-Chih Peng, Hector Gonzalez, Sebastian Seith, "Multidimensional Analysis of Atypical Events in Cyber-Physical Data", Proc. 2012 IEEE Int. Conf. on Data Engineering (ICDE'12), Arlington, VA, Apr. 2012.

42.  Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang Zhai, and Thomas Huang, "LPTA: A Probabilistic Model for Latent Periodic Topic Analysis", Proc. 2011 IEEE Int. Conf. on Data Mining (ICDM'11), Vancouver, Canada, Dec. 2011.

43.  Xin Jin, Chi Wang, Jiebo Luo and Jiawei Han, "LikeMiner: A System for Mining the Power of ‘Like’ in Social Media Networks", Proc. of 2011 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'11), (system demo), San Diego, Aug. 2011.

44.  Quanquan Gu, Zhenhui Li, and Jiawei Han, “Linear Discriminant Dimensionality Reduction”,  Proc. 2011 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'11), Athens, Greece, Sept. 2011

45.  Sangkyum Kim, Marina Barsky, and Jiawei Han, “Efficient Mining of Top Correlated Patterns Based on Null-Invariant Measures”, Proc. 2011 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'11), Athens, Greece, Sept. 2011

46.  Quanquan Gu, Zhenhui Li, and Jiawei Han, “Generalized Fisher Score for Feature Selection”, Proc. of 2011 Int. Conf. on Uncertainty in Artificial Intelligence (UAI'11), Barcelona, Spain, July 2011.

47.  Hongbo Deng, Jiawei Han, Bo Zhao, Yintao Yu, Cindy Xide Lin, "Probabilistic Topic Models with Biased Propagation on Heterogeneous Information Networks", Proc. of 2011 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'11), San Diego, Aug. 2011.

48.  Ming Ji and Jiawei Han, "Ranking-Based Classification of Heterogeneous Information Networks", Proc. of 2011 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'11), San Diego, Aug. 2011.

50.  Manish Gupta, Charu C. Aggarwal, and Jiawei Han, “Finding Top-k Shortest Path Distance Changes in an Evolutionary Network”, Proc. of 2011 Int. Symp. on Spatial and Temporal Databases (SSTD'11), Minneapolis, MN, Aug. 2011.

51.  Zhenhui Li, Cindy Xide Lin, Bolin Ding, and Jiawei Han, “Mining Significant Time Intervals for Relationship Detection”, Proc. of 2011 Int. Symp. on Spatial and Temporal Databases (SSTD'11), Minneapolis, MN, Aug. 2011.

52.  Lu-An Tang, Yu Zheng, Xing Xie, Jing Yuan, Xiao Yu, Jiawei Han, “Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations”, Proc. of 2011 Int. Symp. on Spatial and Temporal Databases (SSTD'11), Minneapolis, MN, Aug. 2011.

53.  Quanquan Gu, Zhenhui Li, and Jiawei Han, "Learning a Kernel for Multi-Task Clustering", Proc. of 2011 AAAI Conf. on Artificial Intelligence (AAAI'11), San Francisco, CA, Aug. 2011.

54.  Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, and Tianyi Wu, “PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks”, Proc. of 2011 Int. Conf. on Very Large Data Bases (VLDB'11), Seattle, WA, Aug. 2011.

55.  Feida Zhu, Qiang Qu, David Lo, Xifeng Yan, Jiawei Han, and Philip S. Yu, “Mining Top-K Large Structural Patterns in a Massive Network”, Proc. of 2011 Int. Conf. on Very Large Data Bases (VLDB'11), Seattle, WA, Aug. 2011.

56.  Liwen Sun, Reynold Cheng, Xiang Li, David W. Cheung, and Jiawei Han, “On Link-Based Similarity Join”, Proc. of 2011 Int. Conf. on Very Large Data Bases (VLDB'11), Seattle, WA, Aug. 2011.

57.  Manish Gupta, Charu Aggarwal, Jiawei Han and Yizhou Sun, "Evolutionary Clustering and Analysis of Bibliographic Networks", Proc. of 2011 Int. Conf. on Advances in Social Network Analysis and Mining (ASONAM'11), Kaohsiung, Taiwan, July 2011.

58.  Yizhou Sun, Rick Barber, Manish Gupta, Charu Aggarwal and Jiawei Han, "Co-Author Relationship Prediction in Heterogeneous Bibliographic Networks", Proc. of 2011 Int. Conf. on Advances in Social Network Analysis and Mining (ASONAM'11), Kaohsiung, Taiwan, July 2011.

59.  Xiao Yu, Ang Pan, Lu-An Tang, Zhenhui Li and Jiawei Han, "Geo-Friends Recommendation in GPS-based Cyber-Physical Social Network", Proc. of 2011 Int. Conf. on Advances in Social Network Analysis and Mining (ASONAM'11), Kaohsiung, Taiwan, July 2011.

60.  Hongbo Deng, Jiawei Han, Bo Zhao, "Collective Topic Modeling for Heterogeneous Networks", Proc. of 2011 Int. ACM SIGIR Conf. on Research & Development in Information Retrieval (SIGIR'11), Beijing, China, July 2011. (poster paper) 

61.  Ming Ji, Jun Yan, Xiaofei He, Jiawei Han, Siyu Gu, "Learning Search Tasks in Queries and Web Pages via Graph Regularization", Proc. of 2011 Int. ACM SIGIR Conf. on Research & Development in Information Retrieval (SIGIR'11), Beijing, China, July 2011.

62.  Sangkyum Kim, Hyungsul Kim, Jiawei Han, Tim Weninger, Hyun Duk Kim, "Authorship Classification: A Discriminative Syntactic Tree Mining Approach", Proc. of 2011 Int. ACM SIGIR Conf. on Research & Development in Information Retrieval (SIGIR'11), Beijing, China, July 2011.

63.  Chi Wang, Jiawei Han, Rajat Raina, David Fong, Ding Zhou, "Learning Relevance in a Heterogeneous Social Network and Its Application in Online Targeting", Proc. of 2011 Int. ACM SIGIR Conf. on Research & Development in Information Retrieval (SIGIR'11), Beijing, China, July 2011.

64.  Quanquan Gu, Zhenhui Li and Jiawei Han, “Joint Feature Selection and Subspace Learning”, Proc. of 2011 Int. Joint Conf. on Artificial Intelligence (IJCAI'11), Barcelona, Spain, July 2011.

65.  Quanquan Gu, Chris Ding and Jiawei Han, “On Trivial Solution and Scale Transfer Problems in Graph Regularized NMF", Proc. of 2011 Int. Joint Conf. on Artificial Intelligence (IJCAI'11), Barcelona, Spain, July 2011.

66.  Peixiang Zhao, Xiaolei Li, Dong Xin, and Jiawei Han, “Graph Cube: On Warehousing and OLAP Multidimensional Networks”, Proc. of 2011 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'11), Athens, Greece, June 2011

67.  Bolin Ding, Marianne Winslett, Jiawei Han, and Zhenhui Li, “Differentially Private Data Cube: Optimizing Noise Source and Consistency”, Proc. of 2011 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'11), Athens, Greece, June 2011

68.  Tim Weninger, Marina Danilevsky, Fabio Fumarola, Joshua Hailpern, Jiawei Han, Ming Ji, Thomas J. Johnston, Surya Kallumadi, Hyungsul Kim, Zhijin Li, David McCloskey, Yizhou Sun, Nathan E. TeGrotenhuis, Chi Wang, and Xiao Yu, “WinaCS: Construction and Analysis of Web-Based Computer Science Information Networks", Proc. of 2011 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'11), (system demo paper), Athens, Greece, June 2011.

69.  Zhijun Yin, Liangliang Cao, Jiawei Han, Jiebo Luo, and Thomas Huang, “Diversified Trajectory Pattern Ranking in Geo-tagged Social Media”, Proc. of 2011 SIAM Conf. on Data Mining (SDM'11), Phoenix, AZ, Apr. 2011.

70.  Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang Zhai, and Thomas Huang, “Geographical Topic Discovery and Comparison”, Proc. of 2011 Int. World Wide Web Conf. (WWW'11), Hyderabad, India, Mar. 2011 (Full paper).

71.  Tim Weninger, Fabio Fumarola, Cindy Xide Lin, Rick Barber, Jiawei Han, and Donato Malerba, “Growing Parallel Paths for Entity-Page Discovery”, Proc. of 2011 Int. World Wide Web Conf. (WWW'11), Hyderabad, India, Mar. 2011 (Poster paper).

72.  Manish Gupta, Yizhou Sun, and Jiawei Han, “Trust Analysis with Clustering”, Proc. of 2011 Int. World Wide Web Conf. (WWW'11), Hyderabad, India, Mar. 2011 (Poster paper).

73.  Heli Sun, Jianbin Huang, Jiawei Han, Hongbo Deng, Peixiang Zhao, and Boqin Feng, “gSkeleton-Clu: Density-based Network Clustering via Structure-Connected Tree Division or Agglomeration”, Proc. of 2010 Int. Conf. on Data Mining (ICDM'10), Sydney, Australia, Dec. 2010.

74.  Lu-An Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Chih-Chieh Hung, and Wen-Chih Peng, “Tru-Alarm: Trustworthiness Analysis of Sensor Networks in Cyber-Physical Systems”, Proc. of 2010 Int. Conf. on Data Mining (ICDM'10), Sydney, Australia, Dec. 2010.

75.  Jianbin Huang, Heli Sun, Jiawei Han, Hongbo Deng, Yizhou Sun, and Yaguang Liu, “SHRINK: A Structural Clustering Algorithm for Detecting Hierarchical Communities in Networks”, Proc. 2010 ACM Int. Conf. on Information and Knowledge Management (CIKM'10), Toronto, Canada, Oct. 2010.

76.  Lu Liu, Jie Tang, Jiawei Han, Meng Jiang, Shiqiang Yang, “Mining Topic-Level Influence in Heterogeneous Networks”, Proc. 2010 ACM Int. Conf. on Information and Knowledge Management (CIKM'10), Toronto, Canada, Oct. 2010.

77.  Xin Jin, Andrew Gallagher, Liangliang Cao, Jiebo Luo, and Jiawei Han, “The Wisdom of Social Multimedia: Using Flickr for Prediction and Forecast”, Proc. 2010 ACM Multimedia Int. Conf. (ACM-Multimedia'10), Florence, Italy, Oct. 2010.

78.  Ming Ji, Yizhou Sun, Marina Danilevsky, Jiawei Han, and Jing Gao, “Graph Regularized Transductive Classification on Heterogeneous Information Networks”, Proc. 2010 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'10), Barcelona, Spain, Sept. 2010.

79.  Hyung Sul Kim, Sangkyum Kim, Tim Weninger, Jiawei Han, and Tarek Abdelzaher, “NDPMine: Efficiently Mining Discriminative Numerical Features for Pattern-Based Classification”, Proc. 2010 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'10), Barcelona, Spain, Sept. 2010.

 

Ph.D. Dissertations

 

1.      Lu An Tang, Ph.D., August 2013, thesis title: “Mining Sensor and Mobility Data in Cyber Physical Systems”, link to Ph.D. dissertation

2.      Manish Gupta, Ph.D., March 2013, thesis title: “Outlier Detection for Information Networks”, link to Ph.D. dissertation

3.      Zhenhui Li, Ph.D., Sept. 2012, thesis title: “Mining Periodicity and Object Relationship in Spatial and Temporal Data”, link to Ph.D. dissertation

 

Project Impact

 

§  Education:  Parts of the new research results are used in the Data Mining courses (CS412, CS512) taught to undergraduate and graduate students in the Department of Computer Science at the University of Illinois at Urbana-Champaign.  Moreover, the research results have been, and will continue to be, published promptly in international conferences and journals and distributed worldwide for education and research.  The new progress will also be integrated into the new edition of our data mining textbook and other research collections.

§  Collaborations:  For this project we have established collaborations with Boeing, ARL, NASA, HP Labs, IBM T.J. Watson Research Center, Yahoo! Research, Microsoft Research, and NCSA (National Center for Supercomputing Applications).  Through these collaborations we expect to gain access to real datasets and applications and to produce more research results.

 

Current and Future Activities

The following are some of the highlights of our ongoing work.  Please refer to the Publications and Products section for related references.

1.      Study object movement mining in the context of cyber-physical networks

2.      Study efficient methods for mining movement patterns more sophisticated than those handled by the state of the art

3.      Study methods for anomaly detection for moving objects in sensor network environments

Area Background

 

This project builds on previous research on data mining, spatiotemporal data analysis, and data cube and multidimensional analysis.  Many research papers have been published on these themes.  Several textbooks on data mining, information retrieval, and information network analysis provide good overviews of the principles and algorithms, including (Han and Kamber, 2006), (Hastie, Tibshirani, and Friedman, 2nd ed., 2009), and (Miller and Han, 2009).

 

Area References

·         Ralf Hartmut Güting and Markus Schneider, Moving Objects Databases, Morgan Kaufmann, 2005.

·         Jiawei Han, Micheline Kamber, and Jian Pei, Data Mining: Concepts and Techniques, 3rd edition, Morgan Kaufmann, 2011.

·         Hillol Kargupta, Jiawei Han, Philip Yu, Rajeev Motwani, and Vipin Kumar (eds.), Next Generation of Data Mining (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series), Taylor & Francis, 2008.

·         Harvey Miller and Jiawei Han (eds.), Geographical Data Mining and Knowledge Discovery, 2nd edition, Taylor & Francis, 2009.

·         Philip S. Yu, Jiawei Han, and Christos Faloutsos (eds.), Link Mining: Models, Algorithms, and Applications, Springer, 2010.

 

Potential Related Projects

·         Information Network Analysis and Discovery (Information Network Academic Research Center: Network Science-Collaborative Technology Alliance) (NSF IIS Infonet Project)

·         Knowledge Discovery in Cyberphysical Systems (NSF/CPS)

·         Sequential and Structured Pattern Discovery: Classification, Clustering and Outlier Analysis

·         Discovery of the Dynamics of Data Streams in Multi-Dimensional Space

·         Multidimensional Analysis and Ranking in Databases, Web, and Other Information Repositories

Project Web site URL:  http://www.cs.uiuc.edu/~hanj/projs/movemine.htm

Online software:  Software can be downloaded at http://illimine.cs.uiuc.edu, and an online system demo is available at http://dm.cs.uiuc.edu/movemine

Online resources:  Research publications related to this project can be downloaded from the Selected Publications page