Wei Wang, Ph. D.
Associate Professor
Computer Science Department
University of North Carolina, Chapel Hill
Chapel Hill, NC 27599
Voice: (919)962-1744
E-mail: weiwang@cs.unc.edu
URL: http://www.cs.unc.edu/~weiwang/


RESEARCH INTEREST       Data Mining, Bioinformatics, Database Systems.
   
EDUCATION      
Jul. 1999 Ph.D., Department of Computer Science, UCLA.
May 1995 M.S., Department of Systems Science and Industrial Engineering, SUNY at Binghamton.
   
WORK EXPERIENCE      
2005 - present Associate Professor at Department of Computer Science, University of North Carolina at Chapel Hill
2002 - 2005 Assistant Professor at Department of Computer Science, University of North Carolina at Chapel Hill
1999 - 2002 Research Staff Member at IBM T.J. Watson Research Centers
   
HONORS AND AWARDS    Best Student Paper Award, ICDE 2008 for the paper "CARE: finding local linear correlations in high dimensional data".
Phillip and Ruth Hettleman Prize for Artistic and Scholarly Achievement, UNC, 2007.
Microsoft Research New Faculty Fellow, Microsoft, 2005.
Faculty Early Career Development (CAREER) Award, NSF, 2005.
Junior Faculty Development Award, UNC, 2003.
Invention Achievement Award, IBM, 2001.
Invention Achievement Award, IBM, 2000.
Dean's Graduate Fellowship, UCLA, 1999.
Adjudged one of the best papers of ICDE 1999 for the paper "STING+: an approach to active spatial data mining".
NCR Graduate Fellowship, 1997 - 1998.
Outstanding Academic Achievement awarded by SUNY at Binghamton, May 1995.
Distinguished Student awarded by Nankai University, 1993.
Fellowships, Nankai University, 1991 - 1993.
   
PROFESSIONAL ACTIVITIES    Associate Editor of the ACM Transactions on Knowledge Discovery in Data (2005 - present)
Guest Editor of the ACM Transactions on Knowledge Discovery in Data Special Issue on Bioinformatics (2007)
Associate Editor of the Knowledge and Information Systems (2007 - present)
Editorial Board Member of the Open Artificial Intelligence Journal (2007 - present)
Editorial Board Member of the International Journal of Data Mining and Bioinformatics (2005 - present)
Associate Editor of the IEEE Transactions on Knowledge and Data Engineering (2003 - 2007)
Editorial Board Member of the Journal of Database Management (2000 - 2005)
Guest Editor of the IEEE Transactions on Knowledge and Data Engineering Special Issue on Mining Biological Data vol. 17 no. 8 (2005)
Intensive Working Group Member of the ACM SIGKDD Curriculum Committee (2003 - present)
Panelist of the NIH BDMA program (2007)
Panelist of the NIH CSR program (2007)
Panelist of the NIH System Biology program (2007)
Panelist of the NSF IIS program (2007)
Panelist of the NIH BDMA program (2006)
Panelist of the NIH CSR program (2006)
Panelist of the NSF EMT program (2006)
Panelist of the EPA SBIR program on Computational Toxicology (2005)
Panelist of the NSF SEIII program (2005)
Panelist of the NSF BDI program (2005)
Panelist of the NSF BDI program (2004)
Panelist of the NSF ITR Medium Award (2003)
Program Committee Co-Chair of the 8th IEEE International Conference on Data Mining (2009)
Program Committee Member of the 35th International Conference on Very Large Data Bases (2009)
Publicity Co-Chair of the SIAM International Conference on Data Mining (2009)
Vice Chair of the 25th International Conference on Data Engineering (2009)
Program Committee Member of the 8th IEEE International Conference on Data Mining (2008)
Program Committee Member of the 17th ACM Conference on Information and Knowledge Management (2008)
Program Committee Member of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2008)
Program Committee Member of the 34th International Conference on Very Large Data Bases (2008)
Program Committee Member of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008)
Program Committee Member of the 8th International Workshop on Data Mining in Bioinformatics (2008)
Area Chair of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2008)
Proceedings Chair and Program Committee Member of the SIAM International Conference on Data Mining (2008)
Program Committee Member of the 24th IEEE International Conference on Data Engineering (2008)
Program Committee Member of the 13th International Conference on Database Systems for Advanced Applications (2008)
Program Committee member of the 16th ACM Conference on Information and Knowledge Management (2007)
General Co-chair of the 2nd International Workshop on Data and Text Mining in Bioinformatics in Conjunction with the 16th ACM Conference on Information and Knowledge Management (2007)
Vice Chair of the 7th IEEE International Conference on Data Mining (2007)
Program Committee Co-chair of the Workshop on Mining and Management of Biological Data, in Conjunction with the 7th IEEE International Conference on Data Mining (2007)
Program Committee member of the 2nd Workshop on Data Mining ihn Bioinformatics in Conjunction with the 33rd International Conference on Very Large Data Bases (2007)
Program Committee Member of the 9th International Conference on Data Warehousing and Knowledge Discovery (2007)
Program Committee Member of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2007)
Area Chair of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (2007)
Program Committee Member of the SIAM International Conference on Data Mining (2007)
Program Committee Member of the 12th International Conference on Database Systems for Advanced Applications (2007)
Program Committee Member of the 6th IEEE International Conference on Data Mining (2006)
Program Committee Member of the 13th International Conference on Management of Data (2006)
Program Committee Member of the 15th ACM Conference on Information and Knowledge Management (2006)
Program Committee Member of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (2006)
Program Committee Member of the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the Ph.D. Workshop in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the Workshop on Data Mining in Bioinformatics in Conjunction with the 32nd International Conference on Very Large Data Bases (2006)
Program Committee Member of the 8th International Conference on Data Warehousing and Knowledge Discovery (2006)
Senior Program Committee Member of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Program Committee Member of the 6th International Workshop on Data Mining in Bioinformatics in Conjunction with 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Program Committee Member of the 2nd International Conference on Advanced Data Mining and Applications (2006)
Program Committee Member of the 11th International Conference on Database Systems for Advanced Applications (2006)
Program Committee Member of the 22nd IEEE International Conference on Data Engineering (2006)
Program Committee Member of the International Conference on Semantics of a Networked World (2006)
Program Committee Member of the 10th International Conference on Extending DataBase Technology (2006)
Program Committee Member of the 4th Asia-Pacific Bioinformatics Conference (2006)
Program Committee Member of the 5th IEEE International Conference on Data Mining (2005)
Program Committee Member of the 5th IEEE Symposium on Bioinformatics and Bioengineering (2005)
Program Committee Member of the 6th International Conference on Web-Age Information Management (2005)
Program Committee Member of the 31st International Conference on Very Large Data Bases (2005)
Program Committee Member of the Ph.D. Workshop at the 31st International Conference on Very Large Data Bases (2005)
Program Committee Member of the 3rd International Workshop on Biological Data Management in Conjunction with the 16th International Conference on Database and Expert Systems Applications (2005)
Program Committee Member of 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Program Committee Co-chair of the 5th Workshop on Data Mining in Bioinformatics in Conjunction with the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Program Committee Member of the 1st International Conference on Advanced Data Mining and Applications (2005)
Program Committee Member of the IEEE Workshop on Computer Vision methods for Bioinformatics in Conjunction with IEEE International Conference on Computer Vision and Pattern Recognition (2005)
Program Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
Corporate Sponsor Committee Member of the ACM SIGMOD International Conference on Management of Data (2005)
Program Committee Member of the 7th Asia Pacific Web Conference (2005)
Program Committee Member of the ACM Symposium on Applied Computing (2005)
Scientific Committee Member of the International Conference on Computational and Information Sciences (2004)
Program Committee Member of the 13th ACM Conference on Information and Knowledge Management (2004)
Program Committee Member of the 4th IEEE International Conference on Data Mining (2004)
Program Committee Member of the ICDM'04 Workshop on Life Sciences Data Mining (2004)
Program Committee Member of the 1st International Workshop on Knowledge Discovery in Data Streams in conjunction with the 15th European Conference on Machine Learning (2004)
Program Committee Member of the 2nd International Workshop on Biological Data Management in conjunction with the 15th International Conference on Database and Expert Systems Applications (2004)
Program Committee Member of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Program Committee Member of the 4th Workshop on Bioinformatics in Data Mining in conjunction with the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Program Committee Member of the 5th International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (2004)
Program Committee Member of the 2nd International Conference on Software Engineering Research, Management & Applications (2004)
Program Committee Member of the 6th Asia Pacific Web Conference (2004)
Scientific Committee Member of the IADIS International Conference on Applied Computing (2004)
Program Committee Member of the ACM Symposium on Applied Computing (2004)
Proceedings Chair of the 4th International Conference on Web-Age Information Management (2003)
Program Committee Member of the 4th International Conference on Web-Age Information Management (2003)
Program Committee Member of the 15th International Conference on Scientific and Statistical Database Management (2003)
Program Committee Member of the International Workshop on Mining Spatial and Temporal Data (2001)
Session Chair of the 24th IEEE International Conference on Data Engineering (2008)
Session Chair of the 7th IEEE International Conference on Data Mining (2007)
Session Chair of the SIAM International Conference on Data Mining (2007)
Session Chair of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Session Chair of the 22nd IEEE International Conference on Data Engineering (2006)
Session Chair of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2005)
Session Chair of the ACM SIGMOD International Conference on Management of Data (2005)
Session Chair of the 4th IEEE International Conference on Data Mining (2004)
Session Chair of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004)
Session Chair of the 3rd SIAM International Conference on Data Mining (2002)
Session Chair of the 1st IEEE International Conference on Data Mining (2001)
Referee for ACM SIGMOD, ACM SIGMETTRICS, VLDB, ACM SIGKDD, ICDE, FODO conferences (1997-present)
   
SERVICES  Undergraduate Review Committee Member, UNC (2004)
Graduate Admission Committee Member, UNC (2003-present)
Mentor for Student Summer Intern, IBM (2000-2001)
Mentor for Graduate Students, UCLA (1998-1999)
   
PUBLICATIONS

ARTICLES IN REFEREED CONFERENCES
  1. Genotype Sequence Segmentation: Handling Constraints and Noise, by Qi Zhang, Wei Wang, Leonard McMillan, Jan Prins Fernando Pardo-Manuel de Villena, and David Threadgill. Proceedings of the 8th Workshop on Algorithms in Bioinformatics (WABI), 2008.
  2. Mining non-redundant high order correlations in binary data, by Xiang Zhang, Feng Pan, Wei Wang, and Andrew Nobel. Proceedings of the 34th International Conference on Very Large Data Bases (VLDB), 2008.
  3. FastANOVA: an efficient algorithm for genome-wide association study, by Xiang Zhang, Fei Zou, and Wei Wang. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2008.
  4. CRD: a general framework for fast co-clustering on large datasets utilizing sample-based matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 173-184, 2008.
  5. CARE: finding local linear correlations in high dimensional data, by Xiang Zhang, Feng Pan, and Wei Wang. Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 130-139, 2008.
  6. Mining approximate order preserving clusters in the presence of noise, by Mengsheng Zhang, Wei Wang, and Jinze Liu. Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 160-168, 2008.
  7. Approximate clustering on distributed data streams, by Qi Zhang, Jinze Liu, and Wei Wang. Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1131-1139, 2008.
  8. A general framework for fast co-clustering on large datasets using matrix decomposition, by Feng Pan, Xiang Zhang, and Wei Wang. Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE), pp. 1337-1339, 2008.
  9. Sample selection for maximal diversity, by Feng Pan, Adam Roberts, Leonard McMillan, Fernando Pardo Manuel de Villena, David Threadgill, and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 262-271, 2007.
  10. Incremental subspace clustering over multiple data streams, by Qi Zhang, Jinze Liu, and Wei Wang. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 727-732, 2007.
  11. Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows, by Adam Roberts, Leonard McMillan, Wei Wang, Joel Parker, Ivan Rusyn, and David Threadgill, Proceedings of the 15th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Bioinformatics, vol. 23, no. 13, pp. i401-i407, 2007.
  12. An efficient algorithm for mining coherent patterns from heterogeneous Microarrays, by Xiang Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 32, 2007.
  13. A fast algorithm for approximate quantiles in high speed data streams, by Qi Zhang and Wei Wang. Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 29, 2007.
  14. Mining RNA tertiary motifs with structure graphs, by Xueyi Wang, Jun Huan, Jack Snoeyink, and Wei Wang, Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 31, 2007.
  15. Intelligent sequential pattern mining via alignment --- optimization techniques for very large databases, by Hye-Chung Kum, Joong Hyuk Chang, and Wei Wang. Proceedings of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 587-597, 2007.
  16. On demand phenotype ranking through subspace clustering, by Xiang Zhang, Wei Wang, and Jun Huan. Proceedings of the 7th SIAM Conference on Data Mining (SDM), 2007.
  17. Poclustering: lossless clustering of dissimilarity data, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins. Proceedings of the 7th SIAM Conference on Data Mining (SDM), 2007.
  18. Graph database indexing using structured graph decomposition, by David Williams, Jun Huan, and Wei Wang. Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 976-985, 2007.
  19. Accelerating profile queries in elevation maps, by Feng Pan, Wei Wang, and Leonard McMillan. Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE), pp., 76-85, 2007.
  20. Mining coherent patterns from heterogeneous microarray data, by Xiang Zhang, and Wei Wang. Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), pp. 838-839, 2006.
  21. Clustering pair-wise dissimilarity data into partially ordered sets, by Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins. Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 637-642, 2006.
  22. Distance-based identification of spatial motifs in proteins using constrained frequent subgraph mining, by Jun Huan, Deepak Bandyopadhyay, Jan Prins, Jack Snoeyink, Alexander Tropsha, and Wei Wang. Proceedings of the LSS Computational Systems Bioinformatics Conference (CSB), pp. 227-238, 2006.
  23. A fast approximation to multidimensional scaling, by Tynia Yang, Jinze Liu, Leonard McMillan, and Wei Wang. Proceedings of the ECCV Workshop on Computation Intensive Methods for Computer Vision (CIMCV), 2006.
  24. Mining Approximate frequent itemset in the presence of noise: algorithm and analysis, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 6th SIAM Conference on Data Mining (SDM), pp. 405-416, 2006.
  25. Mining shifting-and-scaling co-regulation patterns on gene expression profiles, by Xin Xu, Anthony K. H. Tung, Ying Lu, and Wei Wang. Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE), pp. 89 (10 pages), 2006.
  26. Human motion estimation from a reduced marker set, by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan. Proceedings of the Symposium on Interactive 3D Graphics and Games (SI3D), pp. 35-42, 2006.
  27. Finding representative set from massive data, by Feng Pan, Wei Wang, Anthony K. H. Tung, and Jiong Yang. Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 338-345, 2005.
  28. Mining approximate frequent itemset from noisy data, by Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, and Jan Prins. Proceedings of the 5th IEEE International Conference on Data Mining (ICDM), pp. 721-724, 2005.
  29. Rapid determination of local structural features common to a set of proteins (demo), by Jun Huan, Deepak Bandyopadhyay, Jinze Liu, Jan Prins, Jack Snoeyink, Alexander Tropsha, and Wei Wang. Proceedings of the 13th International Conference on Intelligent Systems for Molecular Biology (ISMB), 2005.
  30. A system for analyzing and indexing human motion databases (demo), by Guodong Liu, Jingdan Zhang, Wei Wang, and Leonard McMillan. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 924-926, 2005.
  31. Revealing true subspace clusters in high dimensions, by Jinze Liu, Karl Strohmaier, and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 463-466, 2004.
  32. AGILE: a general approach to detect transitions in evolving data streams, by Jiong Yang and Wei Wang. Proceedings of the 4th IEEE International Conference on Data Mining (ICDM), pp. 559-562, 2004.
  33. A framework for ontology-driven subspace clustering, by Jinze Liu, Wei Wang, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 623-628, 2004.
  34. SPIN: Mining maximal frequent subgraphs from graph databases, by Jun Huan, Wei Wang, Jan Prins, and Jiong Yang. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 581-586, 2004.
  35. Gene ontology friendly biclustering of expression profiles, by Jinze Liu, Jiong Yang, and Wei Wang. Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 436-447, 2004.
  36. Biclustering of gene expression data by tendency, by Jinze Liu, Jiong Yang, and Wei Wang. Proceedings of the IEEE Computational Systems Bioinformatics Conference (CSB), pp. 182-193, 2004.
  37. BASS: approximate search on large string databases, by Jiong Yang, Wei Wang, and Philip Yu. Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM), pp. 181-192, 2004.
  38. Fast computation of database operations using graphics processors, by Naga Govindaraju, Brandon Lloyd, Wei Wang, Ming Lin, and Dinesh Manocha. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 215-226, 2004.
  39. Understanding social welfare service patterns using sequential analysis, by Hye-Chung Kum, Dean Duncan, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  40. Successfully adopting IT for social welfare program management, by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  41. Successfully adopting IT for social welfare program management (demo), by Dean Duncan, Hye-Chung Kum, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), 2004.
  42. Mining spatial motifs from protein structure graphs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alex Tropsha. Proceedings of the 8th Annual International Conference on Research in Computational Molecular Biology (RECOMB), pp. 308-315, 2004.
  43. Accurate classification of protein structural families using coherent subgraph analysis, by Jun Huan, Wei Wang, Anglina Washington, Jan Prins, Ruchir Shah, and Alex Tropsha. Proceedings of the Pacific Symposium on Biocomputing (PSB), pp. 411-422, 2004.
  44. OP-Cluster: clustering by tendency in high dimensional space, by Jinze Liu and Wei Wang. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 187-194, 2003.
  45. Efficient mining of frequent subgraph in the presence of isomorphism, by Jun Huan, Wei Wang, and Jan Prins. Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM), pp. 549-552, 2003.
  46. Discovering compact and highly discriminative features or feature combinations of drug activities using support vector machines, by Hwanjo Yu, Jiong Yang, Wei Wang, and Jiawei Han. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 220-228, 2003.
  47. Reconstructing of ancestral gene order after segmental duplication and gene loss, by Jun Huan, Jan Prins, Wei Wang, and Todd Vision. Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), pp. 484-485, 2003.
  48. Social welfare program administration and evaluation and policy analysis using knowledge discovery and data mining (KDD) on administrative data, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 39-44, 2003.
  49. Management assistance for Work First via a dynamic website, by Hye-Chung Kum, Dean Duncan, Kimberly Flair, and Wei Wang. Proceedings of the NSF National Conference on Digital Government Research (DG.O), pp. 296, 2003.
  50. STAMP: discovery of statistically important pattern repeats in a long sequence, by Jiong Yang, Wei Wang, and Philip Yu. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 224-238, 2003.
  51. ApproxMAP: approximate mining of consensus sequential patterns, by Hye-Chung Kum, Jian Pei, Wei Wang, and Dean Duncan. Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pp. 311-315, 2003.
  52. Enhanced biclustering on gene expression data, by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu. Proceedings of the 3rd IEEE Conference on Bioinformatics and Bioengineering (BIBE), pp. 321-327, 2003.
  53. CLUSEQ: efficient and effective sequence clustering, by Jiong Yang and Wei Wang, Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE), pp. 101-112, 2003.
  54. InfoMiner+: mining partial periodic patterns with gap penalties, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM), pp. 725-728, 2002.
  55. Comparative study of sequential pattern mining frameworks --- support framework vs. multiple alignment framework, by Hye-Chung Kum, Susan Paulsen, and Wei Wang, Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM) Workshop on the Foundation of Data Mining and Discovery, 2002.
  56. Towards automatic clustering of protein sequences, by Jiong Yang and Wei Wang, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 175-186, 2002.
  57. Accelerating approximate subsequence search on large protein sequence databases, by Jiong Yang, Wei Wang, Yi Xia, and Philip Yu, Proceedings of the 1st IEEE Computer Society Conference on Bioinformatics (CSB), pp. 207-218, 2002.
  58. Mining long sequential patterns in a noisy environment, by Jiong Yang, Wei Wang, Philip Yu, and Jiawei Han, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 406-417, 2002.
  59. Clustering by pattern similarity in large data sets, by Haixun Wang, Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 394-405, 2002.
  60. Accelerating search of approximate match on large protein sequence databases, by Wei Wang, Jiong Yang, Yi Xia, and Philip Yu. Proceedings of the 6th ACM International Conference on Research in Computational Molecular Biology (RECOMB), 2002. (poster)
  61. Improving performance of bicluster discovery in a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu. Proceedings of the 6th ACM International Conference on Research in Computational Molecular Biology (RECOMB), 2002. (poster)
  62. A framework towards efficient and effective protein clustering, by Wei Wang and Jiong Yang. Proceedings of the 6th ACM International Conference on Research in Computational Molecular Biology (RECOMB), 2002. (poster)
  63. Efficient filtering of large data sets --- a user-centric paradigm, by Yi Xia, Wei Wang, Jiong Yang, Philip Yu, and Richard Muntz. Proceedings of the 2nd SIAM International Conference on Data Mining (SDM), pp. 112-127, 2002.
  64. Delta-cluster: capturing subspace correlation in a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip Yu, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 517-528, 2002.
  65. A framework towards efficient and effective sequence clustering, by Wei Wang and Jiong Yang, Proceedings of the 18th IEEE International Conference on Data Engineering (ICDE), pp. 282, 2002.
  66. Meta-patterns: revealing hidden periodical patterns, by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 1st IEEE International Conference on Data Mining (ICDM), pp. 550-557, 2001.
  67. Info-miner: mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 395-400, 2001.
  68. TAR: temporal association rules on evolving numerical attributes, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 17th IEEE International Conference on Data Engineering (ICDE), pp. 283-292, 2001.
  69. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 275-279, 2000.
  70. Efficient mining weighted association rules (WAR), by Wei Wang, Jiong Yang, and Philip Yu, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 270-274, 2000.
  71. Collaborative web caching based on proxy affinities, by Jiong Yang, Wei Wang, and Richard Muntz, Proceedings of the 19th ACM SIGMETRICS Conference on the Measurement and Modeling of Computer Systems (SIGMETRICS), pp. 78-89, 2000.
  72. Dynamic adaptive file management in a local area network, by Jiong Yang, Wei Wang, Richard Muntz, and Silvia Nittel, Proceedings of the 20th IEEE International Conference on Distributed Computer Systems (ICDCS), pp. 368-375, 2000.
  73. STING+: an approach to active spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 15th IEEE International Conference on Data Engineering (ICDE), pp. 116-125, 1999. (Invited to the "Best Papers of ICDE 1999" Special Issue of IEEE Transactions on Knowledge and Data Engineering)
  74. PK-tree: a spatial index structure for high dimensional point data, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 5th International Conference on Foundations of Data Organization (FODO), pp. 27-36, 1998.
  75. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, Proceedings of the 8th International Workshop on Persistent Object Systems (POS8) , 1998.
  76. STING: a statistical information grid approach to spatial data mining, by Wei Wang, Jiong Yang, and Richard Muntz, Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB), pp. 186-195, 1997.
  77. Performance analysis of several algorithms for processing joins between textual attributes, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, Proceedings of the 12th IEEE International Conference on Data Engineering (ICDE), pp. 636-644, 1996.
  78. On fuzzy database systems, by Wei Wang and George Klir, Proceedings of the 5th IEEE Annual Dual-use Technologies & Applications Conference, pp. 330-335, 1995.
  79. The absolute continuity of fuzzy measures, Proceedings of International Joint Conference of the Fourth IEEE International Conference of Fuzzy Systems and Second International Fuzzy Engineering Symposium (FUZZ-IEEE/IFES'95), Japan, pp. 131-136, 1995.
  80. Determining fuzzy measures by Choquet integral, Proceedings of ISUMA-NAFIPS'95, pp. 724-727, 1995.
ARTICLES IN REFEREED JOURNALS
  1. The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics, by Adam Roberts, Fernando Pardo-Manuel de Villena, Wei Wang, Leonard McMillan, and David Threadgill, Mammalian Genome, vol. 18, no. 6, pp. 473-481, 2007.
  2. Benchmarking the effectiveness of sequential pattern mining methods, by Hye-Chung Kum, J. H. Chang, and Wei Wang, Data and Knowledge Engineering, vol. 60, no. 1, pp. 30-50, 2007.
  3. Structure-based function inference using protein family-specific fingerprints, by Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, and Alexander Tropsha. Protein Science, vol. 15, pp. 1537-1543, 2006.
  4. Sequential pattern mining in multi-databases via multiple alignment, by Hye-Chung Kum, Joong-Hyuk Chang, and Wei Wang, Data Mining and Knowledge Discovery (DMKD), vol. 12, no. 2-3, pp. 151-180, 2006.
  5. Comparing graph representations of protein structure for mining family-specific residue-based packing motifs, by Jun Huan, Wei Wang, Deepak Bandyopadhyay, Jack Snoeyink, Jan Prins, and Alexander Tropsha. Journal of Computational Biology (JCB), vol. 12, no. 6, pp. 657-671, 2005.
  6. Guest editors' introduction: special issue on mining biological data, by Wei Wang and Jiong Yang, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 17, no. 8, pp. 1019-1020, 2005.
  7. An improved biclustering method for analyzing gene expression profiles, by Jiong Yang, Haixun Wang, Wei Wang, and Philip Yu, International Journal on Artificial Intelligence Tools (IJAIT), vol. 14, no. 5, pp. 771-789, 2005.
  8. Mining surprising periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Data Mining and Knowledge Discovery (DMKD), vol. 9, no. 2, pp. 189-216, 2004.
  9. Discovering high order periodic patterns, by Jiong Yang, Wei Wang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 3, pp. 243-268, 2004.
  10. WAR: weighted association rules for item intensities, by Wei Wang, Jiong Yang, and Philip Yu. Knowledge and Information Systems Journal (KAIS), vol. 6, no. 2, pp. 203-229, 2004.
  11. Recent progress on selected topics in database research: a report from nine young Chinese researchers working in the United States (invited paper), by Zhiyuan Chen, Chen Li, Jian Pei, Yufei Tao, Haixun Wang, Wei Wang, Jiong Yang, Jun Yang, and Donghui Zhang. Journal of Computer Science and Technology, vol. 18, no. 5, pp. 538 – 552, 2003.
  12. Mining asynchronous periodic patterns in time series data, by Jiong Yang, Wei Wang, and Philip Yu, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, no. 3, pp. 613-628, 2003.
  13. Mining patterns in long sequential data with noise, by Wei Wang, Jiong Yang, and Philip Yu, ACM SIGKDD Explorations, vol. 2, no. 2, pp. 28-33, 2000.
  14. An approach to active spatial data mining based on statistical information, by Wei Wang, Jiong Yang, and Richard Muntz, IEEE Transactions on Knowledge and Data Engineering, Special Issue on Best Papers in the 15th IEEE International Conference on Data Engineering, vol. 12, no. 5, pp. 715-728, 2000.
  15. Dynamo: design, implementation, and evaluation of cooperative persistent object management in a local area network, by Jiong Yang, Wei Wang, Silvia Nittel, Richard Muntz, and Vince Busam, Software - Practice and Experience, vol. 30, no. 4, pp. 419-448, 2000.
  16. Performance analysis of three text-join algorithms, by Weiyi Meng, Clement Yu, Wei Wang, and Naphtali Rishe, IEEE Transactions on Knowledge and Data Engineering, vol. 10, no. 3, pp. 477-492, 1998.
  17. Genetic algorithms for determining fuzzy measures from data, by Wei Wang, Zhenyuan Wang, and George J. Klir, Journal of Intelligent & Fuzzy Systems, vol. 6, no. 2, pp. 171-183, 1998.
  18. Monotone set functions defined by Choquet integral, by Zhenyuan Wang, George J. Klir, and Wei Wang, Fuzzy Sets and Systems, vol. 81, pp. 241-252, 1996.
  19. Fuzzy measures defined by fuzzy integral and their absolute continuity, by Zhenyuan Wang, George J. Klir, and Wei Wang, Journal of Mathematical Analysis and Application, vol. 203, pp. 150-165, 1996.
  20. Constructing fuzzy measures by transformations, by George J. Klir, Zhenyuan Wang, and Wei Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 1, pp. 207-215, 1996.
  21. Constructing fuzzy measures by rational transformations, by Wei Wang, George J. Klir, and Zhenyuan Wang, International Journal of Fuzzy Mathematics, vol. 4, no. 3, pp. 665-675, 1996.
  22. Pan-integrals with respect to imprecise probabilities, by Zhenyuan Wang, Wei Wang, and George J. Klir, International Journal of General Systems, vol. 25, no. 3, pp. 229-243, 1996.
BOOK CHAPTERS
  1. Protein local structure comparison: methods and future directions, by Jun Huan, Wei Wang, and Jan Prins, Advances in Computers by Chau-Wen Tseng (eds.), Elsevier, 2006.
  2. Models for sequential pattern mining, by Hye-Chung Kum, Susan Paulsen, and Wei Wang, A Book on FDM --- Lecture Notes in Computer Science, Springer-Verlag, 2006.
  3. Discovering evolutionary classifier over high speed non-static stream, by Jiong Yang, Xifeng Yan, Jiawei Han, and Wei Wang, Advanced Methods for Knowledge Discovery from Complex Data, pp. 337-364, 2005.
  4. Mining high dimensional data, by Wei Wang and Jiong Yang, Data Mining and Knowledge Discovery Handbook: A Complete Guide for Practitioners and Researchers, Kluwer Academic Publishers, 2005.
  5. PK-tree: a spatial index structure for high dimensional point data (extended version), by Wei Wang, Jiong Yang, and Richard Muntz, Information Organization and Databases, Kluwer Academic Publishers, 2000.
  6. DynamO: dynamic objects with persistent storage, by Jiong Yang, Silvia Nittel, Wei Wang, and Richard Muntz, in Advances in Persistent Object Systems, pp. 199-214, Morgan Kauffmann, 1999.
  7. Extension of lower probabilities and coherence of belief measures, Advances in Intelligent Computing, edited by B. Bouchon - Meunier, R. R. Yager, and L. A. Zadeh, Springer Verlag, pp. 62-69, 1995.
BOOKS
  1. Mining Sequential Patterns from Large Data Sets, by Wei Wang and Jiong Yang, in Series of Advances in Database Systems, edited by Ahmed Elmagarmid, Kluwer, 2005.
  2. Advances in Web-Age Information Management --- Lecture Notes in Computer Science No. 2762, edited by Guozhu Dong, Changjie Tang, and Wei Wang, Springer-Verlag, 2003.
SOFTWARES
  1. Fast Algorithm for Imputing Missing Genotypes in SNPs (NPUTE)
  2. MotifSpace Client for Discovering Spatial Motifs from the Protein Structure Space
  3. An ActiveX Control for Visualizing Proteins and Motifs (RasCtrl)
  4. Fast Frequent Subgraph Mining (FFSM)
  5. Pyramid-K Tree (PK-Tree)
PATENTS
  1. System and method for identifying coherent objects with application to E-commerce, 2003.
  2. System and probabilistic method for mining long patterns, 2001.
  3. System and method for mining patterns with noise, 2001.
  4. System and method for meta pattern discovery, 2001.
  5. Methods for identifying partial periodic patterns and corresponding event subsequences in an event sequence, 2000.
  6. Methods for identifying partial periodic patterns of infrequent events in an event sequence, 2000.
  7. Methods for mining weighted association rule, 2000 (US Patent No. 6415287, issued on July 2nd, 2002).
   
RESIDENCY STATUS       Permanent resident
   
GENDER       Female
   
REFERENCES    Available upon request