picture
picture   Yanfang (Fanny) Ye, Ph.D.

Assistant Professor

Lane Department of Computer Science and Electrical Engineering
Benjamin M. Statler College of Engineering and Mineral Resources
West Virginia University

AERB 255
PO Box 6109, West Virginia University
Morgantown, WV 26506-6109
Office: (304) 293-9128
Email: yanfang.ye (at) mail (dot) wvu (dot) edu


Research Interests

"Innovation, research and education - for a better world!"

My research areas mainly include Cybersecurity, Data Mining, Machine Learning, and Health Intelligence. With long-term and strong collaboration with industry partners, I have proposed and developed cloud-based solutions for mining big data in the area of cybersecurity, especially for malware detection and phishing fraud detection. My research results have been published in several top conferences such as ACM SIGKDD (2018, 2017, 2011, 2010, 2009, 2007), IJCAI (2018), and ACSAC (2018, 2017), as well as top journals such as ACM CSUR (2017), KAIS (2017), IEEE TNNLS (2016), ACM TIST (2015), IEEE TSMC (2012, 2010), JIIS (2010), and JCV (2009, 2008). My proposed techniques have significantly reduced the time needed to detect new malicious software - from WEEKS to SECONDS, which have been incorporated into popular commercial products including Comodo and Kingsoft Antivirus that protect millions of users worldwide. In addition, I have been awarded three patents in the area of malware detection and categorization. I recently received the prestigious SIGKDD 2017 Best Paper Award and SIGKDD 2017 Best Student Paper Award (Applied Data Science Track), the IEEE EISIC 2017 Best Paper Award, and the New Researcher of the Year Award (2016-2017) from the Statler College at WVU. I have also received multiple prestigious awards from the NSF and NIJ in support of my researches. All these awards are highly competitive.



To Perspective Students

  • I am currently looking for Ph.D. students doing supervised research or independent study with me at LCSEE, WVU. If you are a well motivated and dedicated student pursuing a Ph.D. degree related to the areas of Cybersecurity, Data Mining, Machine Learning, and Health Intelligence, please send me an email with your CV.
  • For students who use your government support to study in USA, you can get tuition waiver at WVU. Please contact me if you like to work with me on Cybersecurity, Data Mining, Machine Learning, and Health Intelligence.


Latest News

  • Our paper entitiled "Adversarial Machine Learning in Malware Detection: Arms Race between Evasion Attack and Defense" recently received the IEEE EISIC 2017 Best Paper Award. Congratulations to our team! Congratulations to my student Lingwei Chen!
  • Our paper recently received the SIGKDD 2017 Best Paper Award and the SIGKDD 2017 Best Student Paper Award (Applied Data Science Track). Congratulations to our team for the SIGKDD 2017 Best Paper Award! Congratulations to my student Shifu Hou for the SIGKDD 2017 Best Student Paper Award! Our video won SIGKDD 2017 Audience Appreication Award Finalist (26,033 views on YouTube).

    Shifu Hou, Yanfang Ye (), Yangqiu Song, Melih Abdulhayoglu. "HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD) , 2017. (Download paper here; Download slides here; and Click this link to watch the video)


Selected Publications

* indicates that the author is my student; indicates the corresponding author.

Book Chapters

  • Yanfang Ye. "Intelligent Malware Detection by Applying Data Mining Techniques", In T. Li eds., Data Mining Where Theory Meets Practice, Xiament University Press, 2013, ISBN 978-7-5615-4294-1.

Journal Publications

  • Junxiang Wang, Liang Zhao, Yanfang Ye, Yuji Zhang. "Adverse Event Detection by Integrating Twitter Data and VAERS", Journal of Biomedical Semantics, 9(19), 2018.
  • Yanfang Ye, Tao Li, Donald Adjeroh, S. Sitharama Iyengar. "A Survey on Malware Detection Using Data Mining Techniques", ACM Computing Surveys (ACM CSUR), Vol. 50, Issue 3, Article No. 41, 2017.
  • Yanfang Ye (), Lingwei Chen*, Shifu Hou*, William Hardy*, Xin Li. "DeepAM: A Heterogeneous Deep Learning Framework for Intelligent Malware Detection", Knowledge and Information Systems (KAIS), Vol. PP (52): 1~21, 2017.
  • Gongde Guo, Lifei Chen, Yanfang Ye, Qingshan Jiang. "Cluster Validation Method for Determining the Number of Clusters in Categorical Sequences", IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), Vol. PP (99): 1-13, 2016.
  • Ming Ni, Tao Li, Qianmu Li, Hong Zhang, Yanfang Ye, Qingshan Jiang. "FindMal: A File-to-file Social Network Based Malware Detection Framework", Knowledge-Based Systems (KBS), 112: 142-151, 2016.
  • Yujie Fan*, Yanfang Ye, Lifei Chen. "Malicious Sequential Pattern Mining for Automatic Malware Detection", Expert Systems with Applications (ESWA), Vol. 52, pp. 16~25, 2016.
  • Yanfang Ye (), Tao Li, Haiyin Shen. "Soter: Smart Bracelets for Children's Safety", ACM Transactions on Intelligent Systems and Technology (ACM TIST), Vol.6, No. 4, Article 46, 2015.
  • Lifei Chen, Yanfang Ye, Gongde Guo, Jianping Zhu. "Kernel-based linear classification on categorical data", Soft Computing, pp. 1~13, 2015.
  • Weiwei Zhuang, Yanfang Ye, Yong Chen, Tao Li. "Ensemble Clustering for Internet Security Applications", IEEE Transactions on Systems, Man and Cybernetics-Part C: Applications and Reviews, Vol 42, pp. 1784~1796, 2012.
  • Weiwei Zhuang, Yanfang Ye, Tao Li, Qingshan Jiang. "Intelligent phishing website detection using classification ensemble", Systems Engineering - Theory & Practice, Vol. 31, issue (10): 2008-2020, 2011.
  • Yanfang Ye, Tao Li, Qingshan Jiang, Youyu Wang. "CIMDS: Adapting post-processing techniques of associative classification for malware detection system", IEEE Transactions on Systems, Man and Cybernetics-Part C: Applications and Reviews, Vol 40, pp. 298~307, 2010.
  • Yanfang Ye, Lifei Chen, Dingding Wang, Tao Li, Qingshan Jiang, Min Zhao. "SBMDS: an interpretable string based malware detection system using SVM ensemble with bagging", Journal in Computer Virology, Vol 5, pp. 283~293, 2009.
  • Yanfang Ye, Tao Li, Kai Huang, Qingshan Jiang, Yong Chen. "Hierarchical Associative Classifier (HAC) for Malware Detection from the Large and Imbalanced Gray List", Journal of Intelligent Information Systems, Vol 35, pp. 1~20, 2009.
  • Weiwei Zhuang, Yanfang Ye, Qingshan Jiang, Zhixue Han. "Application of Incremental Associative Classification Method in Malware Detection", Computer Engineering, 35 (4): 159-161, 2009.
  • Yanfang Ye, Dingding Wang, Tao Li, Dongyi Ye, Qingshan Jiang. "An Intelligent PE-Malware Detection System Based on Association Mining", Journal in Computer Virology, Vol 4, pp. 323~334, 2008.

Conference Publications

  • Yuyang Gao, Liang Zhao, Lingfei Wu, Yanfang Ye, Hui Xiong, Chaowei Yang. "Incomplete Label Multi-task Deep Learning for Spatio-temporal Event Subtype Forecasting", P33rd AAAI Conference on Artificial Intelligence (AAAI), 2019. (16.7% acceptance rate)
  • Yujie Fan*, Shifu Hou*, Yiming Zhang*, Yanfang Ye (), Melih Abdulhayoglu. "Gotcha - Sly Malware! Scorpion: A Metagraph2vec Based Malware Detection System", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD), 2018. (22.5% acceptance rate)
  • Yujie Fan*, Yiming Zhang*, Yanfang Ye (), Xin Li. "Automatic Opioid User Detection from Twitter: Transductive Ensemble Built on Different Meta-graph Based Similarities over Heterogeneous Information Network", 27th International Joint Conference on Artificial Intelligence (IJCAI), 2018. (20.5% acceptance rate)
  • Junxiang Wang, Liang Zhao, Yanfang Ye. "Semi-supervised Multi-instance Learning for Flu Shot Adverse Event Detection", IEEE international conference on Big Data (BigData), 2018. (18.9% acceptance rate)
  • Yanfang Ye (), Shifu Hou*, Lingwei Chen*, Xin Li, Liang Zhao, Shouhuai Xu, Jiabin Wang, Qi Xiong. "ICSD: An Automatic System for Insecure Code Snippet Detection in Stack Overflow over Heterogeneous Information Network", Annual Computer Security Applications Conference (ACSAC), 2018. (20.1% acceptance rate)
  • Shifu Hou*, Yanfang Ye (), Yangqiu Song, Melih Abdulhayoglu. "HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD), 2017. SIGKDD 2017 Best Paper Award and SIGKDD 2017 Best Student Paper Award (Applied Data Science Track). (9.2% acceptance rate for oral)
    SIGKDD 2017 Audience Appreciation Award Finalist: 26,033 views on YouTube.
  • Lingwei Chen*, Yanfang Ye (), Thirimachos Bourlai. "Adversarial Machine Learning in Malware Detection: Arms Race between Evasion Attack and Defense", IEEE European Intelligence and Security Informatics Conference (EISIC), 2017. IEEE EISIC 2017 Best Paper Award. (~25% acceptance rate)
  • Lingwei Chen*, Shifu Hou*, Yanfang Ye (). "SecureDroid: Enhancing Security of Machine Learning-based Detection against Adversarial Android Malware Attacks", Annual Computer Security Applications Conference (ACSAC), 2017. (19.7% acceptance rate)
  • Yujie Fan*, Yiming Zhang*, Yanfang Ye (), Xin Li, Wanhong Zheng. "Social Media for Opioid Addiction Epidemiology: Automatic Detection of Opioid Addicts from Twitter and Case Studies", ACM International Conference on Information and Knowledge Management (CIKM), 2017. (~20% acceptance rate)
  • Shifu Hou*, Aaron Saas*, Yanfang Ye (), Lifei Chen. "Deep4MalDroid: A Deep Learning Framework for Android Malware Detection Based on Linux Kernel System Call Graphs", IEEE/WIC/ACM International Conference on Web Intelligence Workshops (WIW) , 2016.
  • Shifu Hou*, Aaron Saas*, Yanfang Ye (), Lifei Chen. "DroidDelver: An Android Malware Detection System Using Deep Belief Network Based on API Call Blocks", Proceedings of International Conference on Web-Age Information Management (WAIM) , 2016.
  • Lingwei Chen*, William Hardy*, Yanfang Ye (), Tao Li. "Analyzing File-to-File Relation Network in Malware Detection", Web Information System Engineering (WISE) , pp. 415~430, 2015.
  • Shifu Hou*, Lifei Chen, Egemen Tas, Igor Demihovskiy, Yanfang Ye (). "Cluster-Oriented Ensemble Classifiers for Malware Detection", IEEE International Conference on Sematic Computing (IEEE ICSC) , pp. 189~196, 2015.
  • Lingwei Chen*, Tao Li, Melih Abdulhayoglu, Yanfang Ye (). "Malware Detection Based on File Relation Graphs", IEEE International Conference on Sematic Computing (IEEE ICSC) , pp. 85~92, 2015 (Invited Paper).
  • Yanfang Ye, Tao Li, Shenghuo Zhu, Weiwei Zhuang, Egemen Tas, Umesh Gupta, Melih Abdulhayoglu. "Combining File Content and File Relations for Cloud Based Malware Detection", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD) , pp. 222~230, 2011. (8% acceptance rate for oral)
  • Yanfang Ye, Tao Li, Yongchen, Qingshan Jiang. "Automatic Malware Categorization Using Cluster Ensemble", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD) , pp. 95~104, 2010. (10.9% acceptance rate for oral)
  • Yanfang Ye, Tao Li, Qingshan Jiang, Zhixue Han, Li Wan. "Intelligent File Scoring System for Malware Detection from the Gray List", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD) , pp. 1385~1394, 2009. (9.8% acceptance rate for oral)
  • Yanfang Ye, Yinming Mei, Rencheng Peng. "MCNS: Intelligent Malware Categorizing and Naming System", The 12th Association of anti Virus Asia Researchers International Conference (AVAR) , pp. 15~25, 2009.
  • Lifei Chen, Yanfang Ye, Qingshan Jiang. "A New Centroid-Based Classifier for Text Categorization", Advanced Information Networking and Applications Workshops (AINAW) , pp. 1217~1222, 2008.
  • Yanfang Ye, Dingding Wang, Tao Li, Dongyi Ye. "IMDS: Intelligent Malware Detection System", Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD) , pp. 1043~1047, 2007. (17.9% acceptance rate)


Current Students

  • Lingwei Chen (Ph.D. Student, Fall 2014 -- )
  • Shifu Hou (Ph.D. Student, Fall 2014 -- )
  • Aaron Saas (Ph.D. Student, Spring 2016 -- )
  • Yujie Fan (Ph.D. Student, Fall 2016 -- )
  • Yiming Zhang (Ph.D. Student, Fall 2016 -- )
  • William B. Hardy (MS Student, Spring 2015 -- )

Graduated Students

  • Jian Liu (MS, June 2018)
  • Sai Venkata Akhil Thammineni (MS, November 2017)
  • Srinivas Garapati, MS (MS, October 2017)
  • Utsav Kirtikumar Upadhyay (MS, September 2017)
  • Madhusudhan Reddy Boddu (MS, March 2017)
  • Sai Ram Nellutla (MS, December 2015)
  • Dominique Amos (BS, December 2015)
  • Alex Finkelstein (BS, May 2015)
  • Kevin Hao (BS, May 2015)
  • Michael Hite (BS, May 2015)
  • Joshua Suess (BS, May 2015)
  • Jacob Sutton (BS, May 2015)
  • Sam Wood (BS, May 2015)
  • Reem AL Alshikh (BS, May 2015)
  • Zainab Alamri (BS, May 2015)

Former Students

  • Madhuri Siddula (Ph.D. Student, Spring 2015 - Summer 2016)


Teaching

  • CS 573: Advanced Data Mining [Fall 2018]
    Lectures: R 5:00pm -- 7:30pm in NRC-E127
    Office Hours: Fridays 2:30pm -- 4:30pm, or by appointment, in AERB-255
  • CS 467: Practicing Cybersecurity: Attacks and Countermeasures [Spring 2018]
  • CS 573: Advanced Data Mining [Spring 2017, Spring 2016, Spring 2015]
  • CS 569: Cybersecurity and Big Data Analytics [Fall 2017, Fall 2016, Fall 2015, Fall 2014]
  • CS 426: Discrete Mathematics [Spring 2014]