About

I am the Director of the Software Engineering Application Technology Lab at Huawei, China. Prior to joining Huawei, I was an ARC DECRA Fellow and a lecturer (equivalent to U.S. assistant professor) at the Faculty of Information Technology, Monash University, Australia. I got my Ph.D degree in June 2014 from the College of Computer Science and Technology, Zhejiang University, China. I was very fortunate to work under the supervision of Prof. Xiaohu Yang and Prof. Jianling Sun. From July 2012 to January 2014, I was a visiting student of Prof. David Lo in Singapore Management University.

To help developers and testers improve their productivity, my current research focuses on data science for software engineering, i.e., mining and analyzing rich data in software repositories to uncover interesting and actionable information. I have employed and customized structured and unstructured data analytics techniques – including data mining, information retrieval, natural language processing, search-based algorithms, and program analysis – to transform passive software engineering data into automated tools and new insights.

Research Interests

Data Science, AI and SE, Mining Software Repositories, Empirical Software Engineering

Google Scholar

Direct link to Google scholar page.

Selected Publications

Top Journal Papers

ACM Computing Surveys

  1. Chao Liu, Xin Xia, David Lo, Cuiyun Gao, Xiaohu Yang, John Grundy. “Opportunities and Challenges in Code Search Tools”. In ACM Computing Surveys (CSUR). Accepted.

IEEE Transactions on Software Engineering

  1. Bowen Xu, Thong Hoang, Abhishek Sharma, Chengran Yang, Xin Xia, David Lo. “Post2Vec: Learning Distributed Representations of Stack Overflow Posts”. IEEE Transactions on Software Engineering (TSE), Accepted.
  2. Jiakun Liu, Xin Xia, David Lo, Haoxiang Zhang, Ying Zou, Ahmed E. Hassan, Shanping Li. “Broken External Links on Stack Overflow”. IEEE Transactions on Software Engineering (TSE), Accepted.
  3. Cuiyun Gao, Jichuan Zeng, Zhiyuan Wen, David Lo, Xin Xia, Irwin King, Michael R. Lyu. “Emerging App Issue Identification via Online Joint Sentiment-Topic Tracing”. IEEE Transactions on Software Engineering (TSE), Accepted.
  4. Xiaoxue Wu, Wei Zheng, Xin Xia, David Lo. “Data Quality Matters: A Case Study on Data Label Correctness for Security Bug Report Prediction”. IEEE Transactions on Software Engineering (TSE), Accepted.
  5. Jiachi Chen, Xin Xia, David Lo, John Grundy, Xiapu Luo, Ting Chen. “DEFECTCHECKER: Automated Smart Contract Defect Detection by Analyzing EVM Bytecode”. IEEE Transactions on Software Engineering (TSE), Accepted.
  6. Tingting Bi, Xin Xia, David Lo, John Grundy, Thomas Zimmermann. “An Empirical Study of Release Note Production and Usage in Practice”. IEEE Transactions on Software Engineering (TSE), Accepted.
  7. Neng Zhang, Qiao Huang, Xin Xia, Ying Zou, David Lo, Zhenchang Xing. “Chatbot4QR: Interactive Query Refinement for Technical Question Retrieval”. IEEE Transactions on Software Engineering (TSE), 27 pages. Accepted.
  8. Chao Ni, Xin Xia, David Lo, Xiang Chen, Qing Gu. “Revisiting Supervised and Unsupervised Methods for Effort-Aware Cross-Project Defect Prediction”. IEEE Transactions on Software Engineering (TSE), 16 pages. Accepted.
  9. Jiachi Chen, Xin Xia, David Lo, John Grundy, Xiapu Luo, Ting Chen. “Defining Smart Contract Defects on Ethereum”. IEEE Transactions on Software Engineering (TSE), 17 pages. Accepted.
  10. Meng Yan, Xin Xia, Yuanrui Fan, Ahmed E. Hassan, David Lo, Shanping Li. “Just-In-Time Defect Identification and Localization: A Two-Phase Framework”. IEEE Transactions on Software Engineering (TSE), 20 pages. Accepted.
  11. Qiang He, Bo Li, Feifei Chen, John Grundy, Xin Xia, Yun Yang. “Diversified Third-Party Library Prediction for Mobile App Development”. IEEE Transactions on Software Engineering (TSE), 16 pages. Accepted.
  12. Zhipeng Gao, Lingxiao Jiang, Xin Xia, David Lo, John Grundy. “Checking Smart Contracts with Structural Code Embedding”. IEEE Transactions on Software Engineering (TSE), 18 pages. Accepted.
  13. Rubing Huang, Weifeng Sun, Yinyin Xu, Haibo Chen, Dave Towey, Xin Xia. “A Survey on Adaptive Random Testing”. IEEE Transactions on Software Engineering (TSE), 32 pages. Accepted.
  14. Zhongxin Liu, Xin Xia, David Lo, Zhenchang Xing, Ahmed E. Hassan, Shanping Li. “Which Variables Should I Log?” IEEE Transactions on Software Engineering (TSE), 20 pages. Accepted.
  15. Weiqin Zou, David Lo, Pavneet Singh Kochhar, Xuan-Bach Dinh Le, Xin Xia, Yang Feng, Zhenyu Chen, Baowen Xu. “Smart Contract Development: Challenges and Opportunities”. IEEE Transactions on Software Engineering (TSE), 20 pages. Accepted.
  16. Zhiyuan Wan, Xin Xia, David Lo, Gail Murphy. “How does Machine Learning Change Software Development Practices?” IEEE Transactions on Software Engineering (TSE), 14 pages. Accepted.
  17. Yuanrui Fan, Xin Xia, Daniel Alencar da Costa, David Lo, Ahmed E. Hassan, Shanping Li. “The Impact of Mislabeled Changes by SZZ on Just-in-Time Defect Prediction”. IEEE Transactions on Software Engineering (TSE), 26 pages. Accepted.
  18. Giovanni Viviani, Michalis Famelis, Xin Xia, Calahan Janik-Jones, Gail C. Murphy. “Locating Latent Design Information in Developer Discussions: A Study on Pull Requests”. IEEE Transactions on Software Engineering (TSE), 13 pages. Accepted.
  19. Zhiyuan Wan, Xin Xia, Ahmed E. Hassan. “What Do Programmers Discuss about Blockchain? A Case Study on the Use of Balanced LDA and the Reference Architecture of a Domain to Capture Online Discussions about Blockchain Platforms across Stack Exchange Communities”. IEEE Transactions on Software Engineering (TSE), 19 pages. Accepted.
  20. Lingfeng Bao, Xin Xia, David Lo, Gail Murphy. “A Large Scale Study of Long-Time Contributor Prediction for GitHub Projects”. IEEE Transactions on Software Engineering (TSE), 22 pages. Accepted.
  21. Zhiyuan Wan, Xin Xia, Ahmed E. Hassan, David Lo, Jianwei Yin, Xiaohu Yang. “Perceptions, Expectations, and Challenges in Defect Prediction”. IEEE Transactions on Software Engineering (TSE), 26 pages. Accepted.
  22. Qiao Huang, Xin Xia, David Lo, Gail Murphy. “Automating Intention Mining”. IEEE Transactions on Software Engineering (TSE), 22 pages. Accepted.
  23. Weiqin Zou, David Lo, Zhenyu Chen, Xin Xia, Yang Feng, Baowen Xu. “How Practitioners Perceive Automated Bug Report Management Techniques”. IEEE Transactions on Software Engineering (TSE), 28 pages. Accepted.
  24. Yuanrui Fan, Xin Xia, David Lo, Ahmed E. Hassan. “Chaff from the Wheat: Characterizing and Determining Valid Bug Reports”. IEEE Transactions on Software Engineering (TSE), 30 pages. Accepted.
  25. Meng Yan, Xin Xia, Emad Shihab, David Lo, Jianwei Yin, Xiaohu Yang. “Automating Change-level Self-admitted Technical Debt Determination”. IEEE Transactions on Software Engineering (TSE), 20 pages, 2018. Accepted.
  26. Lingfeng Bao, Zhenchang Xing, Xin Xia, David Lo. “VT-Revolution: Interactive Programming Video Tutorial Authoring and Watching System”. IEEE Transactions on Software Engineering (TSE), 16 pages, 2018. Accepted.
  27. Xin Xia, Lingfeng Bao, David Lo, Zhenchang Xing, Ahmed E. Hassan, Shanping Li. “Measuring Program Comprehension: A Large-Scale Field Study with Professionals”. IEEE Transactions on Software Engineering (TSE), 26 pages, 2017. Accepted.
  28. Xin Xia, David Lo, Ying Ding, Jafar M. Al-Kofahi, Tien N. Nguyen, Xinyu Wang. “Improving Automated Bug Triaging with Specialized Topic Model”. IEEE Transactions on Software Engineering (TSE) 43(3): 272-297 (2017)
  29. Xin Xia, David Lo, Sinno Jialin Pan, Nachiappan Nagappan, Xinyu Wang. “HYDRA: Massively Compositional Model for Cross-Project Defect Prediction”. IEEE Transactions on Software Engineering (TSE) 42(10): 977-998 (2016) (TSE Featured Paper on October 2016 issue.)

ACM Transactions on Software Engineering and Methodology

  1. Jiachi Chen, Xin Xia, David Lo, John Grundy. “Why Do Smart Contracts Self-Destruct? Investigating the Selfdestruct Function on Ethereum”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  2. Chao Liu, Cuiyun Gao, Xin Xia, David Lo, John Grundy, Xiaohu Yang. “On the Reproducibility and Replicability of Deep Learning in Software Engineering”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  3. Chao Liu, Xin Xia, David Lo, Zhiwei Liu, Ahmed E. Hassan, Shanping Li. “CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  4. Cuiyun Gao, Wenjie Zhou, Xin Xia, David Lo, Qi Xie, Michael R. Lyu. “Automating App Review Response Generation Based on Contextual Knowledge”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  5. Haoye Wang, Xin Xia, David Lo, Qiang He, Xinyu Wang, John Grundy. “Context-Aware Retrieval-based Deep Commit Message Generation”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  6. Qiuyuan Chen, Chunyang Chen, Safwat Hassan, Zhenchang Xing, Xin Xia, Ahmed E. Hassan. “How should I Improve the UI of My App: A Study of User Reviews of Popular Apps in the Google Play”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  7. Qiuyuan Chen, Xin Xia, Han Hu, David Lo, Shanping Li. “Why My Code Summarization Model Does Not Work: Code Comment Improvement with Category Prediction”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  8. Zhipeng Gao, Xin Xia, David Lo, John Grundy. “Technical Q&A Site Answer Recommendation via Question Boosting”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  9. Wenhan Wang, Ge Li, Sijie Shen, Xin Xia, Zhi Jin. “Modular Tree Network for Source Code Representation Learning”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  10. Zhipeng Gao, Xin Xia, John Grundy, David Lo, Yuan-Fang Li. “Generating Question Titles for Stack Overflow from Mined Code Snippets”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  11. Lingfeng Bao, Zhenchang Xing, Xin Xia, David Lo, Minghui Wu, Xiaohu Yang. “psc2code: Denoising Code Extraction from Programming Screencasts”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  12. Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xin Xia, Liming Zhu, John Grundy, Jinshui Wang. “Wireframe-Based UI Design Search Through Image Autoencoder”. In ACM Transactions on Software Engineering and Methodology (TOSEM). Accepted.
  13. Xiaoxue Ren, Zhenchang Xing, Xin Xia, David Lo, Xinyu Wang, John Grundy. “Neural Network Based Detection of Self-admitted Technical Debt: From Performance to Explainability”. In ACM Transactions on Software Engineering and Methodology (TOSEM). 46 pages, 2019. Accepted.

Top Conference Papers

ICSE

  1. Bo Yang, Zhenchang Xing, Xin Xia, Chunyang Chen, Deheng Ye, Shanping Li. “Don’t Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines”. 43rd ACM/IEEE International Conference on Software Engineering (ICSE 2021). Accepted as a Research Full Paper.
  2. Haoye Wang, Xin Xia, David Lo, John Grundy, Xinyu Wang. “Automatic Solution Summarization for Crash Bugs”. 43rd ACM/IEEE International Conference on Software Engineering (ICSE 2021). Accepted as a Research Full Paper.
  3. Yuanrui Fan, Xin Xia, David Lo, Ahmed E. Hassan, Yuan Wang, Shanping Li. “A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms”. 43rd ACM/IEEE International Conference on Software Engineering (ICSE 2021). Accepted as a Research Full Paper.
  4. Zhiyuan Wan, Xin Xia, David Lo, Jiachi Chen, Xiapu Luo, Xiaohu Yang. “Smart Contract Security: a Practitioners’ Perspective”. 43rd ACM/IEEE International Conference on Software Engineering (ICSE 2021). Accepted as a Research Full Paper.
  5. Xiaoxue Ren, Jiamou Sun, Zhenchang Xing, Xin Xia, Jianling Sun. “Demystify Official API Usage Directives with Crowdsourced API Misuse Scenarios, Erroneous Code Examples and Patches”. 42nd ACM/IEEE International Conference on Software Engineering (ICSE 2020). Accepted as a Full Paper.
  6. Xin Xia, Zhiyuan Wan, Pavneet Singh Kochhar, David Lo. ``How Practitioners Perceive Coding Proficiency’’. 41st ACM/IEEE International Conference on Software Engineering (ICSE 2019). Accepted as a Full Paper
  7. Xuan Bach D. Le, Lingfeng Bao, David Lo, Xin Xia, Shanping Li, Corina Pasareanu. ``On Reliability of Patch Correctness Assessment’’. 41st ACM/IEEE International Conference on Software Engineering (ICSE 2019). Accepted as a Full Paper
  8. Dehai Zhao, Zhenchang Xing, Chunyang Chen, Xin Xia, Guoqiang Li. ``ActionNet: Vision-based Workflow Action Recognition From Programming Screencasts’’. 41st ACM/IEEE International Conference on Software Engineering (ICSE 2019). Accepted as a Full Paper. ( Nominated for the ACM SIGSOFT Distinguishied Paper Award)

ESEC/FSE

  1. Zhipeng Gao, Xin Xia, David Lo, John Grundy, Tom Zimmermann. “Automating the Removal of Obsolete TODO Comments”. The 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2021). Accepted as a Research Full paper.
  2. Jiakun Liu, Sebastian Baltes, Christoph Treude, David Lo, Yun Zhang, Xin Xia. “Characterizing Search Activities on Stack Overflow”. The 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2021). Accepted as a Research Full paper.
  3. Bo Li, Qiang He, Feifei Chen, Xin Xia, Li Li, John Grundy, Yun Yang. “Embedding App-Library Graph for Neural Third Party Library Recommendation”. The 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2021). Accepted as a Research Full paper.

ASE

  1. Xing Hu, Zhipeng Gao, Xin Xia, David Lo, Xiaohu Yang. “Automating User Notice Generation for Smart Contract Functions”. 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021): Accepted as a Full Paper.
  2. Jiayuan Zhou, Michael Pacheco, Zhiyuan Wan, Xin Xia, David Lo, Yuan Wang, Ahmed E. Hassan. “Finding A Needle in a Haystack: Automatic Mining of Silent Vulnerability Fixes”. 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021): Accepted as a Full Paper.
  3. Shengyi Pan, Lingfeng Bao, Xiaoxue Ren, Xin Xia, David Lo, Shanping Li. “Automating Developer Chat Mining”. 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021): Accepted as a Full Paper.
  4. Jia Li, Yongmin Li, Ge Li, Xing Hu, Xin Xia, Zhi Jin. “EditSum: A Retrieve-and-Edit Framework for Source Code Summarization”. 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021): Accepted as a Full Paper.
  5. Yanqi Su, Zhenchang Xing, Xin Peng, Xin Xia, Chong Wang, Xiwei Xu, Liming Zhu. “ Reducing Bug Triaging Confusion by Learning from Mistakes with a Bug Tossing Knowledge Graph”. 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021): Accepted as a Full Paper. (ACM SIGSOFT Distinguished Paper Award)
  6. Zhongxin Liu, Xin Xia, Meng Yan, Shanping Li. “Automating Just-In-Time Comment Updating”. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020): Accepted as a Full Paper. (ACM SIGSOFT Distinguished Paper Award)
  7. Zhiyuan Wan, Gail Murphy, Xin Xia. “Predicting Code Context Models for Software Development Tasks”. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020): Accepted as a Full Paper.
  8. Xiaoxue Ren, Xinyuan Ye, Zhenchang Xing, Xin Xia, Xiwei Xu, Liming Zhu, Jianling Sun. “API-Misuse Detection Driven by Fine-Grained API-Constraint Knowledge Graph”. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020): Accepted as a Full Paper.
  9. Bolin Wei, Yongming Li, Ge Li, Xin Xia, Zhi Jin. “Retrieve and Refine: Exemplar-based Neural Comment Generation”. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020): Accepted as a Full Paper.
  10. Zhongxin Liu, Xin Xia, Christoph Treude, David Lo, Shanping Li. “Automatic Generation of Pull Request Descriptions”. 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019): Accepted as a Full Paper. (ACM SIGSOFT Distinguished Paper Award)
  11. Xiaoxue Ren, Zhenchang Xing, Xin Xia, Guoqiang Li, Jianling Sun. “Discovering, Explaining and Summarizing Controversial Discussions in Community Q&A Sites”. 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019): Accepted as a Full Paper.
  12. Cuiyun Gao, Jichuan Zeng, Xin Xia, David Lo, Michael R. Lyu, Irwin King. “Automating App Review Response Generation”. 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019): Accepted as a Full Paper.
  13. Qiao Huang, Xin Xia, Zhenchang Xing, David Lo, Xinyu Wang. API Method Recommendation without Worrying About the Task-API Knowledge Gap. 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE 2018): Accepted as a Full Paper
  14. Zhongxin Liu, Xin Xia, Ahmed E. Hassan, David Lo, Zhenchang Xing, Xinyu Wang. Neural-Machine-Translation-Based Commit Message Generation: How Far Are We? 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE 2018): Accepted as a Full Paper (ACM SIGSOFT Distinguished Paper Award)
  15. Bowen Xu, Zhenchang Xing, Xin Xia, David Lo. “AnswerBot - Automated Generation of Answer Summary to Developers’ Technical Questions”. 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2017): 706-716
  16. Bowen Xu, Deheng Ye, Zhenchang Xing, Xin Xia, Guibin Chen, Shanping Li. “Predicting Semantically Linkable Knowledge in Developer Online Forums via Convolutional Neural Network”. 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016): 51-62
  17. Lucia, David Lo, Xin Xia: Fusion fault localizers. 29th IEEE/ACM International Conference on Automated Software Engineering (ASE 2014): 127-138.

ISSTA

  1. Pavneet Singh Kochhar, Xin Xia, David Lo and Shanping Li. “Practitioners’ Expectations on Automated Fault Localization”. ACM International Symposium on Software Testing and Analysis (ISSTA 2016). Accepted as a Full Paper.

NeurIPS

  1. Bolin Wei, Ge Li, Xin Xia, Zhiyi Fu, Zhi Jin. “Code Generation as a Dual Task of Code Summarization”. Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019): Accepted as a Full Paper.

IJCAI

  1. Xing Hu, Ge Li, Xin Xia, David Lo, Shuai Lu, Zhi Jin. “Summarizing Source Code with Transferred API Knowledge”. 27th International Joint Conference on Artificial Intelligence (IJCAI 2018): 2269-2275

UbiComp

  1. Zhiyuan Wan, Lingfeng Bao, Debin Gao, Eran Toch, Xin Xia, Tamir Mendel, David Lo. “AppMoD: Helping Older Adults Manage Mobile Security with Online Social Help”. 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp 2020): Accepted as a Full Paper.

Contact

Xin Xia
Director
Software Engineering Application Technology Lab
Huawei, China
Email: xin.xia@acm.org