Yijia Liu (刘一佳)
Natural language processing, Chinese word segmentation, parsing, and machine learning.
Ph.D. (thesis in Chinese), Harbin Institute of Technology, 2014.9 - 2019.6
Visiting Student, University of Washington, 2016.10 - 2017.9
Supervisor: Noah A. Smith
M.S., Harbin Institute of Technology, 2012.9 - 2014.7
Major: Computer Science
B.E., Harbin Institute of Technology, 2008.9 - 2012.7
Major: Computer Science
Yijia Liu, Wanxiang Che, Bing Qin, and Ting Liu. 2020. Exploring Segment Representations for Neural Semi-Markov Conditional Random Fields. In IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 813-824, 2020.
Yijia Liu, Wanxiang Che, Yuxuan Wang, Bo Zheng, Bing Qin, and Ting Liu. 2019. Deep Contextualized Word Embeddings for Universal Dependency Parsing. In ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP). Volume 19 Issue 1, July 2019.
Yijia Liu, Wanxiang Che, Huaipeng Zhao, Bing Qin, and Ting Liu. 2018. Distilling Knowledge for Search-based Structured Prediction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL). | [code for parser] | [code for NMT] | [slide]
Yijia Liu, Yi Zhu, Wanxiang Che, Bing Qin, Nathan Schneider, and Noah A. Smith. 2018. Parsing Tweets into Universal Dependencies. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL). | [code] | [data] | [poster]
Wanxiang Che, Yijia Liu, Yuxuan Wang, Bo Zheng, and Ting Liu. 2018. Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (CoNLL). | [shared-task results] | [slide] | [poster]
Yutai Hou, Yijia Liu, Wanxiang Che, and Ting Liu. 2018. Sequence-to-Sequence Data Augmentation for Dialogue Language Understanding. In Proceedings of the 27th International Conference on Computational Linguistics (COLING). | [code]
Haoyang Wen, Yijia Liu, Wanxiang Che, Libo Qin, and Ting Liu. 2018. Sequence-to-Sequence Learning for Task-oriented Dialogue with Dialogue State Representation. In Proceedings of the 27th International Conference on Computational Linguistics (COLING).
Yijia Liu, Wanxiang Che, Jiang Guo, Bing Qin, and Ting Liu. 2016. Exploring Segment Representations for Neural Segmentation Models. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI). | [code] | [slide] | [poster]
Yijia Liu, Yue Zhang, Wanxiang Che, and Ting Liu. 2014. Domain Adaptation for CRF-based Chinese Word Segmentation using Free Annotations. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). | [code]
Yijia Liu, Wanxiang Che, and Ting Liu. 2013. Enhancing Chinese Word Segmentation with Character Clustering. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (CCL).
Meishan Zhang, Wanxiang Che, Yijia Liu, Zhenghua Li, Ting Liu. 2012. HIT dependency parsing: Bootstrap aggregating heterogeneous parsers. In Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL). | [shared-task results]
CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2018.4 - 2018.6
Our system (HIT-SCIR) ranked first out of 26 submitted systems according to LAS.
- contributed the ideas of using deep contextualized word embeddings and ensembling.
- developed the treebank concatenation strategies.
Language Technology Platform (LTP), 2013.6 - 2019.6
LTP is a software package that provides a Chinese natural language processing pipeline along with a web service API.
- one of the developers and the major maintainer of LTP.
- developed four modules (Chinese word segmentation, POS tagging, NER, and dependency parsing) in a perceptron algorithm framework.
- developed the RESTful API and contributed to the development of the website.
Conference Reviewer: ACL 2018-2020; EMNLP 2019; NAACL 2018-2019; IJCAI 2016, 2020; AAAI 2020; CCL 2015-2019; NLPCC 2015-2019; SemEval 2016; Computational Linguistics
Tips for making a good academic slide (in Chinese), at the CCL 2018 student workshop, 2018.10
Tips for improving the clarity of your paper (in Chinese), at the NLPCC 2018 student workshop, 2018.08
Research Assistant, Singapore University of Technology and Design. 2013.10 - 2014.10
Worked with Dr. Yue Zhang on statistical machine translation, Chinese tagging, and transition-based dependency parsing.
Intern Researcher and Developer, Baidu Inc., NLP Department. 2011.7 - 2011.11
Implemented a query template extraction toolkit and built a Python extension for the Baidu wordseg library.
2018 Baidu Scholarship 2018.12
First Class Award in Heilongjiang Provincial Science and Technology Prizes: The Language Technology Platform and its Applications 2016.9
Hua Wei Scholarship (for graduate student) 2016.9
The National Scholarship for graduate students 2013.9
2010 ACM/ICPC Asia Regional Contest Hangzhou Onsite, Silver Medal 2010.10
Hua Wei Scholarship (for undergraduate student) 2010.9