Talk title: Towards Utilization of Grammatical Information in Text Analysis
Speaker: Jyrki Nummenmaa (Professor, University of Tampere, Finland)
Analyzing text is a fundamental task with many applications, such as information retrieval, analysis of recommendations, question answering, and various tasks related to AI. As basic initial approaches, bag-of-words and k-gram based methods have achieved reasonable success in several areas. However, if we try to understand language content by looking only at a bag of words or a set of k-grams, the limitations of these representations become apparent, even when they are fed to highly advanced modelling methods. Grammar is what creates conceptual content out of words. Therefore, we suggest grammatical analysis of the text as a basis for further analysis. This does not by itself remove the need for neural networks and other machine learning methods; rather, it provides an optional way to preprocess and represent the data.
Jyrki Nummenmaa is a full professor at the School of Information Sciences of the University of Tampere, Finland, and the head of the Research Center for Information and Systems (CIS) at the same university.
Prof. Nummenmaa has done research on algorithms, databases, software development, business intelligence, data mining, open data, and Big Data, and, most recently, on text analysis. He has extensive administrative experience, as well as practical experience from 3.5 years of work in software companies in the Tampere area. He visited the University of Edinburgh for one year during his PhD research, the University of Chile for two months in 2004, and the IT Faculty of Chalmers and the University of Gothenburg for two months in 2017; more recently, he has visited universities in China several times. He has over 70 peer-reviewed publications in scientific journals and conferences.