Marie-Francine Moens

Full professor
Department of Computer Science
Katholieke Universiteit Leuven
Celestijnenlaan 200A (Room 04.25)
B-3001 Heverlee

Phone: +32 (0)16 32 53 83
Fax: +32 (0)16 32 79 96

Short biography

Marie-Francine (Sien) Moens is a full professor at the Department of Computer Science at KU Leuven, Belgium. She holds a M.Sc. and a Ph.D. degree in Computer Science from this university. She is head of the Language Intelligence and Information retrieval (LIIR) research group and is a member of the Human Computer Interaction unit. She is currently also head of the Informatics section of the Department of Computer Science at KU Leuven.

Her main interests are in the domain of automated content recognition in text and multimedia data and its application in information extraction and retrieval using statistical machine learning, and exploiting insights from linguistic and cognitive theories. She investigates topics such as:

  • Machine learning for natural language processing, especially learning with limited supervision.
  • Machine learning for information retrieval, classification and aggregation.
  • Deep learning and latent variable models.
  • Information extraction from text, content recognition, machine reading, semantic parsing, discourse understanding, text mining, knowledge acquisition in the biomedical and legal domains.
  • Information retrieval and search models, text based question answering and reasoning.
  • Multimodal and multilingual processing, alignment, summarization and retrieval of content.
  • Multimedia information retrieval.
  • Processing of noisy text such as user generated content and speech transcripts.
  • e-Forensics: content and email filtering.

She is currently a member of the Council of the Industrial Research Fund of KU Leuven and is the scientific manager of the EU COST action iV&L Net (The European Network on Integrating Vision and Language). She is a member of the editorial board of the journal Foundations and Trends® in Information Retrieval. In 2011 and 2012 she was appointed as chair of the European Chapter of the Association for Computational Linguistics (EACL) and was a member of the executive board of the Association for Computational Linguistics (ACL). From 2010 until 2014 she was a member of the Research Council of KU Leuven.

She is the author of many international publications and books including the publication of:

Moens, M.-F., Li, J. & Chua, T.-S. (Eds.) (2014). Mining User Generated Content. Chapman and Hall/CRC.

Moens, M.-F. (2006). Information Extraction: Algorithms and Prospects in a Retrieval Context (The Information Retrieval Series 21). New York: Springer.

Moens, M.-F. (2000). Automatic Indexing and Abstracting of Document Texts (The Kluwer International Series on Information Retrieval 6). Kluwer Academic Publishers: Boston.

Curriculum vitae


B-KUL-H02C8B - Text Based Information Retrieval

B-KUL-H02B1AE - Natural Language Processing

B-KUL-G0N28A - Beginselen van programmeren

B-KUL-G0K34A - Geschiedenis van de informatica

Research projects

ACCUMULATEACquiring CrUcial Medical information Using LAnguage TEchnology
DISKDIScovery of Knowledge on Chinese Medicinal Plants in Biomedical Texts
iV&LCombining Computer Vision and Language Processing For Advanced Search (EU ICT COST Action)
MARSMAchine Reading of patient recordS
MUSTERMUltimodal processing of Spatial and TEmporal expRessions (EU CHIST-ERA)
PARISPersonalised AdveRtisements buIlt from web Sources
RadicalRadical Innovations
SAEL-RMR A Synergetic Approach to Extraction, Learning and Reasoning for Machine Reading
SAMSemantic Annotation and retrieval of Multimedia
SCATESmart Computer-Aided Translation Environment
TextLink Structuring Discourse in Multilingual Europe (EU ISCH Cost Action)
A4MC3Architectures for Mobile Community Content Creation
ACILAAutomatic Detection and Classification of Arguments in a Legal Case
AcknowledgeAccessible & Open Knowledge Infrastructure for Flanders
AMASS++Advanced Multimedia Alignment and Structured Summarization
AntiPhishAnticipatory Learning for Reliable Phishing Prevention (EU ICT FP6)
CADIALComputer Aided Document Indexing for Accessing Legislation
CLARINCommon Language Resources and Technology Infrastructure
CLASSCognitive-Level Annotation using Latent Statistical Structure (EU ICT FP6)
CORNETTOCombinatorial and Relational Network as Toolkit for Dutch Language Technology
DAISYDutch lAnguage Investigation of Summarization technologY
GIEGeneric Technology for Information Extraction from Texts
INFO-NSIntelligent exploitation tools for nonstructured information for the Belgian federal police
ISISIndexing Structures and retrievability of Information Sources
LINDOLarge scale distributed INDexation of multimedia Objects
MUMIAMUltilingual and Multifaceted Interactive information Access (EU ICT COST Action)
MUSEMachine Understanding for interactive StorytElling (EU ICT FP7 FET)
PuppyIRAn Open Source Environment to Construct Information Services for Children (EU ICT FP7)
Spraak2Taal Extraction of Ideological Content from Political Speeches
SUNSHINEVirtual Data Analyst for Interactive Data-Driven Journalism
TERENCEAn Adaptive Learning System for Reasoning about Stories (EU ICT FP7)
TermWiseTermWise: Resources for Specialized Language Use
TEXT MININGText Mining for the Dissemination of Statistical Information via the Internet
TIMEAdvanced Time Based Text Mining
TOSCA-MPTask-Oriented Search and Content Annotation for Media Production (EU ICT FP7)
WebInsightWebInsight: Towards Modeling Correlation and Evolution of Web Documents

Selection of recent keynotes and invited lectures

Learning Representations for Language Understanding: Experiences from the MUSE Project, Deep Learning Summit, London, UK, 25-9-2015.

Information Extraction from Biomedical Texts: Learning Models with Limited Supervision, Sixth International Workshop on Health Text Mining and Information Analysis at EMNLP 2015, Lisbon, Portugal, 17-9-2015.

Visualizing the Content of a Text in a Virtual World, XIII Conference on Culture and Computer Science, Berlin, Germany, 28-5-2015.

Argumentation Mining, Forum for Information Retrieval Evaluation (FIRE), India International Center, New Delhi, India, 6-12-2013.

Selection of recent invited tutorials

Integrating Language, Semantics, and Multimedia for IR, at the 10th European Summer School in Information Retrieval (ESSIR 2015), Thessaloniki, Greece, 4-9-2015.

Argumentation Mining: Methods, Challenges and Possible Solutions, at the Summer School on Argumentation 2014: Computational and Linguistic Perspectives, University of Dundee, UK, 4-9-2014.

Multilingual Probabilistic Topic Modeling and its Applications in Web Mining and Search, tutorial jointly with Ivan Vulić at the 7th ACM WSDM Conference (WSDM 2014), New York City, 24-2-2014.

Information Extraction and Linking in a Retrieval Context, tutorial at the Erasmus Intensive Programme Information Foraging, Radboud University Nijmegen, The Netherlands, 28-8-2013 – 29-8-2013.


See LIRIAS database.
  1. HEYMAN, Geert, VULIC, Ivan & MOENS, Marie-Francine (2016) C-BiLDA Extracting Cross-lingual Topics from Non-Parallel Texts by Distinguishing Shared from Unshared Content. Data Mining and Knowledge Discovery.
  2. ZOGHBI, Susana, HEYMAN, Geert, GOMEZ, Juan-Carlos & MOENS, Marie-Francine (2016) Cross-modal Fashion Search. In Proceedings of the 22nd International Conference on MultiMedia Modelling (Lecture Notes in Computer Science 9517) (pp. 367-373).
  3. FARNADI, G., SITARAMAN, G., SUSHMITA, S., CELLI, F., KOSINSKI, M., STILLWELLl, D., DAVALOS, S., MOENS, M.-F. & DE COCK, M. (2016) Computational Personality Recognition in Social Media. Journal of Personalization (accepted).
  4. VULIC, Ivan & MOENS, Marie-Francine (2016) Bilingual Distributed Word Representations from Document-Aligned Comparable Data. Journal of Artificial Intelligence Research (accepted).

Go back to the LIIR research group webpages.