Department of Linguistics University of Illinois at Urbana-Champaign

Header Navigation

Dr. Corina Roxana Girju

Director of the Joint Major in Computer Science and Linguistics
Associate Professor of Linguistics
Spanish and Portuguese
Affiliated Faculty of Computer Science
Affiliated Faculty of Beckman Institute
Affiliate of Center for Translation Studies

User Photo
  • Address:
    University of Illinois
    Dept. of Linguistics, MC-168
    707 South Mathews Avenue
    Urbana, IL 61801

    Office: 4016B Foreign Languages Bldg.
  • Telephone: (217) 244-3060
  • Email:
  • Homepage: Visit Website

Research Interests

Natural Language Processing / Computational Linguistics; Information Extraction and Retrieval; Text Data Mining; Computational Semantics, Inference and Reasoning; Machine Learning; Cognitive Linguistics and Computational Cognitive Modeling with applications to Question Answering, Machine Translation, and Knowledge Base Systems; and more recently Computer Vision and Medical Informatics.


  • Ph.D. University of Texas at Dallas (2002)

Current Projects

  • Computational pragmatics (speech acts, presuppositions and entailment)
  • Commonsense and world knowledge acquisition (including socio-cultural norms)
  • Inference and reasoning (causal knowledge acquisition and inference, causal chains, implicit causality)
  • Computational cognitive modeling/ Social computing (cognitive issues of linguistic meaning and language use, perception, intentions)
  • Lexical semantics (semantic parsing, noun phrase interpretation)
  • Publications

    Journal Articles

    Book Contributions

    • Cho, Jason, Tony Gao, and Roxana Girju. "Identifying Medications that Patients Stopped Taking in Online Health Forums." The 11th International Conference on Semantic Computing (IEEE ICSC). 2017.
    • Riaz, Mehwish, and Roxana Girju. "In-depth Exploitation of Noun and Verb Semantics to Identify Causation in Verb-Noun Pairs.." The 15th Annual Conference of the Special Interest Group on Discourse and Dialogue (SIGDial). 2014. <>.
    • Al-Sabbagh, Rania, Roxana Girju, and Jana Diesner. "Unsupervised Learning of Arabic Modal Multiword Expressions." Proceedings of the 10th Workshop on Multiword Expressions at the 14th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 2014. <>.
    • Al-Sabbagh, Rania, Jana Diesner, and Roxana Girju. "Using the Semantics-Syntax Interface for Reliable Arabic Modality Annotation." Proceedings of the 6th International Joint Conference of Natural Language Processing (IJCNLP 2013),. Association for Computational Linguistics, 2013.
    • Al-Sabbagh, Rania. "YADAC: Yet Another Dialectal Arabic Corpus." Proceedings of the 8th Language Resources and Evaluation Conference (LREC 2012). European Language Resources Association (ELRA), 2012. <>.
    • Paul, Michael, Cheng Zhai, and Roxana Girju. "Summarizing Contrastive Viewpoints In Opinionated Text." Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2010) Conference. Association for Computational Linguistics, 2010.
    • Paul, Michael, and Roxana Girju. "A Two-Dimensional Topic-Aspect Model for Discovering Multi-Faceted Topics." Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI-2010). 2010. <>.
    • Paul, Michael. "Cross-cultural Analysis of Blogs and Forums with Mixed-collection Topic Models." Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2009) Conference. Association for Computational Linguistics, 2009. <>.