Professor Aline Villavicencio

MPhil, PhD

School of Computer Science

Chair in Natural Language Processing

Member of the Natural Language Processing research group

Aline Villavicencio profile photo
Profile picture of Aline Villavicencio profile photo
a.villavicencio@sheffield.ac.uk

Full contact details

Professor Aline Villavicencio
School of Computer Science
Regent Court (CS)
211 Portobello
Sheffield
S1 4DP
Profile

Aline Villavicencio received her PhD and MPhil degrees from the University of Cambridge (UK) and MSc in Computer Science from the Federal University of Rio Grande do Sul (Brazil).

She was a Visiting Scholar at the Massachusetts Institute of Technology (USA) (in the Department of Linguistics and Philosophy in 2014/2015 and in the Laboratory of Information and Decision Systems in 2011/2012) at the Labo颅ra颅toire LaTTiCe at the 脡cole Normale Sup茅颅rieure (France) in 2014, an Erasmus-Mundus Visiting Scholar at Saarland University (Germany) in 2012/2013, and at the University of Bath in 2006-2009.

From 2007-2017 she held a Research Fellowship from the Brazilian Scientific Research Council (CNPq). She is also affiliated to the Federal University of Rio Grande do Sul (Brazil)

Research interests

Her research interests are in lexical semantics, multilinguality, and cognitively motivated NLP. This work includes techniques for Multiword Expression treatment using statistical methods and distributional semantic models, and applications like Text Simplification and Question Answering, for languages like English and Portuguese.

Publications

Books

  • Poibeau T & Villavicencio A (2017) . Cambridge University Press.
  • Villavicencio A, Poibeau T, Korhonen A & Alishahi A (2013) Cognitive Aspects of Computational Language Acquisition. Springer Science & Business Media.
  • Caseli HDM, Villavicencio A, Teixeira A & Perdig茫o F (2012) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface.
  • (2011) . Wiley.
  • (2002) . Vandenhoeck & Ruprecht.

Edited books

  • Bansal M & Villavicencio A (Ed.) (2019) Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics (ACL).
  • Villavicencio A, Moreira V, Abad A, Caseli H, Gamallo P, Ramisch C, Oliveira H & Paetzold G (Eds.) (2018) . Springer.

Journal articles

  • Yu Q, Zhang C, Jin G, Huang T, Zhou W, Li W, Jin X, Huang B, Zhao Y, Yang G , Lip GYH et al (2026) . IEEE Transactions on Image Processing, 35, 1290-1304.
  • Castro GA, Barioto LG, Cao YH, Silva RM, Caseli HM, Machado-Neto JA, Cerri R, Villavicencio A & Almeida TA (2026) . Artificial Intelligence in Medicine, 171, 103302-103302.
  • Yamaguchi A, Villavicencio A & Aletras N (2025) . Computational Linguistics.
  • Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Adapting chat language models using only target unlabeled language data. Transactions on Machine Learning Research, 2025(09).
  • He W, Vieira TK, Garcia M, Scarton C, Idiart M & Villavicencio A (2025) . Computational Linguistics, 51(2), 505-555.
  • Soroka G, Idiart M & Villavicencio A (2024) . PLoS ONE, 19(2).
  • Peng B, He W, Chen B, Villavicencio A & Wu C (2024) . Pattern Recognition Letters, 178, 84-90.
  • Gow-Smith E, Phelps D, Tayyar Madabushi H, Scarton C & Villavicencio A (2024) . Proceedings of the 9th Workshop on Representation Learning for NLP (RepL4NLP-2024), 118-135.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) Vocabulary Expansion for Low-resource Cross-lingual Transfer.. CoRR, abs/2406.11477.
  • He W, Idiart M, Scarton C & Villavicencio A (2024) . Findings of the Association for Computational Linguistics ACL 2024, 12473-12485.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) . Findings of the Association for Computational Linguistics: EMNLP 2024, 6760-6785.
  • Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2024) Vocabulary Expansion of Chat Models with Unlabeled Target Language Data.. CoRR, abs/2412.11704.
  • He W, Farrahi K, Chen B, Peng B & Villavicencio A (2024) . Pattern Recognition Letters, 177, 40-46.
  • Wilkens R, Zilio L & Villavicencio A (2024) . Language Resources and Evaluation, 58(1), 175-201.
  • Salle A & Villavicencio A (2022) . Journal of Experimental and Theoretical Artificial Intelligence, 35(8), 1161-1199.
  • Muresan S, Nakov P & Villavicencio A (2022) Message from the Program Chairs. Proceedings of the Annual Meeting of the Association for Computational Linguistics, vii-xi.
  • Boito MZ, Villavicencio A & Besacier L (2021) . Machine Translation, 34, 305-323.
  • Villavicencio A & Van Durme B (2020) Introduction. Emnlp 2020 Conference on Empirical Methods in Natural Language Processing Tutorial Abstracts, III.
  • Ville A, Levine E, Zhi D, Lararia B & Wojcicki JM (2020) . Journal of the American College of Nutrition, 39(1), 47-53.
  • Villavicencio A & Idiart M (2019) . Natural Language Engineering, 25(6), 715-733.
  • Cordeiro S, Villavicencio A, Idiart M & Ramisch C (2019) . Computational Linguistics, 45(1), 1-57.
  • Idiart MAP, Villavicencio A, Katz B, Renn贸-Costa C & Lisman J (2019) . Frontiers in Computational Neuroscience, 13.
  • Wilkens R, Vecchia AD, Boito MZ, Padr贸 M & Villavicencio A (2014) , 129-140.
  • Ramisch C, Villavicencio A & Kordoni V (2013) . ACM Transactions on Speech and Language Processing, 10(2), 1-10.
  • Villavicencio A (2012) . Natural Language Engineering, 18(4), 575-579.
  • De Almeida L, Idiart M, Villavicencio A & Lisman J (2012) . Hippocampus, 22(8), 1647-1651.
  • de Caseli HM, Ramisch C, das Gra莽as Volpe Nunes M & Villavicencio A (2010) . Language Resources and Evaluation, 44(1-2), 59-77.
  • Baldwin T, Kordoni V & Villavicencio A (2009) . Computational Linguistics, 35(2), 119-149.
  • Villavicencio A (2005) . Computer Speech & Language, 19(4), 415-432.
  • Villavicencio A, Bond F, Korhonen A & McCarthy D (2005) . Computer Speech & Language, 19(4), 365-377.
  • Xia F, Derr T, Luu AT, Singh R & Villavicencio A () . ACM Transactions on Intelligent Systems and Technology.
  • He W, Vieira TK, Gonzalez MG, Scarton C, Idiart M & Villavicencio A () Finding Idiomaticity in Word Representations. Computational Linguistics.
  • Becker N, de Lima M眉ller J, de Carvalho Rodrigues J, Villavicencio A & de Salles JF () . 尝别迟谤么苍颈肠补, 7(1), 325-347.
  • Zortea M, Menegola B, Villavicencio A & de Salles JF () . Psicologia: Reflex茫o e Cr铆tica, 27(1), 90-99.
  • Villavicencio A, Sadler L & Arnold D () . Proceedings of the International Conference on Head-Driven Phrase Structure Grammar.
  • Villavicencio A & Copestake A () . Proceedings of the International Conference on Head-Driven Phrase Structure Grammar.

Book chapters

  • Villavicencio A (2020) , IVITRA Research in Linguistics and Literature (pp. vii-xii). John Benjamins Publishing Company
  • Poibeau T & Villavicencio A (2018) , Language Cognition and Computational Models (pp. 3-24).
  • Boos R, Prestes K, Villavicencio A & Padr贸 M (2014) , Lecture Notes in Computer Science (pp. 201-206). Springer International Publishing
  • Parente MA, Villavicencio A, Siqueira M, Chen P & Tonietto L (2013) The Lexical Bootstrapping Hypothesis and conventionality: A crosslinguistic study on verb acquisition by Chinese Mandarin- and Brazilian Portuguese-speaking children, Lexical Bootstrapping the Role of Lexis and Semantics in Child Language Development (pp. 73-97).
  • Poibeau T, Villavicencio A, Korhonen A & Alishahi A (2013) , Theory and Applications of Natural Language Processing (pp. 1-25). Springer Berlin Heidelberg
  • Villavicencio A (2011) , Non鈥怲ransformational Syntax (pp. 404-442). Wiley
  • Arnold D, Sadler L & Villavicencio A (2008) Portuguese: Corpora, coordination and agreement, Roots Linguistics in Search of Its Evidential Base (pp. 9-28).
  • Villavicencio A (2006) , Syntax and Semantics of Prepositions (pp. 115-130). Springer Nature
  • Ramisch C & Villavicencio A () , The Oxford Handbook of Computational Linguistics 2nd edition Oxford University Press (OUP)

Conference proceedings

  • Pickard T, Villavicencio A, Mi M, He W, Phelps D & Idiart M (2025) SemEval-2025 Task 1: AdMIRe - Advancing Multimodal Idiomaticity Representation. Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025) (pp 2597-2609). Vienna, Austria, 31 July 2025 - 31 July 2025.
  • Richardson FL, Villavicencio A & Menezes R (2025) . Proceedings of the 40th ACM/SIGAPP Symposium on Applied Computing (pp 920-927)
  • Mi M, Villavicencio A & Moosavi NS (2025) . Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 7314-7332), July 2025 - July 2025.
  • Mi M, Villavicencio A & Moosavi NS (2025) . Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (pp 34316-34329), November 2025 - November 2025.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) An empirical study on cross-lingual vocabulary adaptation for efficient language model inference. Findings of the Association for Computational Linguistics: EMNLP 2024 (pp 6760-6785). Miami, Florida, USA, 12 November 2024 - 12 November 2024.
  • Phelps D, Pickard T, Mi M, Gow-Smith E & Villavicencio A (2024) Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection. Joint Workshop on Multiword Expressions and Universal Dependencies Mwe Ud 2024 at Lrec Coling 2024 Workshop Proceedings (pp 178-187)
  • He W, Idiart M, Scarton C & Villavicencio A (2024) Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss.. ACL (Findings) (pp 12473-12485)
  • Gibbons M, Mi M, Song X & Villavicencio A (2024) . Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) (pp 1860-1867), June 2024 - June 2024.
  • Zhao K, Yang B, Lin C, Rong W, Villavicencio A & Cui X (2023) . Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 562-574), July 2023 - July 2023.
  • Peng B, Wu C, He W, Thorne W, Villavicencio A, Wang Y & Paes A (2023) FLYPE: Multitask prompt tuning for multimodal human understanding of social media. MUWS 2023: Multimodal Human Understanding for the Web and Social Media 2023: Proceedings of the 2nd International Workshop on Multimodal Human Understanding for the Web and Social Media co-located with the 32nd ACM International Conference on Information, Vol. 3566 (pp 18-33). Birmingham, United Kingdom, 22 October 2023 - 22 October 2023.
  • Madabushi HT, Gow-Smith E, Garcia M, Scarton C, Idiart M & Villavicencio A (2022) SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
  • Phelps D, Fan X-R, Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A (2022) Sample Efficient Approaches for Idiomaticity Detection. Proceedings of the 18th Workshop on Multiword Expressions (MWE 2022)
  • Bigoulaeva I, Singh Sachdeva R, Tayyar Madabushi H, Villavicencio A & Gurevych I (2022) . Proceedings of the 3rd Workshop on Figurative Language Processing (FLP) (pp 54-60), December 2022 - December 2022.
  • Gow-Smith E, Tayyar Madabushi H, Scarton C & Villavicencio A (2022) . Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (pp 11430-11443), December 2022 - December 2022.
  • Boito MZ, Yusuf B, Ondel L, Villavicencio A & Besacier L (2022) Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. 1st Annual Meeting of the Elra ISCA Special Interest Group on Under Resourced Languages Sigul 2022 Held in Conjunction with the International Conference on Language Resources and Evaluation Lrec 2022 Proceedings (pp 1-9)
  • Muresan S, Nakov P & Villavicencio A (2022) Message from the Program Chairs. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp vii-xi)
  • Muresan S, Nakov P & Villavicencio A (2022) Message from the Program Chairs. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 1 (pp vii-xi)
  • Phelps D, Fan XR, Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A (2022) Sample Efficient Approaches for Idiomaticity Detection. Lrec 2022 Workshop Language Resources and Evaluation Conference 18th Workshop on Multiword Expressions Mwe 2022 Proceedings (pp 105-111)
  • Madabushi HT, Gow-Smith E, Garc铆a M, Scarton C, Idiart M & Villavicencio A (2022) SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding.. SemEval@NAACL (pp 107-121)
  • Boito MZ, Yusuf B, Ondel L, Villavicencio A & Besacier L (2021) Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. Proceedings of 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022), 24 June 2022 - 25 June 2022.
  • Garcia M, Kramer Vieira T, Scarton C, Idiart MAP & Villavicencio A (2021) . Proceedings of ACL-IJCNLP 2021 (pp 2730-2741). Bangkok, Thailand, 1 August 2021 - 1 August 2021.
  • Vickers P, Wainwright R, Tayyar Madabushi H & Villavicencio A (2021) . Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2021) (pp 125-133). Mexico City, Mexico, 10 June 2021 - 10 June 2021.
  • Garcia M, Vieira TK, Scarton C, Idiart M & Villavicencio A (2021) Probing for idiomaticity in vector space models. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (pp 3551-3564). Virtual conference, 19 April 2021 - 19 April 2021.
  • Villavicencio A (2021) What if the whole is greater than the sum of the parts? Modelling Complex (Multiword) Expressions. Ceur Workshop Proceedings, Vol. 2944 (pp 1-10)
  • H眉rriyeto臒lu A, Tanev H, Zavarella V, Piskorski J, Yeniterzi R, Yuret D & Villavicencio A (2021) . Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021) (pp 1-9), August 2021 - August 2021.
  • Tayyar Madabushi H, Gow-Smith E, Scarton C & Villavicencio A (2021) . Findings of the Association for Computational Linguistics: EMNLP 2021. Punta Cana, Dominican Republic, 7 November 2021 - 11 November 2021.
  • Madabushi HT, Gow-Smith E, Scarton C & Villavicencio A (2021) AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models.. EMNLP (Findings) (pp 3464-3477)
  • Boito MZ, Villavicencio A & Besacier L (2020) Investigating language impact in bilingual approaches for computational language documentation. 1st Joint Workshop of Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages (SLTU-CCURL 2020) (pp 79-87). Marseille, France, 10 May 2020 - 10 May 2020.
  • Gamallo P, Garcia M, Mart铆n-Rodilla P, Pereira-Fari帽a M, Real L, Tonelli S, Quaresma P, Vieira R, Dias G, Oostdijk N , Villavicencio A et al (2020) Preface. Ceur Workshop Proceedings, Vol. 2693
  • Boito MZ, Villavicencio A & Besacier L (2019) . Interspeech 2019 - Proceedings of the Annual Conference of the International Speech Communication Association (pp 2688-2692). Graz, Austria, 15 September 2019 - 15 September 2019.
  • Villavicencio A (2019) . Proceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019). Florence, Italy, 2 August 2019 - 2 August 2019.
  • Villavicencio A & Bansal M (2019) Introduction. Conll 2019 23rd Conference on Computational Natural Language Learning Proceedings of the Conference (pp iii-iv)
  • Wagner Filho JA, Wilkens R, Idiart M & Villavicencio A (2019) The brWaC corpus: A new open resource for Brazilian Portuguese. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp 4339-4344). Miyazaki, Japan, 7 May 2018 - 7 May 2018.
  • Godard P, Boito MZ, Ondel L, Berard A, Yvon F, Villavicencio A & Besacier L (2018) . Proceedings of Interspeech 2018 (pp 2678-2682). Hyderabad, India, 2 September 2018 - 2 September 2018.
  • Zanon Boito M, Anastasopoulos A, Lekakou M, Villavicencio A & Besacier L (2018) . The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages. Gurugram, India, 29 August 2018 - 29 August 2018.
  • Ramisch C, Ramisch R, Zilio L, Villavicencio A & Cordeiro S (2018) . Computational Processing of the Portuguese Language (pp 24-34). Canela, Brazil, 24 September 2018 - 26 September 2018.
  • Salle A & Villavicencio A (2018) . ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, Vol. 2 (pp 8-13)
  • Paula F, Wilkens R, Idiart M & Villavicencio A (2018) . Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), 1 June 2018 - 6 June 2018.
  • Boito MZ, B茅rard A, Villavicencio A & Besacier L (2018) . 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 458-465). Okinawa, Japan, 16 December 2017 - 16 December 2017.
  • Wilkens R, Zilio L, Cordeiro S, Paula FSF, Ramisch C, Idiart M & Villavicencio A (2017) LexSubNC: A dataset of lexical substitution for nominal compounds. 12th International Conference on Computational Semantics Iwcs 2017 Short Papers
  • Cordeiro S, Ramisch C, Idiart M & Villavicencio A (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1: Long papers(-) (pp 1986-1997). Berlin, Germany, 7 August 2016 - 12 August 2016.
  • Salle A, Villavicencio A & Idiart M (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp 419-424), 7 August 2016 - 12 August 2016.
  • Wilkens R, Zilio L, Ferreira E & Villavicencio A (2016) (pp 333-339)
  • Zilio L, Wilkens R, M枚llmann L, Wehrli E, Cordeiro S & Villavicencio A (2016) (pp 233-238)
  • Filho JAW, Wilkens R, Zilio L, Idiart M & Villavicencio A (2016) (pp 306-318)
  • Zilio L, Finatto MJB & Villavicencio A (2016) Verblexpor: A lexical resource with semantic roles for Portuguese. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp 2656-2661). Portoro啪, Slovenia, 23 May 2016 - 23 May 2016.
  • Wilkens R, Idiart M & Villavicencio A (2016) Multiword expressions in child language. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp 2307-2311). Portoro啪, Slovenia, 23 May 2016 - 23 May 2016.
  • Ramisch C, Cordeiro S, Zilio L, Idiart M & Villavicencio A (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), August 2016 - August 2016.
  • Ramisch C, Cordeiro S & Villavicencio A (2016) . Proceedings of the 12th Workshop on Multiword Expressions. Berlin, Germany, 11 August 2016 - 11 August 2016.
  • Cordeiro SR, Ramisch C & Villavicencio A (2016) . Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016) (pp 910-917). San Diego, California, 16 June 2016 - 16 June 2016.
  • Wilkens R, Zilio L, Ferreira E & Villavicencio A (2016) B2SG: a TOEFL-like task for Portuguese. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp 3659-3662). Portoro啪, Slovenia, 23 May 2016 - 23 May 2016.
  • Cordeiro S, Ramisch C & Villavicencio A (2016) mwetoolkit+sem: Integrating word embeddings in the mwetoolkit for semantic MWE processing. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp 1221-1225). Portoro啪, Slovenia, 23 May 2016 - 23 May 2016.
  • Scheller Boos RA, Prestes KV & Villavicencio A (2014) Identification of multiword expressions in the brWaC. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) (pp 728-735). Reykjavik, Iceland, 26 May 2014 - 26 May 2014.
  • Padr贸 M, Idiart M, Ramisch C & Villavicencio A (2014) . Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp 419-424). Doha, Qatar, 25 October 2014 - 25 October 2014.
  • Laranjeira BR, Moreira VP, Villavicencio A, Ramisch C & Finatto MJ (2014) Comparing the quality of focused crawlers and of the translation resources obtained from them. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) (pp 3572-3578). Reykjavik, Iceland, 26 May 2014 - 26 May 2014.
  • Padro M, Idiart M, Villavicencio A & Ramisch C (2014) Comparing similarity measures for distributional thesauri. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) (pp 2964-2971). Reykjavik, Iceland, 26 May 2014 - 26 May 2014.
  • (2014)
  • Villavicencio A, Idiart M, Berwick R & Malioutov I (2013) Language acquisition and probabilistic models : keeping it simple. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Vol. Vol 1: Long Papers (pp 1321-1330). Sofia, Bulgaria, 4 August 2013 - 4 August 2013.
  • Kordoni V, Ramisch C & Villavicencio A (2013) Introduction. Proceedings of the 9th Workshop on Multiword Expressions Mwe 2013 in Conjunction with the 2013 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Naacl Hlt 2013 (pp III-IV)
  • Villavicencio A, Yankama B, Idiart MAP & Berwick R (2012) A large scale annotated child language construction database. Proceedings of the 8th International Conference on Language Resources and Evaluation Lrec 2012 (pp 2370-2374)
  • Gon莽alves G, Wilkens R & Villavicencio A (2011) Semi-automatic acquisition system of ontologies. Ceur Workshop Proceedings, Vol. 776 (pp 189-194)
  • Prestes K, Wilkens R, Zillio L & Villavicencio A (2011) Extraction and validation of ontologies from digital resources. Ceur Workshop Proceedings, Vol. 776 (pp 183-188)
  • Acosta OC, Villavicencio A & Moreira VP (2011) Identification and treatment of multiword expressions applied to information retrieval. Workshop on Multiword Expressions from Parsing and Generation to the Real World Mwe 2011 at the 49th Annual Meeting of the Association for Computational Linguistics Human Language Technologies Acl Hlt 2011 Proceedings (pp 101-109)
  • De Araujo V, Ramisch C & Villavicencio A (2011) Fast and flexible MWE candidate generation with the MWE toolkit. Workshop on Multiword Expressions from Parsing and Generation to the Real World Mwe 2011 at the 49th Annual Meeting of the Association for Computational Linguistics Human Language Technologies Acl Hlt 2011 Proceedings (pp 134-136)
  • Duran MS, Ramisch C, Alu铆sio SM & Villavicencio A (2011) Identifying and analyzing Brazilian portuguese complex predicates. Workshop on Multiword Expressions from Parsing and Generation to the Real World Mwe 2011 at the 49th Annual Meeting of the Association for Computational Linguistics Human Language Technologies Acl Hlt 2011 Proceedings (pp 74-82)
  • Kordoni V, Ramisch C & Villavicencio A (2011) Introduction. Workshop on Multiword Expressions from Parsing and Generation to the Real World Mwe 2011 at the 49th Annual Meeting of the Association for Computational Linguistics Human Language Technologies Acl Hlt 2011 Proceedings (pp III-IV)
  • Ramisch C, de Medeiros Caseli H, Villavicencio A, Machado A & Finatto MJ (2010) (pp 65-74)
  • Wilkens R & Villavicencio A (2010) (pp 173-182)
  • Ramisch C, Villavicencio A & Boitet C (2010) Web-based and combined language models: A case study on noun compound identification. Coling 2010 23rd International Conference on Computational Linguistics Proceedings of the Conference, Vol. 2 (pp 1041-1049)
  • Ramisch C, Villavicencio A & Boitet C (2010) Multiword expressions in the wild? the mwetoolkit comes in handy. Coling 2010 23rd International Conference on Computational Linguistics Proceedings of the Conference, Vol. 2 (pp 57-60)
  • Wilkens R, Villavicencio A, Muller D, Wives L, De Silva F & Loh S (2010) COMUNICA - A question answering system for brazilian portuguese. Coling 2010 23rd International Conference on Computational Linguistics Proceedings of the Conference, Vol. 2 (pp 21-24)
  • Germann DC, Villavicencio A & Siqueira M (2010) An investigation on the influence of frequency on the lexical organization of verbs. Acl 2010 Textgraphs 2010 2010 Workshop on Graph Based Methods for Natural Language Processing Proceedings of the Workshop (pp 19-23)
  • Ramisch C, Villavicencio A & Boitet C (2010) Mwetoolkit: A framework for multiword expression identification. Proceedings of the 7th International Conference on Language Resources and Evaluation Lrec 2010 (pp 662-669)
  • Germann DC, Villavicencio A & Siqueira M (2010) An investigation on the influence of frequency on the lexical organization of verbs. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 19-23)
  • Linardaki E, Ramisch C, Villavicencio A & Fotopoulou A (2010) Towards the Construction of Language Resources for Greek Multiword Expressions: Extraction and Evaluation. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (pp H31-H40)
  • Caseli HDM, Villavicencio A, Machado A & Finatto MJ (2009) . Proceedings of the Workshop on Multiword Expressions Identification, Interpretation, Disambiguation and Applications - MWE '09 (pp 1-1), 6 August 2009 - 6 August 2009.
  • Villavicencio A, de Medeiros Caseli H & Machado A (2009) . 2009 Seventh Brazilian Symposium in Information and Human Language Technology (pp 27-35), 8 September 2009 - 11 September 2009.
  • (2009) . 2009 Seventh Brazilian Symposium in Information and Human Language Technology (pp viii-ix), 8 September 2009 - 11 September 2009.
  • Ramisch C, Villavicencio A, Moura L & Idiart M (2008) . Proceedings of the Twelfth Conference on Computational Natural Language Learning - CoNLL '08 (pp 49-49), 16 August 2008 - 17 August 2008.
  • Ramisch C, Villavicencio A, Moura L & Idiart M (2008) Picking them up and figuring them out: Verb-particle constructions, noise and idiomaticity. Conll 2008 Proceedings of the Twelfth Conference on Computational Natural Language Learning (pp 49-56)
  • Acosta OC, Geraldo AP, Orengo VM & Villavicencio A (2008) UFRGS@CLEF2008: Indexing Multiword Expressions for Information Retrieval. Ceur Workshop Proceedings, Vol. 1174
  • Villavicencio A, Kordoni V, Zhang Y, Idiart M & Ramisch C (2007) Validation and evaluation of automatically acquired multiword expressions for grammar engineering. Emnlp Conll 2007 Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (pp 1034-1043)
  • Zhang Y, Kordoni V, Villavicencio A & Idiart M (2006) . Proceedings of the Workshop on Multiword Expressions Identifying and Exploiting Underlying Properties - MWE '06 (pp 36-36), 23 July 2006 - 23 July 2006.
  • Villavicencio A, Copestake A, Waldron B & Lambeau F (2004) . Proceedings of the Workshop on Multiword Expressions Integrating Processing - MWE '04 (pp 80-87), 26 July 2004 - 26 July 2004.
  • Villavicencio A, Baldwin T & Waldron B (2004) A multilingual database of idioms. Proceedings of the 4th International Conference on Language Resources and Evaluation Lrec 2004 (pp 1127-1130)
  • Copestake A, Lambeau F, Villavicencio A, Bond F, Baldwin T, Sag IA & Flickinger D (2002) Multiword expressions: Linguistic precision and reusability. Proceedings of the 3rd International Conference on Language Resources and Evaluation Lrec 2002 (pp 1941-1947)
  • Baldwin T & Villavicencio A (2002) Extracting the Unextractable: A Case Study on Verb-particles. Proceedings of the Annual Meeting of the Association for Computational Linguistics
  • Villavicencio A (2002) Learning to Distinguish PP Arguments from Adjuncts. Proceedings of the Annual Meeting of the Association for Computational Linguistics
  • Villavicencio A (2000) The acquisition of word order by a computational learning system. Proceedings of the 4th Conference on Computational Natural Language Learning Conll 2000 and of the 2nd Learning Language in Logic Workshop Lll 2000 Held in Cooperation with Icgi 2000 (pp 209-218)
  • Villavicencio A (1999) Representing a system of lexical types using default unification. 9th Conference of the European Chapter of the Association for Computational Linguistics Eacl 1999 (pp 261-264)
  • McFetridge P & Villavicencio A (1995) (pp 302-311)
  • Villavicencio A, Lopes JGP, Marques NMC & Villavicencio F (1995) (pp 323-332)
  • Paula F, Wilkens R, Idiart M & Villavicencio A () . LatinX in AI at Neural Information Processing Systems Conference 2018

Preprints

  • Hoff L, Soroka G, Guimar茫es M, Villavicencio A & Idiart M (2026) Formation of Artificial Neural Assemblies by Biologically Plausible Inhibition Mechanisms, arXiv.
  • Phelps D, Wilkens R, Gow-Smith E, Hubner L, Malcorra BR, Renn脙鲁-Costa CS, Idiart M, Villa-Uriol M-C & Villavicencio A (2025) Beyond surface form: A pipeline for semantic analysis in Alzheimer's Disease detection from spontaneous speech, arXiv.
  • Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates, arXiv.
  • Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) , arXiv.
  • Mi M, Villavicencio A & Moosavi NS (2025) From Input Perception to Predictive Insight: Modeling Model Blind Spots Before They Become Errors, arXiv.
  • Phelps D, Wilkens R, Gow-Smith E, Pickard T, Mi M & Villavicencio A (2025) , arXiv.
  • Mi M, Villavicencio A & Moosavi NS (2025) , arXiv.
  • He W, Vieira TK, Garcia M, Scarton C, Idiart M & Villavicencio A (2024) , arXiv.
  • Ribeiro M, Malcorra B, Mota NB, Wilkens R, Villavicencio A, Hubner LC & Renn贸-Costa C (2024) , arXiv.
  • He W, Idiart M, Scarton C & Villavicencio A (2024) , arXiv.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) , arXiv.
  • Phelps D, Pickard T, Mi M, Gow-Smith E & Villavicencio A (2024) , arXiv.
  • Knietaite A, Allsebrook A, Minkov A, Tomaszewski A, Slinko N, Johnson R, Pickard T, Phelps D & Villavicencio A (2024) , arXiv.
  • Gow-Smith E, Phelps D, Madabushi HT, Scarton C & Villavicencio A (2024) , arXiv.
  • Wilkens R, Zilio L & Villavicencio A (2023) , arXiv.
  • Bigoulaeva I, Sachdeva R, Madabushi HT, Villavicencio A & Gurevych I (2022) , arXiv.
  • Madabushi HT, Gow-Smith E, Scarton C & Villavicencio A (2021) AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models.
  • Soares F (2020) , Cold Spring Harbor Laboratory.
  • Boito MZ, Villavicencio A & Besacier L (2020) , arXiv.
  • Boito MZ, Villavicencio A & Besacier L (2019) , arXiv.
  • Boito MZ, Villavicencio A & Besacier L (2019) , arXiv.
  • Boito MZ, Anastasopoulos A, Lekakou M, Villavicencio A & Besacier L (2018) , arXiv.
  • Godard P, Zanon-Boito M, Ondel L, Berard A, Yvon F, Villavicencio A & Besacier L (2018) , arXiv.
  • Boito MZ, Berard A, Villavicencio A & Besacier L (2017) , arXiv.
  • Salle A, Idiart M & Villavicencio A (2016) , arXiv.
  • Salle A, Idiart M & Villavicencio A (2016) , arXiv.
Grants
  • Atrium: , Horizon Europe, 01/2024 - 12/2027, 拢370,950, as PI
  • , EPSRC, 12/2020 - 11/2024, 拢446,163, as PI
  • Modelling the link between working memory and language deficits in schizophrenia, Royal Society, 12/2020 - 09/2024, 拢74,000, as Co-PI
Professional activities and memberships

Some of her recent activities include being the PC co-chair of the Conference on Computational Natural Language Learning (CoNLL-2019), Area Chair for events like ACL-2019, , , and General co-chair for the  (PROPOR 2018).

She is a member of the advisory board of WiNLP, of the editorial board of TACL, JNLE, Journal of Language Modelling and Linguamatica, and a reviewer for various conferences, in addition to having co-chaired numerous *ACL workshops on Cognitive Aspects of Computational Language Acquisition and on Multiword Expressions. She has also co-edited special issues and books dedicated to these topics.

She is a member of the Natural Language Processing group at the 爆料TV and of the  of the  (Brazil).