Prof. dr. Dan Tufiş
About
teaching activities,
invited conferences & tutorials,
PhD students,
membership in PhD awarding committees
Teaching activities
- Data Driven and Machine Learning based NLP, 1st year, Master in Artificial Intelligence, Faculty of Automatic Control and Computers, Politehnica University, Bucharest
- Introduction to Computational Linguistics, 1st year, Master in Computational Linguistics, Faculty of Computer Science, Iasi
- Corpus Linguistics, 1st year, Master in Computational Linguistics, Faculty of Computer Science, Iasi
- Machine Translation, 2nd year, Master in Computational Linguistics, Faculty of Computer Science, Iasi
- AI methods and technologies in MT, ICIA Doctoral School
- WSD in NLP; applications, ICIA Doctoral School

Invited conferences & tutorials
- May 14-15, 2010 (invited conference) Multi-factor Lexical Mark-up of Connotation in a WordNet Lexical Ontology, Categorizing Human Experience: Classification in Languages and Knowledge Systems, Ecole Normale Supérieure (ENS), Paris, France
- January 18, 2010 (invited conference) A differential semantics method for annotating the
WordNet synsets, Max Plank Institute, Neijmegen, The Netherlands
- October 15, 2009 (invited conference) Going Beyond Word-form Factored Statistical Machine
Translation, Institute of Bulgarian Language, Bulgarian Academy of Sciences, Sofia, Bulgaria
- June 18-21, 2009 (invited conference, with Alexandru Ceauşu) Factored Phrase-Based Statistical
Machine Translation, 5th International Conference on Speech and Human-Computer Dialogue (SPED 2009), Constanţa, Romania
- March 16-19, 2009 (vicepresident) Annual meeting of the CLARIN Scientific Board, (member) Annual meetings of the CLARIN Strategic Coordination and Executive Boards, Oxford, UK
- February 11-14, 2009 (invited conference) Going for a Hunt? Don't Forget the Bullets!, FLaReNet Launching Event: The European Language Resources and Technologies Forum:
Shaping the Future of the Multilingual Digital Europe, Vienna, Austria
- July 10-12, 2008 (invited talk) Paradigmatic Morphology and Subjectivity Mark-up in the RO-WordNet Lexical Ontology, European Conference on Intelligent Systems and Technologies (ECIT 2008), Iasi, Romania
- May 15-17, 2008 (invited talk) Mind Your Words! You Might Convey What You Wouldn't Like To, Exploratory Workshop on NL-Computation: From Natural Language to Soft Computing: New Paradigms in Artificial Intelligence, Felix Spa, Romania
- February 29, 2008 (invited talk) Sisteme de întrebare-răspuns în limbaj natural pentru spaţii de căutare deschise, Seminar Internaţional Instrumente pentru asistarea traducerii, Academia Română, Bucureşti
- September 15-27, 2007 (tutorial) Exploiting Multilinguality in Developing Training Data for Statistics-Based NLP, NATO Advanced Study Institute on Advances in Language Engineering for Low- and Middle-Density Languages, Batumi, Georgia
- September 10, 2007 Tehnologii lingvistice pentru limba română, Tehnologiile lingvistice, prioritate a prezentului şi necesitate a viitorului: contribuţii, direcţii de acţiune şi proiecte româneşti, Academia Română, Bucureşti
- July 23 - August 3, 2007 (tutorial with Radu Ion) Cross lingual and cross cultural textual encoding of opinions and sentiments, EUROLAN 2007 Summer School on NLP & HLT: Semantics, Opinion and Sentiment in Text, Iasi, Romania
- April 20, 2007 (invited talk) Multilingual Technologies and Required Linguistic Resources to Support Mini-Minority Languages, Norwegian University of Science and Technology, Trondheim, Norway
- October 18-20, 2006 (invited talk) Cross-Lingual Knowledge Induction from Parallel Corpora, Fifth International Conference Formal Approaches to South Slavic and Balkan Languages, Sofia, Bulgaria
- October 14, 2006 (invited talk) An Outlook of the Ongoing Projects at RACAI, LT4eL general project meeting, Karol University, Prague, Czech Republic
- September 20, 2006 (invited talk) Recent advances in language technology in Romania: a view form the Research Institute for Artificial Intelligence of the Romanian Academy, CNRS Laboratoire d'Informatique Fondamentale (LIF) Marseille, France
- June 7-9, 2006 (invited talk) Word Senses: The Stepping Stones in Semantic-Based Natural Language Processing, IFIP Section on Semantics in Multimedia Analysis and Natural Language Processing, Athens, Greece
- June 1-3, 2006 (invited talk) Robust statistical translation models: The case for word alignment, International Conference on Computers, Communications & Control , Felix Spa, Romania
- May 27, 2006 (invited talk) Tagset Design for High Accuracy POS Tagging and Automatically Building Mappings between Arbitrary Tagsets, LREC 2006 workshop on Annotation Science: State of the Art in Enhancing Automatic Linguistic Annotation, Genoa, Italy
- April 3, 2006 (invited talk) Cross-lingual and cross-corpora knowledge induction: RACAI experience, EACL 2006 workshop on Cross-Language Knowledge Induction, Trento, Italy
- November 3, 2005 (lucrare invitată, cu Alexandru Ceauşu, Dan Ştefănescu) Resurse şi instrumente lingvistice dezvoltate la ICIA, ConsILR-2005 - Atelier internaţional Resurse Lingvistice Româneşti şi Instrumente pentru Prelucrarea Limbii Române, Iaşi, Romania
- October 24-27, 2005 (invited talk) Reifying the Alignments, AZBUKY-Net International Conference, Sofia, Bulgaria
- September 26-27, 2005 (invited talk, with Radu Ion, Verginica Barbu Mititelu) Word sense disambiguation and annotation transfer in parallel text, JRC Enlargement and Integration Workshop: Exploiting parallel corpora in up to 20 languages, Arona, Italy
- September 26-27, 2005 (invited talk, with Alexandru Ceauşu, Radu Ion, Dan Ştefănescu) An integrated platform for high-accuracy word alignment, JRC Enlargement and Integration Workshop: Exploiting parallel corpora in up to 20 languages, Arona, Italy
- July 25 - August 6, 2005 (tutorial with Nancy Ide) Word Senses and Cross-lingual Word Sense Disambiguation, EUROLAN 2005 Summer School on NLP & HLT: The Multilingual Web: Resources, Technologies, and Prospects, Cluj Napoca, Romania
- June 1-2, 2005 (invited talk) Ontologies and Reified Alignments, KnowledgeWeb Workshop on Heterogeneity, Heraklion, Crete, Greece
- 17 noiembrie, 2004 (lucrare invitată) Traducerea termenilor în corpusuri paralele: extragerea automată şi verificarea consistenţei traducerilor, Al 3-lea Colocviu Naţional de Terminologie, Terminografie, Terminotică (3T), Bucureşti
- November 5, 2004 (invited talk) Term Mining in Parallel Corpora, Leeds University, UK
- November 4, 2004 (invited talk) Word sense disambiguation in parallel corpora using aligned wordnets, Sheffield University, UK
- 23-24 septembrie, 2004 (lucrare invitată) BalkaNet: ontologie lexicală multilingvă, Prima Conferinţă Naţională de Interacţiune Om-Calculator (ROCHI 2004), Bucureşti
- august, 2004 (lucrare invitată) Câteva probleme actuale ale prelucrării limbajului natural- modele şi soluţii pentru limba română, Conferinţa Societăţii de Matematică din Republica Moldova, Chişinău, Moldova
- 6 iulie, 2004 (lucrare invitată) Multilingvism şi Tehnologia Informaţiei: Premise ale educaţiei şi culturii în secolul XXI, Fundaţia Română pentru Ştiinţă şi Cultură, Bucureşti
- May 12-15, 2004 (invited talk) Aligning multilingual lexical ontologies, KnowledgeWeb General Meeting, Heraklion, Crete, Greece
- May 4, 2004 (invited talk) Automatic morpho-syntactic disambiguation: following the good practices, University of Sofia, Bulgaria
- January 20-23, 2004 (invited talk) BALKANET - A General Overview, 2nd Global WordNet Conference, Brno, Czech Republic
- December 3-6, 2003 (invited talk) Integration of Knowledge, Infrastructures and Resources, CHiME network of excellence set-up meeting, Florence, Italy
- June 25-25, 2003 (invited talk) High Performance Word-Alignment Algorithms, Vrije Universiteit Brussel, Belgium
- April 12, 2003 (tutorial, with Nancy Ide and Adam Kilgarriff) Word senses, disambiguation and parallel corpora, EACL 2003 - 10th Conference of the European Chapter of the Association for Computational Linguistics, Budapest, Hungary
- March 6-7, 2003 (invited talk) Term Discovery in a Multilingual Corpus, Vrije Universiteit Brussel, Belgium
- February 2-4, 2003 (invited talk) The TRANSEQ model, Xerox Research Europe, Grenoble, France
- November 23-27, 2002 (invited talk) Building a Romanian Wordnet; problems, solutions and prospects, Seminar fur Sprachwissenschaft Abt. Computerlinguistik, Eberhard Karls University, Tubingen, Germany
- November 23-27, 2002 (invited talk) Reconciling data sparseness and precision in using large tagsets for morpho-lexical disambiguation, Seminar fur Sprachwissenschaft Abt. Computerlinguistik, Eberhard Karls University, Tubingen, Germany
- July 30 - August 11, 2001 (tutorial) Corpus Based Lexical Knowledge Acquisition, EUROLAN 2001 Summer School on NLP & HLT: Creation and Exploitation of Annotated Language Resources, Iasi, Romania
- March 21, 2001 (invited talk) Automatic Extraction of Bilingual Lexicons from Parallel Corpora, Luis Pasteur University Strasbourg, France
- March 15, 2001 (invited talk) How Hard is Machine Translation? A New Approach to an Old Problem, UNESCO Paris, France
- February 14, 2001 (invited talk) Automatic Extraction of Multilingual Lexicons from Parallel Corpora, University of Geneva, Switzerland
- November 24, 2000 (invited talk) Statistical Methods in Morpho-Syntactic Tagging; Combining Language Classifiers, Potsdam University, Poland
- November 23, 2000 (invited talk) Corpus-Based Multilingual Lexicography, AI-LAB Agentscape, Berlin, Germany
- November 17, 2000 (invited talk) Statistical lexical knowledge extraction from parallel corpora, Calouste Gulbenkian Foundation, Lisbon, Portugal
- June 2-4, 2000 (tutorial) Experience in Accurate Tagging Using Large Tagsets, NLP 2000 - 2nd International Conference on Natural Language Processing, Patras University, Greece
- May, 2000 (invited talk) Frequency based selection of head-words in explanatory dictionaries, CONCEDE Project Meeting, Athens, Greece
- April 14, 2000 (invited talk) Modelarea lingvistică învinge algoritmul, Academia de Ştiinţe Economice Chişinău, Moldova
- March 12, 2000 (invited talk) Experiments in lexical alignment of French-Romanian parallel texts, LIMSI-Université Paris Sud, France
- March 11, 2000 (invited talk) Tiered Tagging and Combined Classifiers, Université Paris 7, France
- February 22, 2000 (invited talk) Inductive desing of ''optimal'' tagsets, Research Institute for Linguistics of the Hungarian Academy of Sciences, Budapesta, Hungary
- December 1999 (invited talk) Converting a Dictionary from TEI2 dictionary into CONCEDE format, Vassar University College, USA
- December 1999 (invited talk) Computational Lexicographic Projects at RACAI, University of Pennsylvania, USA
- November 1999 (invited talk) High accuracy morpho-syntactic tagging with a large tagset, New Mexico State University, USA
- November 1999 (invited talk) Fine-grained and high accuracy morpho-syntactic disambiguation of arbitrary natural language texts, George Mason University, USA
- November 1999 (invited talk) High accuracy morpho-syntactic tagging with a large tagset, Southern Methodist University of Dallas, USA
- November 1999 (invited talk) Accurate morpho-syntactic tagging using large tagsets, UMIACS Computational Linguistics Colloquium Series, University of Maryland Institute for Advanced Computer Studies, USA
- July 19-31, 1999 (tutorial) Printed dictionaries, from lexical databases to lexical ontologies, EUROLAN 1999 Summer School on NLP & HLT: Lexical Semantics and Multilinguality, Iasi, Romania
- iulie 1999 (lucrare invitată) Lingvistica corpusului: lingvistica evidenţei, Secţia de Ştiinţa şi Tehnologia Informaţiei, Academia Română, Bucureşti
- June 1999 (invited talk) TT-CLAM: Why should one loose information when not necessary?, Research Institute for Linguistics of the Hungarian Academy of Sciences, Budapesta, Hungary
- November 1998 (invited talk) LINGUASTAT-Corpora Processing Platform, Information Society Technology Conference and Exhibition - IST'98, Vienna, Austria
- November 1998 (invited talk) EGLU - Generalized Environment for Unification Based Linguistics, Information Society Technology Conference and Exhibition - IST'98, Vienna, Austria
- November 1998 (invited talk) PAIL - Portable AI Laboratory, Information Society Technology Conference and Exhibition - IST'98, Vienna, Austria
- June 1, 1998 (invited panelist) Language Technology in Romania, Panel Discussion on International Cooperation in Language Technology, organized by European Commission DGXIII, Granada, Spain
- July 1997 (lucrare invitată) Tehnologia limbajului premisă a informatizării globale, Al doilea Seminar al Comisiei Europene ''Limbaj şi Tehnologie'', Tuşnad, Romania
- July 13-26, 1997 (tutorial) Corpus-based morpho-syntactic processing in a multilingual environment, EUROLAN 1997 Summer School on NLP & HLT: Corpus Linguistics, Tusnad, Romania
- September 1995 (invited demo) GULiveR: A Generalized Unification LR Parser, TELRI Conference, Tihany, Hungary
- September 1995 (invited demo) KRILL: A Knowledge Representation Interface to an Interlingual Natural Language Generator, TELRI Conference, Tihany, Hungary
- September 1995 (invited demo) Unification-Based Implementation of a Wide Coverage Romanian Morphology, TELRI Conference, Tihany, Hungary
- September 1995 (invited demo) Parsing Portable Laboratory, TELRI Conference, Tihany, Hungary
- July 19-29, 1993 (tutorial) Abduction as a unification process for understanding and generating language, and for machine translation, EUROLAN 1993 Summer School on NLP & HLT: Natural Language Processing and Logic Programming, Iasi, Romania

Former and current PhD students (Domain: Computer Science)
- Radu Ion (successfully defended Magna cum Laudae in 2007): Automatic Word Sense Disambiguation Techniques. Applications for Romanian and English
- Alexandru Ceauşu (successfully defended Cum Laudae in May 2009): Machine Translation Techniques and Their Applicability to Romanian as a Source Language
- Elena Irimia (successfully defended in May 2009): Example-Based Machine Translation Methods. Applications for English and Romanian
- Dan Ştefănescu (successfully defended Magna cum Laudae in February 2010): Intelligent Information Mining from Multilingual Corpora
- Nadia Luiza Huţuliac (successfully defended in December 2010): Naturalness of Artificial Language - Applications on Verbal Group Automatic Translation
- Corina Forăscu: Natural Language Processing Using Discourse Analysis. Applications for Romanian and English
- Tiberiu Boroş: Contribuţii la modelarea şi implementarea sistemelor de sinteză a vorbirii. Studiu de caz: limba română
- Iuliana Dobre (Aldescu): Contribuţii la elaborarea unui sistem de e-learning utilizând tehnologii de prelucrare a limbajului natural

Membership in PhD awarding committees
- September 2011: thesis Predication Driven Textual Entailment of Alexandru Mihai Moruz, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- June 2011: thesis Methods and Resources for Sentiment Analysis in Multilingual Documents of Different Text Types of Alexandra Balahur, University of Alicante, Spain
- March 2011: thesis Automated Processing of Natural Language of Ionuţ Pistol, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- December 2010 (as PhD advisor): thesis Naturaleţea limbajului artificial – studiu despre traducerea automată a grupului verbal of Nadia Luiza (Huţuliac) Dincă, Romanian Academy, Bucharest
- December 2010: thesis Interfeţe avansate pentru sisteme suport pentru decizii of Ana Maria Suduc, Romanian Academy, Bucharest
- December 2010: thesis Sisteme suport pentru decizii bazate pe comunicaţii of Mihai Bîzoi, Romanian Academy, Bucharest
- March 2010: thesis Natural Language Processing Using Semantic Frames of Diana Trandabăţ, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- February 2010 (as PhD advisor): thesis Intelligent Information Mining from Multilingual Corpora of Dan Ştefănescu, Romanian Academy, Bucharest
- December 2009: thesis Unităţile frazeologice. Abordare contrastivă franco- română. Aplicaţii pe corpus paralel of Maria Husarciuc, Faculty of Letters, University Al. I. Cuza of Iasi, Romania
- June 2009: thesis Improving Statistical Alignment and Translation Using Highly Multilingual Corpora of Camelia Ignat, Université de Strasbourg
- May 2009 (as PhD advisor): thesis Tehnici de traducere automată şi aplicabilitatea lor limbii române ca limbă sursă of Alexandru Ceauşu, Romanian Academy, Bucharest
- May 2009 (as PhD advisor): thesis Metode de traducere automată prin analogie. Aplicaţii pentru limbile română şi engleză of Elena Irimia, Romanian Academy, Bucharest
- March 2009: thesis Textual Entailment of Adrian Iftene, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- November 2008: thesis Formalisation des contraintes pragmatiques pour la génération des énoncés en dialogue homme-machine à plusieurs locuteurs of Vladimir Popescu, Laboratoire d'Informatique de Grenoble, France
- November 2008: thesis Analiza unor sisteme neliniare cu aplicaţii în prelucrarea semnalelor of Vasile Apopei, Institute for Computer Science, Romanian Academy, Iasi branch, Romania
- September 2008: thesis Ontology-Based Modeling and Recommendation Techniques for Adaptive Hypermedia Systems of Mihaela Brut, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- October 2007: ''Très honorable'' thesis Analyse syntaxique automatique du roumain of Ioana Milutinovici, Département de Linguistique, Université Blaise-Pascal, Clermont-Ferrand, France
- May 2007 (as PhD advisor): ''Magna Cum Laudae'' thesis Metode de dezambiguizare semantică automată. Aplicaţii pentru limbile engleză şi română of Radu Ion, Romanian Academy, Bucharest
- February 2007: ''Doctor Habilitat of Informatics'' thesis Interfeţe inteligente pentru sisteme de calcul simbolic of Svetlana Cojocaru, Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova
- August 2006: thesis Studies in Semantic Analysis of Text for Keyphrase Extraction of Srinivas Medimi, Faculty of Computer Science, Mumbai Institute of Technology, India
- October 2005: thesis Tehnici avansate de interogare şi extragere de informaţii din colecţii mari de documente of Luminiţa Chiran, Faculty of Computer Science, University Al. I. Cuza of Iasi, Romania
- April 2004: thesis Apprentissage de grammaires catégorielles pour simuler l'acquisition du langage naturel à l'aide d'informations sémantiques of Daniela Dudău Sofronie, Université des Sciences et Technologies de Lille, France
- October 2003: thesis Contributions in the Development of Informatics Systems based on Distributed Collaboration of Cristina Niculescu, Polytechnic University of Bucharest, Romania
- December 2002: thesis Contribuţii la descrierea gramaticii limbii române literare asistată de calculator of Ana-Maria Barbu, University of Craiova, Romania
- May 2002: thesis Estetica imaginii video-digitale of Marilena Preda Sânc, National University of Arts, Bucharest, Romania
- February 2001: thesis Influenţa limbii engleze asupra limbii române în terminologia informaticii of Radu-Nicolae Trif, Romanian Academy, Bucharest, Romania
- January 2001: thesis Semantic indexing for document retrieval systems of Amalia Todiraşcu, Institute for Computer Science, Romanian Academy, Iasi branch, Romania
- May 2000: thesis Contribuţii privind descrierea limbii române scrise ca sursă de informaţie of Adrian Mitrea, Polytechnic University of Bucharest, Romania
- April 1999: thesis Prelucrarea specificaţiilor formale cu aplicaţie în proiectarea asistată a programelor of Lorina Negreanu, Polytechnic University of Bucharest, Romania
- 1995: thesis O arhitectură şi teorie unitară pentru reprezentarea cunoaşterii şi inferenţe în Inteligenţa Artificială of Liviu Badea, Polytechnic University of Bucharest, Romania
- 1994: thesis Probleme de interogare în limbaj natural a bazelor de date of Dan Cristea, Polytechnic University of Bucharest, Romania
