Summary of PhD Achievements
Research Activity
During my PhD program I have been involved in the following projects:
• Social sensing for breaking news – SMART NEWS
A 2-year project (2016-2018) funded by Regione Toscana (BANDO FAR-FAS 2014), in collaboration with the IT company Hyperborea s.r.l., the Web Application for the Future Internet (WAFI) Laboratory at the Institute of Informatics and Telematics (IIT), and the Multimedia Information Retrieval (MIR) group at the Institute of Information Science and Technologies (ISTI) of CNR Pisa.
• Voci della Grande Guerra
An 18-month project (2016-2018) funded by Presidenza Consiglio dei Ministri in the framework of the First World War Centenary events, in collaboration with the CoPhiLab of the Institute for Computational Linguistics A. Zampolli (ILC), the CoLing Lab of the Department of Philology, Literature, and Linguistics (University of Pisa), Accademia della Crusca, and the Interuniversity Center for Historical-Military Research (University of Siena). The project aims at building a corpus of different types of documents (letters, war bulletins, journals, diaries) to investigate how Italians perceived and narrated the First World War and how the war contributed to changing the Italian language. The project includes: (i) the digitization of the corpus; (ii) the development and application of NLP-based modules for event extraction and georeferencing of the war locations; (iii) the design and development of the Web search interface.
• SCRIBE – Short writings, Linguistic simplification, social inclusion: models and applications
A 3-year project (2013-2016) funded by the Italian Ministry of Education, University and Research (PRIN 2010FWM3B4 Area 10). Project partners: Università di Tor Vergata, Università L’Orientale di Napoli, Università Roma Tre, Università di Macerata, Università di Pisa, ILC-CNR. The project studies, from both a synchronic and a diachronic perspective, the production of synthetic and shortened messages, from its contemporary expressions (short writings used in e-mails, SMS and chats) to the other abbreviation strategies peculiar to Italian and dialectal graphic and linguistic systems. The goal of the ItaliaNLP Lab is to develop advanced computational linguistics methods for the analysis of these varieties of the Italian language.
• iSLe – intelligent Semantic Liquid eBook
A 2-year project funded by Regione Toscana (POR CReO 2007-2013) in collaboration with IT companies (M.E.T.A SRL, 01Servizi SRL, VIDITRUST SRL, SPACE SPA). The aim of the project is to develop an innovative software platform for digital educational publishing augmented with NLP-based functionalities for knowledge management and readability assessment.
Awards
• Achieved 1st place in the PoSTWITA (PoS tagging for Italian Social Media Text) task in the context of EVALITA 2016, the evaluation campaign of Natural Language Processing and Speech tools for Italian.
• Achieved 2nd place in the Sentipolc subjectivity classification task in the context of EVALITA 2016, the evaluation campaign of Natural Language Processing and Speech tools for Italian.
• Achieved 3rd place in the Sentipolc sentiment polarity classification task in the context of EVALITA 2016, the evaluation campaign of Natural Language Processing and Speech tools for Italian.
• Achieved 2nd place in the Sentipolc sentiment polarity classification task in the context of EVALITA 2014, the evaluation campaign of Natural Language Processing and Speech tools for Italian.
Publications
International Journals
1. Cimino A., Chiarello F., Dell’Orletta F., Fantoni G. (2016) “Automatic Advantages and Drawbacks Extraction From Patents”. Scientometrics (Under review), Elsevier.
2. Cimino A., Chiarello F., Dell’Orletta F., Fantoni G. (2016) “Automatic Users Extraction From Patents”. World Patent Information (Under review), Elsevier.
International Conferences/Workshops with Peer Review
1. Brunato D., Cimino A., Dell’Orletta F., Venturi G. (2016) “CSAS-IT: A Parallel Corpus of Complex-Simple Sentences for Automatic Text Simplification”. In Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2016), 1-5 November, Austin, Texas, USA. (forthcoming)
2. Richter S., Cimino A., Dell’Orletta F., Venturi G. (2015) “Tracking the Evolution of Written Language Competence: an NLP-based Approach”. In Proceedings of the Second Italian Conference on Computational Linguistics (CLiC-it), 3-4 December, Trento, Italy, pp. 236-240.
3. Cresci S., Cimino A., Dell’Orletta F., Tesconi M. (2015) “Crisis Mapping during Natural Disasters via Text Analysis of Social Media Messages”. In Proceedings of 16th International Conference on Web Information System Engineering (WISE 2015), 1-3 November, Miami, Florida, USA.
4. Cresci S., Tesconi M., Cimino A., Dell’Orletta F. (2015) “A Linguistically-driven Approach to Cross-Event Damage Assessment of Natural Disasters from Social Media Messages”. In Proceedings of the 24th International Conference Companion on World Wide Web (WWW15 Companion) ACM, 18 May, Florence, Italy.
5. Cimino A., Cresci S., Dell’Orletta F., Tesconi M. (2014) “Linguistically-motivated and Lexicon Features for Sentiment Analysis of Italian Tweets”. In Proceedings of 4th Evaluation of NLP and Speech Tools for Italian (EVALITA 2014), 11 December, Pisa, Italy.
6. Dell’Orletta F., Wieling M., Cimino A., Venturi G., Montemagni S. (2014) “Assessing the Readability of Sentences: Which Corpora and Features?”. In Proceedings of 9th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2014), 26 June, Baltimore, Maryland, USA.
7. Dell’Orletta F., Venturi G., Cimino A., Montemagni S. (2014) “T2K2: a System for Automatically Extracting and Organizing Knowledge from Texts”. In Proceedings of 9th Edition of International Conference on Language Resources and Evaluation (LREC 2014), 26-31 May, Reykjavik, Iceland.
8. Boschetti F., Cimino A., Dell’Orletta F., Lebani G.E., Passaro L., Picchi P., Venturi G., Montemagni S., Lenci A. (2014) “Computational Analysis of Historical Documents: An Application to Italian War Bulletins in World War I and II”. In Proceedings of the Workshop on Language resources and technologies for processing and linking historical documents and archives - Deploying Linked Open Data in Cultural Heritage, LREC 2014, 26 May, Reykjavik, Iceland.
9. Cimino A., Dell’Orletta F., Venturi G., Montemagni S. (2013) “Linguistic Profiling based on General-purpose Features and Native Language Identification”. In Proceedings of Eighth Workshop on Innovative Use of NLP for Building Educational Applications, Atlanta, Georgia, June 13, pp. 207-215.