Accepted Papers

10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, Portorož (Slovenia)

Held under the Honorary Patronage of His Excellency Mr. Borut Pahor, President of the Republic of Slovenia

Conference Venue & Travel

Submission

Registration

Accommodation & Tours

Accepted Papers

ID	Title	Authors
3	Comparison of emotional understanding in modality-controlled environments using multimodal online emotional communication corpus	Yoshiko Arimoto and Kazuo Okanoya
4	SUMMA: Deep Learning for NextGen Media Monitoring	Guntis Barzdins and Didzis Gosko
6	Exploring the Realization of Irony in Twitter Data	Cynthia Van Hee, Els Lefever and Veronique Hoste
9	IRIS: English-Irish Machine Translation System	Mihael Arcan, Caoilfhionn Lane, Eoin Ó Droighneáin and Paul Buitelaar
12	Towards Automatic Transcription of ILSE – an Interdisciplinary Longitudinal Study of Adult Development and Aging	Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schröder and Tanja Schultz
13	Speech Synthesis of Code-Mixed Text	Sunayana Sitaram and Alan W Black
15	Operational Assessment of Keyword Search on Oral History	Elizabeth Salesky, Jessica Ray and Wade Shen
16	Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech	Daniil Kocharov
17	Measuring lexical quality of a historical Finnish newspaper collection – analysis of garbled OCR data with basic language technology tools and means	Kimmo Kettunen and Tuula Pääkkönen
18	Wikipedia Titles As Noun Tag Predictors	Armin Hoenen
19	A corpus of images and text in online news	Laura Hollink, Adriatik Bedjeti, Martin van Harmelen and Desmond Elliott
28	Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction	Dain Kaplan, Neil Rubens and Simone Teufel
29	An Evaluation of Dialog Clarification in Speech-to-Speech Translation Systems	Audrey Tong, Kay Peterson, Lukas Diduch and Mark Przybocki
30	Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness	Dane Bell, Daniel Fried, Luwen Huangfu, Mihai Surdeanu and Stephen Kobourov
31	WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format	Marcus Klang and Pierre Nugues
32	Odin's Runes: A Rule Language for Information Extraction	Marco A. Valenzuela-Escárcega, Gus Hahn-Powell and Mihai Surdeanu
33	Linguistically Inspired Language Model Augmentation for MT	George Tambouratzis and Vasiliki Pouli
35	FABIOLE, a speech database for forensic speaker comparison	Ajili Moez, Jean-françois Bonastre, Juliette Kahn, Solange Rossato and Guillaume Bernard
37	Encoding adjective scales for fine-grained resources	Cédric Lopez, frederique segond and Christiane Fellbaum
38	Automatic Enrichment of WordNet with Common-Sense Knowledge	Luigi Di Caro and Guido Boella
39	Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida	Isabell Hubert, Antti Arppe and Jordan Lachler
42	A Classification-based Approach to Economic Event Detection in Dutch News Text	Els Lefever and Véronique Hoste
43	EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis	David Vilares, Miguel A. Alonso and Carlos Gómez-Rodríguez
44	Argument Mining: the bottleneck of knowledge and language resources	patrick saint-dizier
45	An Auto-Adaptative System to Acquire Lexical Knowledge in Technical Texts	patrick saint-dizier
46	Palabras: Crowdsourcing Transcriptions of L2 Speech	Eric Sanders, Pepi Burgos, Catia Cucchiarini and Roeland van Hout
47	Can Tweets Predict TV Ratings?	Bridget Sommerdijk, Eric Sanders and Antal van den Bosch
52	Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform	Sharmin Muzaffar, Pitambar Behera and Girish Jha
54	Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification	Maximilian Köper, Melanie Zaiß, Qi Han, Steffen Koch and Sabine Schulte im Walde
55	Automatically Generated Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas	Maximilian Köper and Sabine Schulte im Walde
58	Evaluating Machine Translation in a Usage Scenario	Rosa Gaudio, Aljoscha Burchardt and António Branco
59	SCARE – The Sentiment Corpus of App Reviews with Fine-grained Annotations in German	Mario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs and Roman Klinger
61	A Dataset for Aspect-Based Sentiment Analysis in French	Marianna Apidianaki, Xavier Tannier and Cécile Richart
62	Finding Alternative Translations in a Large Corpus of Movie Subtitle	Jörg Tiedemann
63	Rude waiter but mouthwatering pastries! An exploratory study into Dutch Aspect-Based Sentiment Analysis	Orphee De Clercq and Veronique Hoste
65	Towards Lexical Encoding of Multiword Expressions in Spanish Dialects	Diana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez and Agata Savary
66	From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse	Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz and Anna Nedoluzhko
67	Laughter in French spontaneous conversational dialogs	Brigitte BIGI and Roxane Bertrand
68	CirdoX: an on/off-line multisource speech and sound analysis software	Frédéric Aman, Michel Vacher, François Portet, William Duclot and Benjamin Lecouteux
69	Discriminating Similar Languages: an Evaluation	Cyril Goutte, Serge Léger, Shervin Malmasi and Marcos Zampieri
70	BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains	Necati Cihan Camgöz, Ahmet Alp Kındıroğlu, Serpil Karabüklü, Meltem Kelepir, Lale Akarun and Ayşe Sumru Özsoy
71	The CIRDO corpus: comprehensive audio/video database of domestic falls of elderly people	Michel Vacher, Saïda Bouakaz, Marc-Eric BOBILLIER CHAUMON, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato and Benjamin Lecouteux
72	Legacy language atlas data mining: mapping Kru languages	Dafydd Gibbon
73	VPS-GradeUp: Graded Decisions on Usage Patterns	Vít Baisa, Silvie Cinkova, Ema Krejčová and Anna Vernerová
75	AppDialogue: Multi-App Dialogues for Intelligent Assistants	Ming Sun, Yun-Nung Chen, Zhenhao Hua, Yulian Tamres-Rudnicky, Arnab Dash and Alexander Rudnicky
76	Urdu Summary Corpus	Muhammad Humayoun, Rao Muhammad Adeel Nawab, Muhammad Uzair, Saba Aslam and Omer Farzand
77	OSMAN – A Novel Arabic Readability Metric	Mahmoud El-Haj and Paul Rayson
79	Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages	Samira Shaikh, Kit Cho, Tomek Strzalkowski, Laurie Feldman, John Lien, Ting Liu and George Aaron Broadwell
80	Error typology and remediation strategies for requirements written in English by non-native speakers	Marie Garnier and patrick saint-dizier
81	Detection of Reformulations in Spoken French	Natalia Grabar and Iris Eshkol-Taravela
82	Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization	Muhammad Humayoun and Hwanjo Yu
83	A rule-based shallow-transfer machine translation system for Scots and English	Gavin Patrick Abercrombie
84	Evaluating Interactive System Adaptation	Edouard Geoffrois
85	Reuse and plagiarism in LREC papers	Gil Francopoulo, Joseph Mariani and Patrick Paroubek
86	The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions	Joachim Daiber and Rob van der Goot
89	Predictive modelling: guessing the NLP terms of tomorrow	Gil Francopoulo, Joseph Mariani and Patrick Paroubek
94	Building A Case-based Semantic English-Chinese Parallel Treebank	huaxing shi, Tiejun Zhao and Keh-Yih Su
95	Challenges and Solutions for Consistent Annotation of Vietnamese Treebank	Quy Nguyen, Yusuke Miyao, Ha Le and Ngan Nguyen
96	Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin	Marco Passarotti, Berta González Saavedra and Christophe Onambele
97	Potsdam Twitter Sentiment Corpus.	Uladzimir Sidarenka
98	FlexTag: A Highly Flexible PoS Tagging Framework	Torsten Zesch and Tobias Horsmann
101	Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces	Matthias Sperber, Graham Neubig, Satoshi Nakamura and Alex Waibel
102	The Gavagai Living Lexicon	Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Anders Holst, Jussi Karlgren, Fredrik Olsson, Per Persson and Akshay Viswanathan
103	Punctuation Prediction for Unsegmented Transcript Based on Word Vector	Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel
104	Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans	Henk van den Heuvel and Nelleke Oostdijk
105	Complementarity, F-score, and NLP Evaluation	Leon Derczynski
108	Sense-annotating a lexical substitution data set with Ubyline	Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho and Iryna Gurevych
110	Creating Open Corpora for Named Entity Recognition of Historical Newspapers	Clemens Neudecker
111	DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining	Mauro Dragoni, Andrea Tettamanzi and Célia da Costa Pereira
113	A framework for collecting realistic recordings of dysarthric speech - the homeService corpus	Mauro Nicolao, Heidi Christensen, Stuart Cunningham, Phil Green and Thomas Hain
114	Discriminative Analysis of Linguistic Features for Typological Study	Hiroya Takamura, Ryo Nagata and Yoshifumi Kawasaki
115	SemAligner: A Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores	Nabin Maharjan, Rajendra Banjade, Nobal Bikram Niraula and Vasile Rus
117	Privacy Issues in Free Online Machine Translation Services - European Perspective	Pawel Kamocki, Jim O'Regan and Marc Stauch
119	DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context	Rajendra Banjade and Vasile Rus
121	The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis	Beata Megyesi, Jesper Näsman and Anne Palmér
122	Universal Dependencies for Japanese	Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori and Yuji Matsumoto
123	SEMRELDATA MULTILINGUAL CONTEXTUAL ANNOTATION OF SEMANTIC RELATIONS BETWEEN NOMINALS: DATASET AND GUIDELINES	Darina Benikova and Chris Biemann
126	An empirical study of Arabic formulaic sequence extraction methods	ayman alghamdi, Eric Atwell and Claire Brierley
127	Concepticon: A Resource for the Linking of Concept Lists	Johann-Mattis List, Michael Cysouw and Robert Forkel
129	Homing in on Twitter users: Evaluating an Enhanced Geoparser for User Profile Locations	Beatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander and Richard Tobin
131	Japanese Word–Color Associations with and without Contexts	Jun Harashima
132	Data formats and management strategies from the perspective of language resource producers -- personal diachronic and social synchronic data sharing --	Kazushi Ohya
135	Phonetic Inventory for an Arabic Speech Corpus	Nawar Halabi and Mike Wald
136	A Language Resource of German Errors Written by Children with Dyslexia	Maria Rauschenberger and Luz Rello
137	MarsaGram: an excursion in the forests of parsing trees	Philippe Blache, Stéphane Rauzy and Grégoire Montcheuil
138	Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic Resources	Hugo Gonçalo Oliveira and Fábio Santos
140	Agentivity and Abstractness Influence Verbal Telicity. A computational experiment.	Ingrid Falk and Fabienne Martin
141	The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language	Dan Tufiș, Verginica Barbu Mititelu, Elena Irimia, Ștefan Daniel Dumitrescu and Tiberiu Boroș
142	Compilation of an Arabic Children’s Corpus	Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley and Eric Atwell
144	CoRuSS - a new prosodically annotated corpus of Russian spontaneous speech	Tatiana Kachkovskaia, Daniil Kocharov, Pavel Skrelin and Nina Volskaya
147	Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene	Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria and Mans Hulden
148	Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns (2nd and 3rd Grade)	Kay Berkling
150	Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing	Hao Zhou, Yue Zhang, Shujian Huang, XIN-YU DAI and Jiajun Chen
151	Defining and counting phonological classes in cross-linguistic segment databases	Dan Dediu and Scott Moisik
152	Annotating Logical Forms for EHR Questions	Kirk Roberts and Dina Demner-Fushman
153	Modelling multi-issue bargaining dialogues: data collection, annotation design and corpus	Volha Petukhova, Christopher Stevens, Harmen de Weerd, Niels Taatgen, Fokie Cnossen and Andrei Malchanau
154	Evaluating a Topic Modelling Approach to Measuring Corpus Similarity	Richard Fothergill, Paul Cook and Timothy Baldwin
155	Benchmarking Lexical Simplification Systems	Gustavo Paetzold and Lucia Specia
156	Syntax-based multi-system machine translation	Matīss Rikters and Inguna Skadina
159	AIMU: Actionable Items for Meeting Understanding	Yun-Nung Chen and Dilek Hakkani-Tur
163	Combining Heuristics with Confidence-Based Sequence Tagging for German Statement Extraction	Thomas Bögel and Michael Gertz
164	Farasa: Fast and Accurate Arabic Word Segmenter	Kareem Darwish and Hamdy Mubarak
170	Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages	Stefan Ecker, Andrea Horbach and Stefan Thater
171	Arabic to English Person Name Transliteration using Twitter	Hamdy Mubarak and Ahmed Abdelali
172	Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario	Lena Keiper, Andrea Horbach and Stefan Thater
174	A Multi-Layered Annotated Corpus of Scientific Papers	Beatriz Fisas, Francesco Ronzano and Horacio Saggion
175	Korean TimeML and Korean TimeBank	Young-Seob Jeong, Won-Tae Joo, Hyun-Woo Do, Chae-Gyun Lim, Key-Sun Choi and Ho-Jin Choi
176	TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns	Kathrin Eichler, Feiyu Xu, Hans Uszkoreit, Leonhard Hennig and Sebastian Krause
179	Use of Domain-Specific Language Resources in Machine Translation	Sanja Štajner, Andreia Querido, Nuno Rendeiro, João Rodrigues and António Branco
181	LVF-lemon – Towards a Linked Data Representation of "Les Verbes français"	Ingrid Falk and Achim Stein
184	Domain ontology learning enhanced by optimized relation instance in DBpedia	Liumingjing Xiao, Chong Ruan, An Yang and Junfeng Hu
186	SYN2015: design and compilation of a new representative corpus of contemporary written Czech	Michal Křen, Václav Cvrček, Tomáš Čapka, Anna Čermáková, Milena Hnátková, Lucie Chlumská, Tomáš Jelínek, Vladimír Petkevič, Pavel Procházka, Hana Skoumalová, Michal Škrabal, Petr Truneček and Pavel Vondřička
188	Challenges of Evaluating Sentiment Analysis Tools on Social Media	Diana Maynard and Kalina Bontcheva
189	EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis	Jasy Suet Yan Liew, Howard Turtle and Elizabeth Liddy
192	WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles	Abbas Ghaddar and Phillippe Langlais
195	Exploitation of Co-reference in Distributional Semantics	Dominik Schlechtweg
196	POS-tagging of Historical Dutch	Dieuwke Hupkes and Rens Bod
197	A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources	Yoshihiko Hayashi
198	Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization	Hiroki Mori, Atsushi Nagaoka and Yoshiko Arimoto
201	An Annotated Corpus for Information Extraction from Ad-Hoc Embedded Structures	Dayne Freitag, John Niekrasz, Richard Rohwer and Eric Yeh
204	A Large DataBase of Hypernymy Relations Extracted from the Web.	Julian Seitner, Christian Bizer, Stefano Faralli, Robert Meusel, Heiko Paulheim and Simone Paolo Ponzetto
206	The VU Sound Corpus: Adding more fine-grained annotations to the Freesound database	Emiel van Miltenburg, Benjamin Timmermans and Lora Aroyo
207	Automatic anomaly detection for dysarthria across two speech styles: read vs spontaneous speech	Imed Laaridh, Corinne Fredouille and Christine Meunier
208	A Taxonomy for Specific Problem Classes in Text-to-Speech Synthesis Comparing Commercial and Open Source Perfomance	Felix Burkhardt and Uwe D. Reichel
210	User, who art thou? User Profiling for Oral Corpus Platforms	Christian Fandrych, Hanna Hedeland, Daniel Jettka, Thomas Schmidt, Cordula Meißner, Franziska Wallner, Kathrin Weigert and Anna Iliash
211	JATE 2.0: Java Automatic Term Extraction with Apache Solr	Ziqi Zhang, Jie Gao and Fabio Ciravegna
213	A Preliminary Bilingual Discourse Corpus and Its Applications	yang liu, Jiajun Zhang and Chengqing Zong
214	Quality Assessment of the Reuters Vol. 2 Multilingual Corpus	Robin Eriksson
217	Crowdsourcing ontology lexicons	Bettina Lanser, Christina Unger and Philipp Cimiano
219	Language Resource Addition Strategies for Raw Text Parsing	Atsushi Ushiku, Tetsuro Sasada and Shinsuke Mori
221	Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems	Nikos Katris, Richard Sutcliffe and Theodore Kalamboukis
222	Information structure in the Potsdam Commentary Corpus: Topics	Manfred Stede and Sara Mamprin
223	Curation of Dutch Regional Dictionaries	Nicoline van der Sijs, Eric Sanders, Henk van den Heuvel and Aukje Borkent
224	A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon	Tibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic and Johanna Marie Poppek
225	A lexicon of perception for the identification of synaesthetic metaphors in corpora	Francesca Strik Lievers and Chu-Ren Huang
226	CATaLog Online: Porting a Post-editing Tool to the Web	Santanu Pal, Marcos Zampieri, Mihaela Vela, Tapas Nayak and Sudip Kumar Naskar
227	Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts	Anne-Kathrin Schumann, Stefan Fischer and Jörg Knappen
228	The SpeDial datasets: datasets for Spoken Dialogue Systems analytics	José Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif and Alexandros Potamianos
229	A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds	Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal
231	The ILMT-s2s Corpus	Akira Hayakawa, Saturnino Luz, Loredana Cerrato and Nick Campbell
232	A Dataset for Detecting Stance in Tweets	Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu and Colin Cherry
234	Sentiment Lexicons for Arabic Social Media	Saif Mohammad, Mohammad Salameh and Svetlana Kiritchenko
237	Building Chinese Affective Resources in Valence-Arousal Dimensions	Liang-Chih Yu, Lung-Hao Lee, Shuai Hao, Jun Hu and K. Robert Lai
238	Semi-automatically Alignment of Predicates between Speech and OntoNotes data	Niraj Shrestha and Marie-Francine Moens
240	The Negochat Corpus of Human-agent Negotiation Dialogues	Vasily Konovalov, Ron Artstein, Oren Melamud and Ido Dagan
242	A Sentiment Lexicon of Phrases with Mixed Polarities	Svetlana Kiritchenko and Saif Mohammad
243	KorAP Architecture – Diving in the Deep Sea of Corpus Data	Nils Diewald, Michael Hanl, Eliza Margaretha, Joachim Bingel, Marc Kupietz, Piotr Banski and Andreas Witt
246	Name translation based on fine-grained named entity recognition in a single language	Kugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano and Yoshihiro Matsuo
249	Wikification for an Unsegmented Language	Yugo Murawaki and Shinsuke Mori
250	A Terminological Approach to EC Framework Programmes Titles (1994-2012)	Gabriella Pardelli, Sara Goggi, Silvia Giannini and Stefania Biagioni
251	The IFCASL corpus of French and German non-native and native read speech	Juergen Trouvain
253	Legal Text Interpretation: Identifying Hohfeldian Relations from Text	Wim Peters and Adam Wyner
255	A novel method for evaluation of morphological segmentation	Javad Nouri and Roman Yangarber
256	Learning Tone and Attribution for Financial Text Mining	Mahmoud El-Haj, Paul Rayson, Steve Young, Andrew Moore, Martin Walker, Thomas Schleicher and Vasiliki Athanasakou
257	Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages	Scott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Křen, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey Lee Teh and Olga Mudraya
258	Mirroring Facial Expressions and Emotions in Dyadic Conversations	Costanza Navarretta
259	SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies	Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg and Monica Sandell
260	Uzbek-English and Turkish-English Morpheme Alignment Corpora	Xuansong Li, Jennifer Tracey, Stephen Grimes and Stephanie Strassel
261	Text segmentation of digitized clinical texts	Cyril Grouin
264	E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses	Yuval Marton and Kristina Toutanova
265	A Morphological Lexicon of Esperanto with Morpheme Frequencies	Eckhard Bick
266	How does Dictionary Size Influence Performance of Vietnamese Word Segmentation?	Wuying Liu and Lin Wang
268	The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources	Georg Rehm
270	Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus	Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes and Stephanie Strassel
271	mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing	Silvio Cordeiro, Carlos Ramisch and Aline Villavicencio
274	Adding Semantic Relations to a Large-Coverage Connective Lexicon of German	Tatjana Scheffler and Manfred Stede
275	SVALex: a CEFR-graded lexical resource for Swedish foreign and second language learners	Thomas Francois, Elena Volodina, Ildikó Pilán and Anaïs Tack
276	Creating annotated dialogue resources: cross-domain dialogue act classification	Dilafruz Amanova, Volha Petukhova and Dietrich Klakow
278	Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties	Alexandra Balahur and Hristo Tanev
279	Giving lexical resources a second life: Démonette, a multi-sourced morpho-semantic network for French	Nabil Hathout and Fiammetta Namer
280	Investigation on OCR Error Correction for Historical Texts	Haithem Afli, Zhengwei Qiu, Andy Way and Páraic Sheridan
281	Lexical resources to enrich English Malayalam Machine Translation	Sreelekha S and Pushpak Bhattacharyya
282	Crossmodal Network-Based Distributional Semantic Models	Elias Iosif and Alexandros Potamianos
283	Building a corpus of errors and quality in Machine Translation: experiments on error impact	Angela Costa, Rui Correia and Luisa Coheur
284	Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset	Vuk Batanović, Boško Nikolić and Milan Milosavljević
285	Creating a General Russian Sentiment Lexicon	Natalia Loukachevitch and Anatolii Levchik
286	TTS for Low Resource Languages: A Bangla Synthesizer	Richard Sproat, Alexander Gutkin, Linne Ha, Martin Jansche and Knot Pipatsrisawat
288	A Semantically Compositional Annotation Scheme for Time Normalization	Steven Bethard and Jonathan Parker
289	Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language	Yow-Ting Shiue and Hsin-Hsi Chen
291	PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors	Gözde Özbal, Carlo Strapparava and Serra Sinem Tekiroglu
292	Corpus annotation within the French FrameNet: methodology and results	Marianne Djemaa, Marie Candito, Philippe Muller and Laure Vieu
294	Extraction of English Spelling Errors using a Word Typing Game	Ryuichi Tachibana and Mamoru Komachi
295	Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.	Jon Chamberlain, Massimo Poesio and Udo Kruschwitz
296	TermoPL - a flexible tool for terminology extraction	Malgorzata Marciniak, Agnieszka Mykowiecka and Piotr Rychlik
298	Correcting Errors in a Treebank Based on Tree Mining	Kanta Suzuki, Yoshihide Kato and Shigeki Matsubara
300	GhoSt-NN: A representative Gold Standard of German Noun-Noun Compounds	Sabine Schulte im Walde, Anna Hätty, Stefan Bott and Nana Khvtisavrishvili
302	A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances	Roman Sergienko, Muhammad Shan and Wolfgang Minker
304	A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection	Jérémy Ferrero, Frédéric Agnès, Laurent Besacier and Didier Schwab
305	Corpus resources for dispute mediation discourse	Mathilde Janier and Chris Reed
306	The SemDaX corpus – sense annotations with scalable sense inventories	Bolette Pedersen, Anna Braasch, Anders Johannsen, Héctor Martínez Alonso, Sanni Nimb, Sussi Olsen, Anders Søgaard and Nicolai Hartvig Sørensen
307	A Corpus of Argument Networks: Using graph properties to analyse divisive issues	Barbara Konat, John Lawrence, Joonsuk Park, Katarzyna Budzynska and Chris Reed
308	LibN3L:A Lightweight Package for Neural NLP	Meishan Zhang, Jie Yang, Zhiyang Teng and Yue Zhang
310	Novel annotation schemes for sentential and sub-sentential alignments of bi-texts	Yong Xu and François Yvon
311	Extractive Summarization under Strict Length Constraints	Yashar Mehdad, Amanda Stent, Kapil Thadani, Dragomir Radev, Youssef Billawala and Karolina Buchner
312	Temporal annotation: it is the right time to improve ISO TimeML!	Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol and Delphine Battistelli
313	A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability	Elif Ahsen Tolgay, Deniz Zeyrek, Murathan Kurfalı and Cem Bozşahin
314	Monitoring disease outbreak events on the web using text-mining approach and domain expert knowledge	Elena Arsevska, Mathieu Roche, Sylvain Falala, Renaud Lancelot, David Chavernac, Pascal Hendrikx and Barbara Dufour
315	4Couv: A New Treebank for French	Philippe Blache, Gregoire de Montcheuil, Laurent Prévot and Stéphane Rauzy
316	Domain-Specific Corpus Expansion with Focused Webcrawling	Steffen Remus and Chris Biemann
317	Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest	Dragomir Radev, Amanda Stent, Joel Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha and Robert Mankoff
320	A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research	Jun Harashima, Michiaki Ariga, Kenta Murata and Masayuki Ioki
324	Building concept graphs from monolingual dictionary entries	Gábor Recski and Gábor Borbély
325	Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context	Félicien VALLET, Jim URO, Jérémy ANDRIAMAKAOLY, Hakim NABI, Mathieu DERVAL and Jean CARRIVE
327	PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation	Liane Guillou and Christian Hardmeier
329	CORILSE: a Spanish Sign Language Repository for Linguistic Analysis	María del Carmen Cabeza-Pereiro, José Mª García-Miguel, Carmen García Mateo and José Luis Alba Castro
330	Detecting optional arguments of verbs	Andras Kornai, Dávid Márk Nemeskey and Gábor Recski
332	EstNLTK - NLP Toolkit for Estonian	Siim Orasmaa, Timo Petmanson, Alexander Tkachenko, Sven Laur and Heiki-Jaan Kaalep
333	A Comparative Analysis of Crowdsourced Natural Language Corpora for In-Car Spoken Dialog Systems	Patricia Braunger, Hansjörg Hofmann, Steffen Werner and Maria Schmidt
334	Leveraging Native Data to Correct Preposition Errors in Learners' Dutch	Lennart Kloppenburg and Malvina Nissim
335	A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings	Aitor García Pablos, Montse Cuadros and German Rigau
337	South African National Centre for Digital Language Resources	Justus Roux
339	Discourse structure and dialogue acts in multiparty dialogue: the STAC corpus	Nicholas Asher, Julie Hunter, Mathieu Morey, Eric Kow, Stergos Afantenos, Philippe Muller, Benamara Farah and Jérémy Perret
340	New inflectional lexicons for improved processing of Croatian and Serbian	Nikola Ljubešić, Filip Klubička, Željko Agić and Ivo-Pavao Jazbec
341	An Arabic-Moroccan Darija Code-Switched Corpus	Younes Samih and Wolfgang Maier
342	Automatically Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus	SoHyun Park, Paul Cook, Afsaneh Fazly, Annie Lee, Brandon Seibel and Wenjie Zi
343	The OFAI Multi-Modal Task Description Corpus	Stephanie Schreitter and Brigitte Krenn
345	A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers	Carlos Valmaseda, Juan Martinez-Romo and Lourdes Araujo
346	A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults	Victoria Yaneva, Irina Temnikova and Ruslan Mitkov
347	DeQue: A Lexicon of Complex Prepositions and Conjunctions in French	Carlos Ramisch, Alexis Nasr, André Valli and José Deulofeu
348	Universal Dependencies v1: A Multilingual Treebank Collection	Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty and Daniel Zeman
349	SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking	Marie-Jean Meurs, Hayda Almeida, Ludovic Jean-Louis and Eric Charton
351	A Japanese Chess Commentary Corpus	Shinsuke Mori, John Richardson, Tetsuro Sasada, Hirotaka Kameko and Yoshimasa Tsuruoka
352	InScript: Narrative texts annotated with script information	Ashutosh Modi, Tatjana Anikina and Manfred Pinkal
353	Finding Definitions in Large Corpora with Sketch Engine	Vojtěch Kovář, Monika Močiariková, Milos Jakubicek and Vít Baisa
354	Towards a multi-dimensional taxonomy of stories in dialogue	Kathryn J. Collins and David Traum
356	A Corpus of Personal Narratives and Their Story Intention Graphs	Stephanie Lukin
357	Fine-Grained Chinese Discourse Relation Labelling	Huan-Yuan Chen, Wan-Shan Liao, Hen-Hsen Huang and Hsin-Hsi Chen
359	Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation	Chenhui Chu and Sadao Kurohashi
361	Corpus-based diacritic restoration for South Slavic languages	Nikola Ljubešić, Tomaž Erjavec and Darja Fišer
362	AfriBooms: An online treebank for Afrikaans	Liesbeth Augustinus, Peter Dirix, Daniel Van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde and Gerhard Van Huyssteen
363	Parallel Sentence Extraction from Comparable Corpora with Neural Network Features	Chenhui Chu, Raj Dabre and Sadao Kurohashi
364	UPPC - Urdu Paraphrase Plagiarism Corpus	Muhammad Sharjeel, Paul Rayson and Rao Muhammad Adeel Nawab
366	A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization	Fajri Koto and Omar Abdillah
367	A singing voice database in Basque for statistical singing synthesis of bertsolaritza	Xabier Sarasola, Eva Navas, David Tavarez, Daniel Erro, Ibon Saratxaga and Inma Hernaez
368	Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin	Edoardo Maria Ponti and Marco Passarotti
370	A Semi-Supervised Approach to Gender Identification	Juan Soler and Leo Wanner
371	How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News	Imran Sheikh, Irina Illina and Dominique Fohr
373	A New Text Simplification Gold Standard for Readers with Learning Disabilities	Victoria Yaneva, Irina Temnikova and Ruslan Mitkov
375	AMISCO: The Austrian German Multi-Sensor Corpus	Hannes Pessentheiner, Thomas Pichler and Martin Hagmüller
376	Emotion Analysis on Twitter: The Hidden Challenge	dini luca and André Bittar
377	A Document Repository for Social Media and Speech Conversations	Adam Funk, R. Gaizauskas and Benoit Favre
378	A DATABASE OF LARYNGEAL HIGH-SPEED VIDEOS WITH SIMULTANEOUS HIGH-QUALITY AUDIO RECORDINGS OF PATHOLOGICAL AND NON-PATHOLOGICAL VOICES	Philipp Aichinger, Immer Roesner, Matthias Leonhard, Doris-Maria Denk-Linnert, Wolfgang Bigenzahn and Berit Schneider-Stickler
380	Enhancing Access to Online Education: Quality Machine Translation of MOOC Content	Valia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx and Matthias Huck
382	Semantic layer of the valence dictionary of Polish Walenty	Elżbieta Hajnicz, Anna Andrzejczuk and Tomasz Bartosiak
384	Identifying content types of messages related to Open Source Software projects	Ioannis Korkontzelos, Paul Thompson and Sophia Ananiadou
386	WTF-LOD – A New Resource for Large-Scale NER Evaluation	Pavel Smrz and Lubomir Otrusina
387	Ensemble Classification of Grants using LDA-based features	Ioannis Korkontzelos, Beverley Thomas, Makoto Miwa and Sophia Ananiadou
388	C4Corpus: Multilingual Web-size corpus with free license	Ivan Habernal, Omnia Zayed and Iryna Gurevych
390	Improving Information Extraction from Wikipedia Texts using Basic English	Teresa Rodriguez-Ferreira, Adrian Rabadan, Raquel Hervas and ALBERTO DIAZ
392	Word embedding evaluation and combination	Sahar Ghannay, Benoit Favre, Yannick Estève and Nathalie Camelin
393	Riddle Generation using Word Associations	Paloma Galvan, Virginia Francisco, Raquel Hervas and Gonzalo Mendez
394	Exploiting a Large Strongly Comparable Corpus	Thierry Etchegoyhen, Andoni Azpeitia and Naiara Pérez
396	Purely Corpus-based Automatic Conversation Authoring	Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat and Sophie Rosset
397	FOLK-Gold – A Gold Standard for Part-of-Speech-Tagging of Spoken German	Swantje Westpfahl and Thomas Schmidt
398	Ambiguity Diagnosis for Terms in Digital Humanities	Béatrice Daille, Evelyne Jacquey, Gaël Lejeune, Luis Felipe Melo and Yannick Toussaint
400	Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions	Daniela Beltrami, Laura Calzà, Gloria Gagliardi, Enrico Ghidoni, Marcella Norino, Rema Rossini Favretti and Fabio Tamburini
401	CINTIL DependencyBank PREMIUM - A corpus of grammatical dependencies for Portuguese	Rita de Carvalho, Andreia Querido, Marisa Campos, Rita Valadas Pereira, João Silva and António Branco
403	A general framework for the annotation of causality based on FrameNet	Laure Vieu, Philippe Muller, Marie Candito and Marianne Djemaa
405	PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits	Maja Popović and Mihael Arcan
406	Cognitively Motivated Distributional Representations of Meaning	Elias Iosif, Spiros Georgiladakis and Alexandros Potamianos
407	AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis	Kevin El Haddad, Huseyin Cakmak, Stéphane Dupont and Thierry Dutoit
411	Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies	Kadri Muischnek, Kaili Müürisep and Tiina Puolakainen
412	D(H)ante: A new set of tools for XIII century Italian	Angelo Basile and Federico Sangati
413	Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis	Marta Martinez, Rocio Varela, Carmen Garcia-Mateo, Elisa Fernandez Rei and Adela Martinez Calvo
416	LexFr: adapting the LexIt framework to build a corpus-based French subcategorization lexicon	Giulia Rambelli, Gianluca Lebani, Laurent Prévot and Alessandro Lenci
417	QUEMDISSE? Reported speech in Portuguese	Cláudia Freitas, Bianca Freitas and Diana Santos
418	Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles	Alakananda Vempala and Eduardo Blanco
419	Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data	Thomas Hanke
420	A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives	Zhichao Hu, Michelle Dick, Chung-Ning Chang, Michael Neff, Jean Fox Tree and Marilyn Walker
422	Construction of an English Dependency Corpus incorporating Compound Function Words	Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto
423	South African language resources: phrase chunkers	Roald Eiselen
425	Impact of automatic segmentation on the quality, productivity and self-reported post-editing effort of intralingual subtitles	Aitor Alvarez, Marina Balenciaga, Arantza del Pozo, Haritz Arzelus, Anna Matamala and Carlos-D. Martínez-Hinarejos
426	Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings	Yoshihiko Hayashi and Luo Wentao
427	Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons	Antoine Bourlon, Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi
428	Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms	Guillaume Jacquet, Maud Ehrmann, Ralf Steinberger and Jaakko Vaeyrynen
429	Design and development of the MERLIN learner corpus platform	Verena Lyding and Karin Schöne
430	ENJA15: a free corpus of English à Japanese Translation Process Data	Michael Carl, Akiko Aizawa and Masaru Yamada
433	Towards a Universal Dependency Treebank of Spoken Slovenian	Kaja Dobrovoljc and Joakim Nivre
434	The Hebrew FrameNet Project	Avi Hayoun and Michael Elhadad
435	Introducing the Asian Language Treebank (ALT)	Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew Finch and Eiichiro Sumita
437	Addressing the MFS bias in WSD systems	Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau and Piek Vossen
438	Distribution of Valency Complementations in Czech Complex Predicates: Between Verb and Noun	Václava Kettnerová and Eduard Bejček
439	The COPLE2 corpus: a learner corpus for Portuguese	Amália Mendes, Sandra Antunes, Maarten Janssen and Anabela Gonçalves
440	Reliability of phonetic and prosodic annotation	Andreas Kirkedal
441	A Lexical Resource of Hebrew Verb Noun Multi-Word Expressions	Chaya Liebeskind and Yaakov HaCohen-Kerner
442	Italian VerbNet: A Construction-based Approach to Italian Verb Classification	Lucia Busso and Alessandro Lenci
444	TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics	Andy Luecking, Armin Hoenen and Alexander Mehler
448	1 Million Captioned Dutch Newspaper Images	Desmond Elliott and Martijn Kleppe
449	A Language Independent Method for Generating Large Scale Polarity Lexicons	Giuseppe Castellucci, Danilo Croce and Roberto Basili
450	ANTUSD: A Large Chinese Sentiment Dictionary	Shih-Ming Wang and Lun-Wei Ku
451	Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015	Johann Poignant, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau and Thomas Tamisier
452	Multimodal Resources for Human-Robot Communication Modelling	Stavroula–Evita Fotinea, Eleni Efthimiou, Maria Koutsombogera, Athanasia-Lida Dimou, Theodore Goulas and Kyriaki Vasilaki
453	Metrical annotation of a large corpus of Spanish sonnets: representation, scansion and evaluation.	Borja Navarro, María Ribes-Lafoz and Noelia Sánchez
455	Discriminating Hypernyms, Co-Hyponyms and Randoms	Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang
456	The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents	Johann Poignant, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset and Thomas Tamisier
457	Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks	Ines Rehbein, Merel Scholman and Vera Demberg
458	Syllable based DNN-HMM Cantonese Speech to Text System	Timothy WONG, Claire LI, Sam LAM, Billy Chiu and Qin LU
459	Corpus for Customer Purchase Behavior Prediction in Social Media	Shigeyuki Sakaki, Francine Chen, Mandy Korpusik and Yan-Ying Chen
460	Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof	Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese and Uriel Pascal Elingui
461	metaTED: a Corpus of Metadiscourse for Spoken Language	Rui Correia, Nuno Mamede, Jorge Baptista and Maxine Eskenazi
462	Norwegian Universal Dependencies	Lilja Øvrelid
465	TweetMT: A parallel microblog corpus	Iñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martinez Garcia, Antonio Toral and Arkaitz Zubiaga
466	Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recogntion	Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura and Kazuhiro Nakadai
468	Polarity lexicon building: to what extent is the manual effort worth?	Iñaki San Vicente and Xabier Saralegi
469	A multi-layered annotation scheme for perspectives	Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo and Piek Vossen
471	Nederlab: towards a single portal and research environment for diachronic Dutch text corpora	Hennie Brugman, Martin Reynaert, Nicoline van der Sijs, René van Stipriaan, Erik Tjong Kim Sang, Antal van den Bosch, Jan Pieter Kunst, Rob Zeeman, Dieuwertje Kooij, Ineke Brussee, Matthijs Brouwer, marc kemps-snijders and Hans Bennis
473	NLP and public engagement: The case of the Italian School Reform	Tommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey and Donatella Solda Kutzmann
474	Enhancing the RATP-DECODA corpus with linguistic annotations for performing a large range of NLP tasks	Carole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève and Paul Deléglise
476	FLAT: constructing a CLARIN compatible home for language resources	Menzo Windhouwer, Marc Kemps-Snijders, Paul Trilsbeek, André Moreira, Bas Van der Veen and Guilherme Silva
477	Parallel discourse annotations on a corpus of short texts	Manfred Stede, Stergos Afantenos, Andreas Peldszus, Nicholas Asher and Jérémy Perret
478	BulPhonC: Bulgarian speech corpus for the development of ASR technology	Neli Hateva, Petar Mitankin and Stoyan Mihov
479	Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian	Mārcis Pinnis, Askars Salimbajevs and Ilze Auzina
481	Challenges of Adjective Mapping between plWordNet and Princeton WordNet	Ewa Rudnicka, Wojciech Witkowski and Katarzyna Podlaska
484	SCALE: A Scalable Language Engineering Toolkit	Joris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van hamme and Patrick Wambacq
486	Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions	Vincent Vandeghinste, Liesbeth Augustinus and Tom Vanallemeersch
487	Web Chat Conversations from Contact Centers: a Descriptive Study	Geraldine Damnati, Aleksandra Guerraz and Delphine Charlet
488	MEANTIME, the NewsReader Multilingual Event and Time Corpus	Anne-Lyse Minard, Manuela Speranza, Ruben Urizar, Begoña Altuna, Marieke van Erp, Anneleen Schoen and Chantal van Son
489	LanguageCrawl: A Generic Tool for Building Language Models Upon Common-Crawl	Szymon Roziewski and Wojciech Stokowiec
490	The Digital Language Diversity Project	Claudia Soria and Irene Russo
492	COULD SPEAKER, GENDER, OR AGE AWARENESS BE BENEFICIAL IN SPEECH-BASED EMOTION RECOGNITION?	Maxim Sidorov and Alexander Schmitt
494	Crowdsourcing a large dataset of domain-specific context-sensitive semantic verb relations	Maria Sukhareva, Judith Eckle-Kohler, Ivan Habernal and Iryna Gurevych
495	Evaluating translation quality and CLIR performance of Query Sessions	Xabier Saralegi, Eneko Agirre and Iñaki Alegria
498	The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation	Jorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes and Fernando Perdigão
499	Crowdsourced Corpus with Entity Salience Annotations	Milan Dojchinovski, Dinesh Reddy, Tomáš Kliegr, Tomas Vitvar and Harald Sack
501	ELMD: An automatically generated entity linking gold standard in the music domain	Sergio Oramas, Luis Espinosa Anke, Mohamed Sordo, Horacio Saggion and Xavier Serra
502	The BAS speech data repository	Uwe Reichel, Florian Schiel, Thomas Kisler and Christoph Draxler
503	Features for Generic Corpus Querying	Thomas Eckart, Christoph Kuras and Uwe Quasthoff
505	Relation- and phrase-level linking of FrameNet with Sar-graphs	Aleksandra Gabryszak, Sebastian Krause, Leonhard Hennig, Feiyu Xu and Hans Uszkoreit
506	Graded and Word-Sense-Disambiguation decisions in Corpus Pattern Analysis: a pilot study	Silvie Cinkova, Ema Krejčová and Anna Vernerová
511	Combining manual and automatic prosodic annotation for expressive speech synthesis	Sandrine Brognaux, Thomas Francois and Marco Saerens
513	Cysill Ar-lein: A Corpus of written contemporary Welsh compiled from an on-line spelling and grammar checker	Delyth Prys, Gruffudd Prys and Dewi Bryn Jones
514	Identification of Drug-Related Medical Conditions in Social Media	François Morlane-Hondère, Cyril Grouin and Pierre Zweigenbaum
515	Emotion Corpus Construction Based on Selection from Noisy Natural Labels	Minglei LI, Yunfei Long and Lu Qin
517	Like a Foreign Student: Unsupervised Measure of Word Similarity in ESL and TOEFL	Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang
518	Mining the Spoken Wikipedia for Speech Data and Beyond	Arne Köhn, Florian Stegen and Timo Baumann
519	On the use of a serious game for recording a speech corpus of people with intellectual disabilities	Mario Corrales-Astorgano, David Escudero-Mancebo, Yurena Gutiérrez-González, Valle Flores-Lucas, César González-Ferreras and Valentín Cardeñoso-Payo
521	A corpus of clinical practice guidelines annotated with the importance of recommendations	Jonathon Read, Erik Velldal, Marc Cavazza and Gersende Georg
523	Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse	Artemis Parvizi, Matt Kohl and Meritxell Gonzàlez
524	Construction and analysis of a large Vietnamese text corpus	Dieu-Thu Le and Uwe Quasthoff
525	The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics	Ryuichiro Higashinaka, Kotaro Funakoshi, Yuka Kobayashi and Michimasa Inaba
526	CLARIAH in the Netherlands	Jan Odijk
527	Using Contextual Information for Machine Translation Evaluation	Marina Fomicheva and Núria Bel
529	The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts	Amy Isard
530	SpaceRef: A corpus of street-level geographic descriptions	Jana Götze and Johan Boye
531	That'll do fine!: A coarse lexical resource for English-Hindi MT, using polylingual topic models	Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya and Mark James Carman
532	Constructing a Norwegian Academy Vocabulary List	Janne M Johannessen, Arash Saidi and Kristin Hagen
533	Remote access to Walenty—a valence dictionary of Polish	Bartłomiej Nitoń, Tomasz Bartosiak and Elżbieta Hajnicz
534	Tweeting and being ironic in the debate about a political reform: the French annotated corpus TWitter-MariagePourTous	Cristina Bosco, Mirko Lai, Viviana Patti and Daniela Virone
535	Syntactic analysis of phrasal compounds in corpora: a challenge for NLP tools	Carola Trips
536	CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence	Alessia Barbagli, Pietro Lucisano, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi
537	Forecasting Emerging Trends from Scientific Literature based on Keyword Extraction and Prediction	Kartik Asooja, Georgeta Bordea, Gabriela Vulcu and Paul Buitelaar
538	CEPLEXicon – A lexicon of child European Portuguese	Ana Lúcia Santos, Maria João Freitas and Aida Cardoso
539	Evaluating the Impact of Light Post-Editing on Usability	Sheila Castilho and Sharon O'Brien
541	Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus	Andy Luecking and Alexander Mehler
542	Case study: Qamus Muhit, a medieval Arabic lexicon in LMF	Ouafae Nahli, Francesca Frontini, Monica Monachini, Fahad Khan and Arsalan Zarghili
543	Crosswalking from CMDI to Dublin Core and MARC 21	Claus Zinn, Thorsten Trippel, Steve Kaminski and Emanuel Dima
544	Evaluating lexical simplification and vocabulary knowledge for learners of French: possibilities of using the FLELex resource	Anaïs Tack, Thomas Francois, Anne-Laure Ligozat and Cédrick Fairon
547	Learning Morphological Transformations for a Language-independent Corpus-based Compound Splitter	Patrick Ziering and Lonneke van der Plas
548	A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation	Robert Herms
549	Parallel Speech Corpora of Japanese Dialects	Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama and Hiroshi G. Okuno
550	EasyTree: a Graphical Tool for Dependency Tree Annotation	Alexa Little and Stephen Tratz
551	Automatic recognition of linguistic replacements in text series generated from keystroke logs	Daniel Couto-Vale, Stella Neumann and Paula Niemietz
553	Towards a corpus of reports of violence in Arabic social media	Ayman Alhelbawy, Poesio Massimo and Udo Kruschwitz
554	Affective lexicon creation for the Greek language	Elisavet Palogiannidi, Elias Iosif, Polychronis Koutsakis and Alexandros Potamianos
555	The TYPALOC Corpus: A collection of various dysarthric speech in read and spontaneous speech	Christine Meunier, cecile fougeron, Corinne Fredouille, Brigitte BIGI, Lise Crevier-Buchman, Elisabeth DELAIS-ROUSSARIE, Laurianne Georgeton, Alain Ghio, Imed Laaridh, Thierry Legou, claire Pillot-Loiseau and Gilles Pouchoulin
556	Bootstrapping a Hybrid MT System to a New Language Pair	João António Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja Štajner and António Branco
557	Large rated lexicon with French medical words	Natalia Grabar and Thierry Hamon
559	Multilevel Annotation of Agreement and Disagreement in Italian News Blogs	Fabio Celli, Giuseppe Riccardi and Firoj Alam
561	Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation	Navid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson and Allan Hanbury
562	Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik	Patrick Littell, Chris Dyer, Kartik Goyal, David Mortensen and Lori Levin
563	PentoRef: A Corpus of Spoken References in Task-oriented Dialogues	Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernandez and David Schlangen
564	Building Language Resources for Exploring Autism Spectrum Disorders	Julia Parish-Morris, Christopher Cieri, Mark Liberman, Leila Bateman, Emily Ferguson and Robert T. Schultz
566	Comprehensive and Consistent PropBank Light Verb Annotation	Claire Bonial and Martha Palmer
569	Chatbot technology with synthetic voices in the acquisition of an endangered language: motivation, development and evaluation of a platform for Irish	Neasa Ní Chiaráin and Ailbhe Ní Chasaide
570	Summ-it++: an enriched version of the Summ-it corpus	Evandro Fonseca, André Antonitsch, Sandra Collovini, Daniela Amaral and Renata Vieira
571	Automatic corpus extension for data-driven natural language generation	Elena Manishina, Bassam Jabaian, Stéphane Huet and Fabrice Lefevre
572	European Union Language Resources in Sketch Engine	Vít Baisa, Jan Michelfeit and Marek Medveď
573	ADAPTING AN ENTITY CENTRIC MODEL FOR PORTUGUESE COREFERENCE RESOLUTION	Evandro Fonseca, Renata Vieira and Aline Vanin
574	Utilising Linked Linguistic Resources for Semantic Role Labeling in Hungarian	Balázs Indig, Márton Miháltz and András Simonyi
578	FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies	Milan Dojchinovski, Felix Sasaki, Tatjana Gornostaja, Sebastian Hellmann, Erik Mannens, Frank Salliau, Michele Osella, Phil Ritchie, Giannis Stoitsis, Kevin Koidl, Markus Ackermann and Nilesh Chakraborty
579	The Tilburg DialogBank corpus	Harry Bunt, Volha Petukhova, Andrei Malchanau and Kars Wijnhoven
581	Extracting Structured Scholarly Information from the Machine Translation Literature	Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight and Daniel Marcu
582	Edit Categories and Editor Role Identification in Wikipedia	Diyi Yang, Aaron Halfaker, Robert Kraut and Eduard Hovy
584	Inconsistency Detection in Semantic Annotation	Nora Hollenstein, Nathan Schneider and Bonnie Webber
588	Automatic Identification of Bibliographical Zone in Papers	Amal Htait, Sebastien Fournier and Patrice Bellot
589	Iterative NLP-assisted refinement for Clinical Annotations of Chronic Disease Events	Stephen Wu, Chung-Il Wi, Sunghwan Sohn, Hongfang Liu and Young Juhn
591	Analysis of Image Description Datasets and Evaluation Metrics by Jackknifing	Josiah Wang and Robert Gaizauskas
595	Developing a Dataset for Evaluating Approaches for Document Expansion with Images	Debasis Ganguly, Iacer Calixto and Gareth Jones
596	OCR post-correction evaluation of Early Dutch Books Online -- revisited	Martin Reynaert
597	Using BabelNet to Augment Statistical Machine Translation	Jinhua Du, Andy Way and Andrzej Zydron
598	The Artwalk Corpus of Mobile Referential Communication	Kris Liu, Jean Fox Tree and Marilyn Walker
599	A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety	Mathieu Chollet, Torsten Wörtwein, Louis-Philippe Morency and Stefan Scherer
600	Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping	Fahad Albogamy and Allan Ramsay
607	WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words	Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian and Marcello Federico
608	LREC as a Graph: People and Resources in a Network	Riccardo Del Gratta, Francesca Frontini, Monica Monachini, Gabriella Pardelli, Irene Russo, Roberto Bartolini, Sara Goggi, Fahad Khan, Valeria Quochi, Claudia Soria and Nicoletta Calzolari
610	Integration of lexical and semantic knowledge For sentiment analysis in SMS.	wijden khiari, Mathieu Roche and Asma Bouhafs Hafsia
611	DART: a Dataset of Arguments and their Relations on Twitter	Tom Bosc, Elena Cabrio and Serena Villata
612	Multi-prototype Chinese Character Embedding	Yanan Lu, Yue Zhang and Donghong Ji
613	Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries	Marta Villegas, Maite Melero, Núria Bel and Jorge Gracia
616	Hypergraph Modelization of a Syntactically Annotated Wikipedia Dump	Edmundo Pavel Soriano Morales, Julien Ah-Pine and Sabine Loudcher
617	Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization	Şaziye Betül Özateş, Arzucan Özgür and Dragomir Radev
619	MADAD: A Readability Annotation Tool for Arabic Text	Nora Al-Twairesh
621	ASPEC: Asian Scientific Paper Excerpt Corpus	Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi and Hitoshi Isahara
622	Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis	Amir Hazem and Béatrice Daille
624	Extracting Weighted Language Lexicons from Wikipedia	Gregory Grefenstette
625	Best of Both Worlds: Making Word Sense Embeddings Interpretable	Alexander Panchenko
626	Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models	Amir Hazem and Emmanuel MORIN
628	Discontinuous Verb Phrases in Parsing and Machine Translation of English and German	Sharid Loáiciga and Kristina Gulordava
629	A Large-Scale Multilingual Disambiguation of Glosses	José Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato and Roberto Navigli
630	Experiments on the Enrichment of a Topic Model with Lexical-Semantic Knowledge	Adriana Ferrugento, Hugo Gonçalo Oliveira, Ana Alves and Filipe Rodrigues
632	Domain adaptation in MT using titles in Wikipedia as a parallel corpus: resources and evaluation	Gorka Labaka, Iñaki Alegria, Cristina España-Bonet and Alberto Barrón-Cedeño
633	IMS HotCoref DE: A data-driven coreference resolver for German	Ina Roesiger
634	Resources for building applications with Dependency Minimal Recursion Semantics	Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alexander Kuhlne and Ewa Muszyńska
635	Crowdsourcing Salient Information from News and Tweets	Oana Inel, Tommaso Caselli and Lora Aroyo
636	More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing	Pablo Ruiz, Clément Plancq and Thierry Poibeau
637	Evaluating context selection strategies to build Emotive Vector Space Models	Lucia C. Passaro and Alessandro Lenci
638	Large Scale Arabic Diacritized Corpus: Guidelines and Framework	Wajdi Zaghouani, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani and Kemal Oflazer
639	A Dutch dysarthric speech database for individualized speech therapy research	Emre Yilmaz, Mario Ganzeboom, Lilian Beijer, Catia Cucchiarini and Helmer Strik
640	Vocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech	Naim Terbeh and Mounir Zrigui
644	Personality traits on Twitter for less-resourced languages	Barbara Plank and Ben Verhoeven
647	A Sequence Model Approach to Relation Extraction in Portuguese	Sandra Collovini, Gabriel Machado and Renata Vieira
649	Neural Scoring Function for MST Parser	Jindřich Libovický
651	TEITOK: Text-Faithful Annotated Corpora	Maarten Janssen
652	Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP	Cristina Mota, Paula Carvalho and Anabela Barreiro
653	Extracting interlinear glossed text from LaTeX documents	Mathias Schenner and Sebastian Nordhoff
654	A Shared Task for Spoken CALL?	Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell and Helmer Strik
656	Lemmatization and morphological tagging in German and Latin: A comparison and a survey of the state-of-the-art	Steffen Eger, Alexander Mehler and Rüdiger Gleim
660	TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields	Tim vor der Brück and Alexander Mehler
662	Managing linguistic and terminological variation in a medical dialogue system	Leonardo Campillos Llanos, Dhouha Bouamor, Pierre Zweigenbaum and Sophie Rosset
663	Subtask Mining from Search Query Logs for How-Knowledge Acceleration	Chung-Lun Kuo and Hsin-Hsi Chen
665	Typology of adjectives benchmark for compositional distributional models	Daria Ryzhova, Maria Kyuseva and Denis Paperno
666	MultiVec: a multilingual and multilevel representation learning toolkit for NLP	Alexandre Berard, Christophe Servan, Olivier Pietquin and Laurent Besacier
668	BAS Speech Science Web Services - an Update of Current Developments	Thomas Kisler, Uwe Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl and Nina Pörner
669	Comparing the Level of Code-Switching in Corpora	Björn Gambäck and Amitava Das
671	Evaluation Set for Slovak News Information Retrieval	Daniel Hladek, Ján Staš and Jozef Juhár
676	Applying Argumentative Zoning and CoreSC to context-based citation recommendation	Daniel Duma, Maria Liakata, Amanda Clare, James Ravenscroft and Ewan Klein
677	Evaluation of the KIT Lecture Translation System	Markus Müller, Sarah Fünfer, Sebastian Stüker and Alex Waibel
678	Annotating opposition relations among Italian verb senses using crowdsourcing	Anna Feltracco, Simone Magnolini, Elisabetta Jezek and Bernardo Magnini
679	A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System	Ajda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White and Douglas Danforth
680	Assessing the Potential of Metaphoricity of verbs using corpus data	Marco Del Tredici and Nuria Bel
681	The ACL RD-TEC 2.0: A Language Resource for Evaluating Term and Entity Extraction Tools	Behrang Q. Zadeh and Anne-Kathrin Schumann
682	Sentiment Analysis in Social Networks through Topic modeling	Sidahmed Mokaddem, Debashis Naskar, Miguel Rebollo and Eva Onaindia
683	Filtering Wiktionary triangles by linear mapping between distributed models	Márton Makrai
685	A comparison of Named-Entity Disambiguation and Word Sense Disambiguation evaluation datasets	Angel Chang, Valentin I. Spitkovsky, Christopher D. Manning and Eneko Agirre
686	Persian Proposition Bank	Azadeh Mirzaei and Amirsaeid Moloodi
689	Dialogue System Characterization by Back-channelling Patterns Extracted from Dialogue Corpus	Masashi Inoue and Hiroshi Ueno
690	Creation of comparable corpora for English-{Urdu, Arabic, Persian}	Murad Abouammoh, Kashif Shah and Ahmet Aker
691	Detecting Annotation Scheme Variation in Out-of-Domain Treebanks	Yannick Versley
692	AraParl: A Parallel Corpus for Statistical Machine Translation between Arabic and European Languages	Nizar Habash and Nasser Zalmout
694	Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests	Takakazu Imada, Yusuke Inoue, Lei Chen, Syunya Doi, Tian Nie, Chen Zhao, Takehito Utsuro and Yasuhide Kawada
695	SciCorp: A corpus of English scientific articles annotated for information-structural analysis	Ina Roesiger
696	Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation	Wajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit and Kemal Oflazer
697	Universal Dependencies for Persian	Mojgan Seraji and Joakim Nivre
698	Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation	Md Shad Akhtar, Asif Ekbal and Pushpak Bhattacharyya
703	Cross-lingual and supervised models for morphosyntactic annotation: a comparison on Romanian	Lauriane Aufrant, Guillaume Wisniewski and François Yvon
704	A longitudinal radio broadcast database in Frisian designed for code-switching research	Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Frits Van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel and David van Leeuwen
705	GULF ARABIC RESOURCE BUILDING FOR SENTIMENT ANALYSIS	Wafia Adouane and Richard Johansson
706	Modeling Language Change in Historical Corpora: The Case of Portuguese	Marcos Zampieri, Shervin Malmasi and Mark Dras
708	Segmenting Hashtags using Automatically Created Training Data	Arda Celebi and Arzucan Özgür
709	If you even don't have a bit of Bible: learning delexicalized POS taggers	David Mareček, Daniel Zeman and Zdeněk Žabokrtský
711	Tools and Guidelines for Principled Machine Translation Development	Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel and Maja Popović
712	A lexical resource for the identification of “weak words” in German specification documents	Jennifer Krisch, Melanie Dick, Ronny Jauch and Ulrich Heid
714	Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition	Polina Yanovich, Carol Neidle and Dimitris Metaxas
718	PARSEME Survey on MWE Resources	Gyri S. Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann and Johanna Monti
719	Generating Task-Pertinent sorted Error Lists for Speech Recognition	Olivier Galibert, Mohamed Ameur Ben Jannet, Juliette Kahn and Sophie Rosset
722	The Event and Implied Situation Ontology (ESO): Application and Evaluation	Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau and Anne-Lyse Minard
727	Annotating Named Entities in Consumer Health Questions	Halil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts and Dina Demner-Fushman
730	Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text	Ravindra Harige and Paul Buitelaar
733	Data Management Plans and Data Centers	Denise DiPersio
734	Interoperability of Annotation Schemes: Using the Pepper framework to display AWA documents in the ANNIS interface	Talvany Carlotto, Zuhaitz Beloki, Xabier Artola and Aitor Soroa
735	What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis	Francesco Barbieri, Francesco Ronzano and Horacio Saggion
736	The SI TEDx-UM speech database: a new Slovenian spoken language resource	Andrej Zgank, Mirjam Sepesy Maucec and Darinka Verdonik
737	PARC 3.0: A Corpus of Attribution Relations	Silvia Pareti
738	Hard Time Parsing Questions: Building a QuestionBank for French	Djamé Seddah and Marie Candito
742	SuperCAT: the (new and improved) Corpus Analysis Toolkit	K. Bretonnel Cohen, William A. Baumgartner Jr. and Irina Temnikova
745	A Morphologically Annotated Corpus and a Morphological Analyzer for Moroccan Arabic	Aidan Kaplan, Ramy Eskander, Nizar Habash and Owen Rambow
746	Using lexical and dependency features to disambiguate discourse connectives in Hindi	Rohit Jain, Himanshu Sharma and Dipti Sharma
749	ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Method	Jinhua Du, Qun Liu and Andy Way
750	SPLIT: Smart Preprocessing (Quasi) Language Independent Tool	Mohamed Al-Badrashiny, Arfath Pasha, Mona Diab, Nizar Habash and Owen Rambow
755	A Verbal and Gestural Corpus of Story Retellings to an Expressive Virtual Character	Jackson Tolins, Jean Fox Tree, Marilyn Walker and Michael Neff
759	A Multimodal Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs	Jackson Tolins, Kris Liu, Yingying Wang, Jean E. Fox Tree, Marilyn Walker and Michael Neff
760	DALILA: The Dialectal Arabic Linguistic Learning Assistant	Salam Khalifa, Houda Bouamor and Nizar Habash
761	Refurbishing a Morphological Database for German	Petra Steiner
762	Towards Automatic Identification of Effective Clues for Team Word-Guessing Games	Eli Pincus and David Traum
766	Fostering the Next Generation of European Language Technology: Recent Developments – Emerging Initiatives – Challenges and Opportunities	Georg Rehm, Jan Hajic, Josef van Genabith and Andrejs Vasiļjevs
769	A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations	Laurent Prévot, Jan Gorisch and Roxane Bertrand
774	UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central — Towards State-of-the-Art Software Engineering of BioNLP Pipelines	Udo Hahn, Franz Matthies, Johannes Hellrich and Erik Faessler
778	Parallel Global Voices: a collection of multilingual corpora with citizen media stories	Prokopis Prokopidis, Vassilis Papavassiliou and Stelios Piperidis
779	Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks	Sebastian Schuster and Christopher D. Manning
780	The hunvec framework for NN-CRF-based sequential tagging	Katalin Pajkossy and Attila Zséder
782	Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports	Mathieu Lafourcade and Lionel Ramadier
783	Facilitating metadata interoperability in CLARIN-DK	Lene Offersgaard and Dorte Haltrup Hansen
784	Typed Entity and Relation Annotation on Computer Science Papers	Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao and Akiko Aizawa
785	Speech corpus spoken by young-old, old-old and oldest-old Japanese	Yurie Iribe, Norihide Kitaoka and Shuhei Segawa
786	EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs	Liu Hongchao, Karl Neergaard, Enrico Santus and Chu-Ren Huang
788	Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations	Morena Danieli, Balamurali A R, Evgeny Stepanov, Benoit Favre, Frederic Bechet and Giuseppe Riccardi
790	The Automatic Construction of Discourse Corpus for Dialogue Translation	Longyue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way and Qun Liu
791	TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation	Adrien Bougouin, Sabine Barreaux, Laurent Romary, Florian Boudin and Beatrice Daille
792	The Royal Society Corpus: From Uncharted Data to Corpus	Stefania Degaetano-Ortlieb, Hannah Kermes, Ashraf Khamis, Jörg Knappen and Elke Teich
795	SPA: web-based platform for easy access to speech modules	Fernando Batista, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos and Ricardo Ribeiro
796	Towards Building Semantic Role Labeler for Indian Languages	Maaz Anwar and Dipti Sharma
799	Analysing Constraint Grammars with a SAT-solver	Inari Listenmaa and Koen Claessen
800	The Scielo Corpus: a parallel corpus of scientific publications for biomedicine	Mariana Neves, Antonio Jimeno Yepes and Aurélie Névéol
801	ArchiMob - A corpus of Spoken Swiss German	Tanja Samardzic, Yves Scherrer and Elvira Glaser
802	Building Evaluation Datasets for Consumer-Oriented Information Retrieval	Lorraine Goeuriot, Liadh Kelly, Guido Zuccon and Joao Palotti
805	Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0	Zygmunt Vetulani, Grażyna Vetulani and Bartłomiej Kochanowski
806	Studying the Effects of Automatically Correcting Prepositions Using Source- and Target-Side Features in English–German SMT	Marion Weller, Alexander Fraser and Sabine Schulte im Walde
807	Polish rhythmic database – new resources for speech timing and rhythm analysis	Agnieszka Wagner, Katarzyna Klessa and Jolanta Bachan
808	Annotating Topic Development in Conversational Search Queries	Marta Andersson, Adnan Ozturel and Silvia Pareti
810	The Trials and Tribulations of Predicting Post-Editing Productivity	Lena Marg
811	Corpus vs. lexicon supervision in morphosyntactic tagging: the case of Slovene	Nikola Ljubešić and Tomaž Erjavec
812	Towards Multiple Antecedent Coreference Resolution in Specialized Discourse	Alicia Burga, Sergio Cajal, Joan Codina-Filba and Leo Wanner
813	Towards Building Proposition Bank for Urdu	Maaz Anwar and Dipti Sharma
814	Lin\|gu\|is\|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data	Christian Chiarcos, Christian Fäth, Heike Renner-Westermann, Frank Abromeit and Vanya Dimitrova
815	A Hungarian Sentiment Corpus Manually Annotated at Aspect Level	Martina Katalin Szabó, Veronika Vincze, Katalin Ilona Simkó, Viktor Varga and Viktor Hangya
816	Word Segmentation for Akkadian Cuneiform	Timo Homburg and Christian Chiarcos
820	Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing	Manuel Burghardt, Daniel Granvogl and Christian Wolff
822	Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German	Maria Sukhareva and Christian Chiarcos
823	A Large Scale Corpus of Gulf Arabic	Salam Khalifa, Nizar Habash and Dana Abdulrahim
826	CHATR the Corpus; a 20-year-old archive of Concatenative Speech Synthesis	Nick Campbell
829	Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View	Achim Stein
831	French Learners Audio Corpus of German Speech (FLACGS)	Jane Wottawa and Martine Adda-Decker
832	Effect functors for opinion inference	Josef Ruppenhofer and Jasper Brandes
835	A Regional News Corpora for Contextualized Entity Discovery and Linking	Adrian Brasoveanu, Lyndon J.B. Nixon, Albert Weichselbraun and Arno Scharl
836	Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Conversational Japanese	Hanae Koiso, Tomoyuki Tsuchiya, Ryoko Watanabe, Daisuke Yokomori, Masao Aizawa and Yasuharu Den
837	A Dataset for Open Event Extraction in English	Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret and Romaric Besançon
842	Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages	Muhammad Imran, Prasenjit Mitra and Carlos Castillo
847	Coreference Annotation Scheme and Relation Types for Hindi	Vandan Mujadia, Palash Gupta and Dipti Misra Sharma
849	Adapting the Cognitive MT Evaluation Approach to Arabic	Irina Temnikova, Wajdi Zaghouani, Stephan Vogel and Nizar Habash
850	Yes, We Care! Results of the Ethics and Natural Language Processing Surveys	Karën Fort and Alain Couillault
851	The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud	John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora and Jonathan Pool
852	Context-enhanced Adaptive Entity Linking	Giuseppe Rizzo, Filip Ilievski, Marieke van Erp, Julien Plu and Raphael Troncy
853	A Reading Comprehension Corpus for Machine Translation Evaluation	Carolina Scarton and Lucia Specia
854	Exposing Predicate Models as Linked Data by Extending the Lemon Model	Francesco Corcoglioniti, Alessio Palmero Aprosio, Marco Rospocher and Sara Tonelli
855	Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?	Shammur Absar Chowdhury, Evgeny Stepanov and Giuseppe Riccardi
857	Producing monolingual web corpora and bitext at the same time -- SpiderLing and bitextor's love affair	Nikola Ljubešić, Miquel Esplà-Gomis, Sergio Ortiz Rojas and Filip Klubička
859	A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction	Kalin Stefanov and Jonas Beskow
862	Multiword Expressions in Child Language	Rodrigo Wilkens, Marco Idiart and Aline Villavicencio
863	A framework for automatic acquisition of Croatian and Serbian verb aspect from corpora	Tanja Samardzic and Maja Miličević
864	Towards a Language Service Infrastructure for Mobile Environments	Ngoc Nguyen, Donghui Lin, Takao Nakaguchi and Toru Ishida
865	Specialising Paragraph Vectors for Text Polarity Detection	Fabio Tamburini
867	NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models	Frederico Tommasi Caroli, André Freitas and João Carlos Pereira da Silva
868	Database of Mandarin Neighborhood Statistics	Karl Neergaard, Hongzhi Xu and Chu-Ren Huang
870	Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature	Kata Gábor, Haifa Zargayouna, Davide Buscaldi, Isabelle Tellier and Thierry Charnois
871	Designing A Long Lasting Linguistic Project: The Case Study of ASIt	Maristella Agosti, Emanuele Di Buccio, Giorgio Maria Di Nunzio, Cecilia Poletto and Esther Rinke
873	UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing	Milan Straka, Jan Hajic and Jana Straková
874	Controlled propagation of concept annotations in textual corpora	Cyril Grouin
875	Wow! What a useful extension to wordnet!	Luís Morgado da Costa and Francis Bond
878	Exploiting Arabic Diacritization for High Quality Automatic Annotation	Nizar Habash, Anas Shahrour and Muhamed Al-Khalil
879	Graph-Based Induction of Word Senses in Croatian	Marko Bekavac and Jan Šnajder
880	The Public License Selector:  Making Open Licensing Easier	Pawel Kamocki, Pavel Straňák and Michal Sedlák
881	An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation	Peter Viszlay, Ján Staš, Tomáš Koctúr, Martin Lojka and Jozef Juhár
882	Coreference in Prague Czech-English Dependency Treebank	Anna Nedoluzhko, Michal Novák, Silvie Cinkova, Marie Mikulová and Jiří Mírovský
885	SlangNet: A WordNet like resource for English Slang	Shehzaad Dhuliawala, Diptesh Kanojia and Pushpak Bhattacharyya
886	A Conventional Orthography for Maghrebi Arabic	Houcemeddine Turki, Emad Adel, Tariq Daouda and Nassim Regragui
887	Towards Standardization of Linguistic Graph Banks for Semantic Parsing	Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinkova, Dan Flickinger, Jan Hajic, Angelina Ivanova and Zdenka Uresova
888	Joining-in-type Humanoid Robot Assisted Language Learning System	AlBara Khalifa, Tsuneo Katou and Seiichi Yamamoto
889	Searching in the Penn Discourse Treebank using the PML-Tree Query	Jiří Mírovský, Lucie Poláková and Jan Štěpánek
892	Rapid Development of Morphological Analyzers for Typologically Diverse Languages	Seth Kulick and Ann Bies
895	DBpedia Abstracts: A Large, Multilingual NLP Training Corpus	Martin Brümmer, Milan Dojchinovski and Sebastian Hellmann
896	Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings	Eda Okur, Hakan Demir and Arzucan Özgür
898	B2SG: a TOELF-like task for Portuguese	Rodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira and Aline Villavicencio
899	A Multi-domain Corpus of Swedish Word Sense Annotation	Richard Johansson, Yvonne Adesam, Gerlof Bouma and Karin Hedberg
902	A Corpus of Native, Non-native and Translated Texts	Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu and Shuly Wintner
904	Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics	Dirk Hovy and Anders Johannsen
905	“He Said She Said” – Male/Female Corpus of Polish	Filip Graliński, Łukasz Borchmann and Piotr Wierzchoń
908	Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)	Damir Cavar, Malgorzata Cavar and Lwin Moe
912	Annotating Ideological Perspective in Arabic Social Media	Heba Elfardy and Mona Diab
913	A crowdsourced database of event sequence descriptions for the acquisition of high-quality script knowledge	Lilian D. A. Wanzare, Alessandra Zarcone, Stefan Thater and Manfred Pinkal
915	GATE-Time: Extraction of Temporal Expressions and Events	Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood and Manuel Jung
916	Wiktionnaire's wikicode GLAWIfied: a workable French Machine-Readable Dictionary	Nabil Hathout and Franck Sajous
917	Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus	Simon Clematide, Lenz Furrer and Martin Volk
918	corpus-tools.org: An interoperable generic software tool set for multi-layer linguistic corpora	Stephan Druskat, Volker Gast, Thomas Krause and Florian Zipser
919	On Developing Resources for Patient-level Information Retrieval	Stephen Wu, Tamara Timmons, Amy Yates, Meikun Wang, Steven Bedrick, William Hersh and Hongfang Liu
920	Graphical Annotation for Syntax-Semantics Mapping	Koiti Hasida
921	Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests	Takuya Matsuzaki, Akira Fujita, Naoya Todo and Noriko H. Arai
922	Monolingual Social Media Datasets for Detecting Contradiction and Entailment	Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva and Thierry Declerck
923	Cohere: A Toolkit for Local Coherence	Karin Sim Smith, Wilker Aziz and Lucia Specia
926	Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job	Marieke van Erp, Pablo Mendes, Heiko Paulheim, Filip Ilievski, Julien Plu, Giuseppe Rizzo and Joerg Waitelonis
927	A machine learning framework for humor response prediction	Dario Bertero and Pascale Fung
928	Evaluating multi-label annotation in scientific articles - The Multi-CoreSC Cancer Risk Assessment (CRA) corpus	Anika Oellrich, Maria Liakata and Shyamasree Saha
930	Improving the Annotation of Sentence Specificity	Junyi Jessy Li, Bridget O'Daniel, Yi Wu, Wenli Zhao and Ani Nenkova
931	Distributional thesauri for Information Retrieval and vice versa	Ewa Kijak
932	Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments	Rafiya Begum, Kalika Bali, Monojit Choudhury, Koustav Rudra and Niloy Ganguly
934	ALT Explored: Integrating an online dialectometric tool and an online dialect atlas	Martijn Wieling, Eva Sassolini, Sebastiana Cucurullo and Simonetta Montemagni
936	Czech Legal Text Treebank 1.0	Vincent Kríž, Barbora Hladka and Zdenka Uresova
939	Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects	David Lewis, Kaniz Fatema, Alfredo Maldonado, Brian Walshe and Arturo Calvo
941	MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment	YU Yuan, Serge Sharoff and Bogdan Babych
942	Detecting Expressions of Blame or Praise in Text	udochukwu orizu
943	NorGramBank: a ‘deep’ treebank for Norwegian	Helge Dyvik, Paul Meurer, Victoria Rosén, Koenraad De Smedt, Petter Haugereid, Gyri S. Losnegaard, Gunn Inger Lyse and Martha Thunes
946	VerbLexPor: a lexical resource with semantic roles for Portuguese	Leonardo Zilio
947	OpenSubtitles2015: Extracting Large Parallel Corpora from Movie and TV Subtitles	Pierre Lison and Jörg Tiedemann
950	A Multilingual Predicate Matrix	Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe and German Rigau
951	Towards producing bilingual lexica from monolingual corpora	Jingyi Han and Núria Bel
955	A Neural Lemmatizer for Bengali	Abhisek Chakrabarty, Akshay Chaturvedi and Utpal Garain
956	Crowdsourcing a multi-lingual speech corpus: recording, transcription and annotation of the CrowdIS corpora	Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl and Paula Buttery
958	Predicting author age from Weibo microblog posts	Wanru Zhang, Andrew Caines and Paula Buttery
959	First steps towards coverage-based sentence alignment	Luís Gomes and Gabriel Pereira Lopes
960	CommonCOW: massively huge web corpora from CommonCrawl data and a method to distribute them freely under restrictive EU copyright laws	Roland Schäfer
965	Sentiframes: A Resource for Verb-centered German Sentiment Inference	Manfred Klenner and Michael Amsler
966	Temporal Information Annotation: Crowd vs. Experts	Tommaso Caselli, Rachele Sprugnoli and Oana Inel
968	Effects of Sampling on Twitter Trend Detection	Andrew Yates, Alek Kolcz, Nazli Goharian and Ophir Frieder
969	Studying the temporal dynamics of word co-occurrences: An application to event detection	Daniel Preoţiuc-Pietro, P. K. Srijith, Mark Hepple and Trevor Cohn
975	Parallel Chinese-English Entities, Relations and Events Corpora	Justin Mott, Ann Bies, zhiyi song and Stephanie Strassel
976	Automatic Biomedical Term Polysemy Detection	Juan Antonio Lossio Ventura, Clement Jonquet, Mathieu Roche and Maguelonne Teisseire
977	Government domain named entity recognition for South African languages	Roald Eiselen
979	Learning Thesaurus Relations from Distributional Features	Rosa Tsegaye Aga, Christian Wartena, Lucas Drumond and Lars Schmidt-Thieme
985	Automatic classification of tweets for analyzing communication intention of museums	Nicolas Foucault and Antoine Courtin
987	Named Entity Resources - Overview and Outlook	Maud Ehrmann, Damien Nouvel and Sophie Rosset
988	A Multi-Genre Corpus for English Affect Texts	Shabnam Tafreshi
990	CLARIN-EL Web-based Annotation Tool	Ioannis Manousos Katakis, Georgios Petasis and Vangelis Karkaletsis
992	Adapting the TANL tool suite to Universal Dependencies	Maria Simi
993	Markov Logic Networks for Text Mining	Luis Gerardo Mojica de la Vega and Vincent Ng
994	Merging an Inflectional Dictionary with a Resource of Derivational Morphology of Czech	Adéla Limburská, Milan Straka, Magda Sevcikova, Jonáš Vidra and Zdeněk Žabokrtský
996	Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level	Marcos Garcia
999	EDISON: Feature Extraction for NLP, Simplified	Mark Sammons, Christos Christodoulopoulos, Parisa Kordjamshidi, Daniel Khashabi, Vivek Srikumar and Dan Roth
1002	Entity Linking with a Paraphrase Flavor	Maria Pershina, Yifan He and Ralph Grishman
1003	Semantic Links for Portuguese	Fabricio Chalub, Livy Real, Alexandre Rademaker and Valeria de Paiva
1004	A Gold Standard for Scalar Adjectives	Bryan Wilkinson
1005	Event Reference Interpretation with Multi-Pass Sieves	Jing Lu and Vincent Ng
1006	Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR	Malgorzata Cavar, Damir Cavar and Hilaria Cruz
1007	The PsyMine Corpus	Tilia Ellendorff, Simon Foster and Fabio Rinaldi
1009	A finite-state morphological analyser for Tuvan	Francis Tyers, Aziyana Bayyr-ool, Aelita Salchak and Jonathan Washington
1011	Accurate Deep Syntactic Parsing of Graphs: The Case of French	Corentin Ribeyre, Eric Villemonte de la Clergerie, Marie Candito and Djamé Seddah
1012	QTLeap WSD/NED corpora: Semantic annotation of parallel corpora in six languages	Arantxa Otegi, Nora Aranberri, António Branco, Jan Hajic, Martin Popel, Kiril Simov and Eneko Agirre
1013	Orthographic and Morphological Correspondences between Closely Related Slavic Languages as a Base for Cognate Extraction	Andrea Fischer, Klara Jagrova, Irina Stenger, Tania Avgustinova, Dietrich Klakow and Roland Marti
1014	An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes	Kai Frederic Engelmann, Patrick Holthaus, Britta Wrede and Sebastian Wrede
1018	C-WEP---Rich Annotated Collection of Writing Errors by Professionals	Cerstin Mahlow
1021	Two architectures for parallel processing for huge amounts of text	Mathijs Kattenberg, Zuhaitz Beloki, Aitor Soroa, Xabier Artola, Antske Fokkens, Paul Huygen and Kees Verstoep
1023	Parsing Icelandic using the Berkeley Parser: Prospects for big data syntactic research	Anton Ingason, Eiríkur Rögnvaldsson and Einar Sigurdsson
1026	Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource	Stephan Tulkens, Chris Emmery and Walter Daelemans
1027	VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian	Ivan Sekulić and Jan Šnajder
1031	Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages	John Sylak-Glassman, Christo Kirov and David Yarowsky
1032	Sieve-based coreference resolution in the biomedical domain	Dane Bell, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega and Mihai Surdeanu
1033	Rule-based automatic multi-word term extraction and lemmatization	Ranka Stankovic, Cvetana Krstev, Ivan Obradovic, Biljana Lazic and Aleksandra Trtovac
1035	The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes	Phil Bartie, william mackaness, Dimitra Gkatzia and Verena Rieser
1039	A new integrated open-source morphological analyzer for Hungarian	Attila Novák, Borbála Siklósi and Csaba Oravecz
1040	Ecological gestures for HRI: the GEE corpus	Maxence Girard-Rivier, Romain Magnani, Veronique Auberge, Yuko Sasa and Liliya Tsvetanova
1041	Trends in HLT Research: A Survey of LDC's Data Scholarship Program	Denise DiPersio
1044	Semi-automatic Parsing for Web Knowledge Extraction through Semantic Annotation	Maria Pia di Buono
1046	How to Address Smart Homes with a Social Robot? A multi-modal corpus of user interactions with an intelligent environment	Patrick Holthaus and Christian Leichsenring
1048	Transfer-based learning-to-rank assessment of medical term technicality	Dhouha Bouamor, Leonardo Campillos Llanos, Anne-Laure Ligozat, Sophie Rosset and Pierre Zweigenbaum
1050	“Who was Pietro Badoglio?” Towards a QA system for Italian History	Stefano Menini, Rachele Sprugnoli and Antonio Uva
1052	Enriching a Portuguese Wordnet through a Synonyms Dictionary	Alberto Simões, Xavier Gómez Guinovart and José João Almeida
1053	Error Corpus of Croatian Written Language	Vanja Štefanec and Nikola Ljubešić
1054	MARMOT: A Toolkit for Translation Quality Estimation at the Word Level	Varvara Logacheva, Chris Hokamp and Lucia Specia
1060	New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification	Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur and John Godfrey
1061	An Annotated Corpus of Direct Speech	John Lee and Chak Yan Yeung
1063	Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola	Marco Stranisci, Cristina Bosco, Delia Irazú Hernández Farías and Viviana Patti
1066	A Proposal for a Part-of-Speech Tagset for the Albanian Language	Besim Kabashi and Thomas Proisl
1068	Axolotl: A web accessible parallel corpus for spanish-nahuatl	Ximena Gutierrez-Vasques, Gerardo Sierra and Jonathan Salinas
1069	A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels	Vinodkumar Prabhakaran and Owen Rambow
1070	NLP Infrastructure for the Lithuanian Language	Daiva Vitkutė-Adžgauskienė, Andrius Utka, Darius Amilevičius and Tomas Krilavičius
1071	The DIRHA Portuguese corpus: A comparison of home automation command detection and recognition in simulated and real data.	Miguel Matos, Alberto Abad and António Serralheiro
1074	Enhanced CORILGA: Introducing the automatic phonetic alignment tool for continuous speech	Roberto Seara, Marta Martinez, Rocio Varela, Carmen Garcia-Mateo, Elisa Fernandez Rei and Xose Luis Regueira
1076	An Empirical Exploration of Moral Foundations Theory in Partisan News Sources	Daniel Preoţiuc-Pietro, Dean Fulgoni, Jordan Carpenter and Lyle Ungar
1077	Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms	Christo Kirov, John Sylak-Glassman, Roger Que and David Yarowsky
1078	Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models	Steven Neale, Luís Gomes, Eneko Agirre, Oier Lopez de Lacalle and António Branco
1083	Focus Annotation in Authentic Data: a comparison of expert and crowd-sourced annotation	Kordula De Kuthy, Ramon Ziai and Detmar Meurers
1085	The OpenCourseWare Metadiscourse (OCWMD) Corpus	Ghada Alharbi and Thomas Hain
1087	Arabic Corpora for Credibility Analysis	Ayman Al Zaatari, Reem El Ballouli, Shady ELbassouni, Wassim El-Hajj, Hazem Hajj, Khaled Shaban and Nizar Habash
1095	Tēzaurs.lv: the Largest Open Lexical Database for Latvian	Andrejs Spektors, Ilze Auziņa, Roberts Darģis, Normunds Grūzītis, Pēteris Paikens, Lauma Pretkalniņa, Laura Rituma and Baiba Saulīte
1100	NULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic	Samhaa El-Beltagy
1101	VoxML: A Visualization Modeling Language	James Pustejovsky and Nikhil Krishnaswamy
1102	Domain Adaptation for Named Entity Recognition Using CRFs	Tian Tian, Marco Dinarelli, Isabelle Tellier and Pedro Dias Cardoso
1105	Possessions identification in text	Carmen Banea, Xi Chen and Rada Mihalcea
1110	Example-based Acquisition of Fine-grained Collocation Resources	Sara Rodríguez-Fernández, Roberto Carlini, Luis Espinosa Anke and Leo Wanner
1115	Coh-Metrix-Esp: A Complexity Analysis tool for documents written in Spanish	Andre Quispesaravia, Walter Perez, Marco Sobrevilla and Fernando Alva-Manchego
1116	Metonymy Analysis Using Associative Relations between Words	Takehiro Teraoka
1117	Age and Gender Prediction on Health Forum Data	Prasha Shrestha, Nicolas Rey-Villamizar, Farig Sadeque, Ted Pedersen, Steven Bethard and Thamar Solorio
1118	Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project	Malgorzata Cavar, Dov-Ber Kerler, Damir Cavar and Anya Quilitzsch
1119	Manual and Automatic Paraphrases for MT Evaluation	Aleš Tamchyna and Petra Barancikova
1120	KodE Alltag: A German-Language E-Mail Text Corpus as an Emerging Reference Corpus for Balanced Everyday German Language Usage	Ulrike Krieg-Holz, Christian Schuschnig and Udo Hahn
1121	ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions	Olga Uryupina and Massimo Poesio
1122	Embedding Open-domain Common-sense Knowledge from Text	Travis Goodwin and Sanda Harabagiu
1124	A finite-state morphological analyser for Sindhi	Raveesh Motlani and Francis Tyers
1126	Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it	Rob Abbott, Brian Ecker, Pranav Anand and Marilyn Walker
1129	Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data	Éva Mújdricza-Maydt, Silvana Hartmann, Anette Frank and Iryna Gurevych
1130	Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel	Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths
1131	A survey of multiword expression annotations in treebanks	Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary, Adam Przepiórkowski and Verginica Mitetelu
1132	A DNN-HMM BASED QUERY BY HUMMING SYSTEM	Pascale Fung
1133	Publishing the Trove Newspaper Corpus	Steve Cassidy
1134	Deriving morphological analyzers from example inflections	Markus Forsberg and Mans Hulden
1137	Corpus Query Lingua Franca (CQLF)	Piotr Banski, Elena Frick and Andreas Witt
1138	LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages	Stephanie Strassel and Jennifer Garland
1140	A Computational Perspective on the Romanian Dialects	Alina Maria Ciobanu and Liviu P. Dinu
1141	Improving corpus search via parsing	Natalia Klyueva and Pavel Straňák
1142	Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation	Jeevanthi Liyanapathirana and Andrei Popescu-Belis
1144	Revisiting Summarization Evaluation for Scientific Articles	Arman Cohan and Nazli Goharian
1148	Morphological analysis of Sahidic Coptic for automatic glossing	Daniel Smith and Mans Hulden
1149	Ubuntu-fr: A large and open corpus for multi-modal analysis of online written conversations	Nicolas Hernandez and Soufian Salim
1150	Providing a catalogue of Language Resources for Commercial Users	Bente Maegaard, Lina Henriksen, Sussi Olsen, Gerhard Budin, Luz Esparza, Andrew Joscelyne, Steven Krauwer, Vesna Lusicky, Margaretha Mazura, Blanca Rodrigues and Philippe Wacker
1151	A Turkish-German Code-Switching Corpus	Özlem Çetinoğlu
1154	The DTA Base Format (DTABf) - Enabling Corpus-based Linguistic Research on Structural Phenomena	Susanne Haaf
1155	A Web Tool for Building Parallel Corpora of Spoken and Sign Languages	Fabio Kepler, Alex Becker and Sara Candeias
1156	Introducing the LCC Metaphor Datasets	Michael Mohler, Mary Brunson, Marc Tomlinson and Bryan Rink
1157	The on-line version of Grammatical Dictionary of Polish	Marcin Woliński
1159	Comparing Speech and Text Classification on ICNALE	Sergiu Nisioi
1160	Passing a USA National Bar Exam: a First Corpus for Experimentation	Biralatei Fawei, Adam Wyner and Jeff Pan
1161	Creating a large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data	Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani and Mohamed Al-Badrashiny
1162	Factuality annotation and learning in Spanish texts	Dina Wonsever, Aiala Rosá and Marisa Malcuori
1163	What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems	Emma Barker, Adam Funk, Monica Paramita, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple and R. Gaizauskas
1167	Using Word Embeddings to Translate Named Entities	Octavia-Maria Şulea, Sergiu Nisioi and Liviu P. Dinu
1171	The Alaskan Athabascan Grammar Database	Sebastian Nordhoff, Siri Tuttle and Olga Lovick
1172	Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic	Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim and Mona Diab
1174	Towards a corpus-based online tool for French - Sign Language (LSFB) needs	Laurence Meurant and Anthony Cleve
1179	Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual Entailment	Octavian Popescu and Ngoc Phuoc An Vo
1180	DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter	Ye Tian, Julian Hough, Laura de Ruiter, Simon Betz, David Schlangen and Jonathan Ginzburg
1182	The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015	Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama and Hugo Zaragoza
1184	Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.	Emer Gilmartin and Nick Campbell
1190	Medical Concept Embeddings via Labeled Background Corpora	Eneldo Loza Mencía, Gerard de Melo and Jinseok Nam
1192	Enriching TimeBank: Towards a more precise annotation of temporal relations in a text	Volker Gast, Lennart Bierkandt, Stephan Druskat and Christoph Rzymski
1194	Phrase Level Segmentation and Labelling of Machine Translation Errors	Frédéric Blain, Varvara Logacheva and Lucia Specia
1195	The United Nations Parallel Corpus v1.0	Michał Ziemski, Marcin Junczys-Dowmunt and Bruno Pouliquen
1197	Building the Macedonian-Croatian Parallel Corpus	Ines Cebović and Marko Tadić
1198	The ACQDIV database: min(d)ing the ambient language	Steven Moran
1199	OPFI: A Tool for Opinion Finding in Polish	Aleksander Wawer
1200	Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation	Domagoj Alagić and Jan Šnajder
1201	Creating Linked Data Morphological Language Resources with MMoOn -The Hebrew Morpheme Inventory	Bettina Klimek, Natanael Arndt, Sebastian Krause and Timotheus Arndt
1203	A Tangled Web: The Faint Signals of Deception in Text	Franco Salvetti, John Brandon Lowe and James H. Martin
1204	A taxonomy of Spanish nouns, a statistical algorithm to generate it and its implementation in open source code	Rogelio Nazar and Irene Renau
1208	Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy	Mohamed Outahajala and Paolo Rosso
1209	SatiricaLR: a language resource of satirical news articles	Alice Frain and Sander Wubben
1210	The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval	Kira Griffitt and Stephanie Strassel
1211	RankDCG: Rank-Ordering Evaluation Measure	Denys Katerenchuk and Andrew Rosenberg
1212	Spanish word vectors from Wikipedia	Dina Wonsever and Mathias Etcheverry
1214	Synset Ranking of Hindi WordNet	Sudha Bhingardive, Rajita Shukla, Jaya Jha, Laxmi Kashyap, Dhirendra Singh and Pushpak Bhattacharya
1215	Evaluating Lexical Similarity to build Sentiment Similarity	Grégoire Jadi, vincent claveau, Béatrice Daille and Monceaux Laura
1219	Two Years of Aranea: Increasing Counts and Tuning the Pipeline	Vladimír Benko
1220	Cross-lingual RDF Thesauri Interlinking	Tatiana Lesnikova, Jérôme David and Jérôme Euzenat
1222	Annotating and Detecting Medical Events in Clinical Notes	Prescott Klassen, Fei Xia and Meliha Yetisgen
1223	Neural embeddings language models in semantic clustering of web search results for Russian	Andrey Kutuzov
1224	Collecting Language Resources for the Latvian e-Government Machine Translation Platform	Roberts Rozis, Raivis Skadiņš and Andrejs Vasiļjevs
1225	Multiword Expressions Dataset for Indian Languages	Dhirendra Singh, Sudha Bhingardive and Pushpak Bhattacharya
1226	Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations	Ichiro Umata, Koki Ijuin, Mitsuru Ishida, Moe Takeuchi and Seiichi Yamamoto
1227	The Validation of MRCPD Cross-language Expansions on Imageability Ratings	Ting Liu, Kit Cho, Tomek Strzalkowski, Samira Shaikh and Mehrdad Mirzaei
1228	The Language Application Grid and Galaxy	Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri
1233	Using Data Mining Techniques for Sentiment Shifter Identification	Samira Noferesti and Mehrnoush Shamsfard
1234	A Dependency Treebank of the Chinese Buddhist Canon	Tak-sum Wong and John Lee
1235	PoliCon: a corpus of metaphor in political conﬂict discourse	Andrew Gargett and John Barnden
1236	Hidden resources – strategies to acquire and exploit potential spoken language resources in national archives	Jens Edlund and Joakim Gustafson
1237	Learning from Within: Comparing PoS Tagging Approaches for Historical Text	Sarah Schulz and Jonas Kuhn
1238	Constraint-based Bilingual Dictionary Induction for Indonesian Ethnic Languages	Arbi Haza Nasution, Yohei Murakami and Toru Ishida
1239	Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification	Simone Hantke, Erik Marchi and Björn Schuller
1242	Question-Answering with Logic Specific to Video Games	Corentin Dumont, Ran Tian and Kentaro Inui
1244	SubCo: A Learner Translation Corpus of Human and Machine Subtitles	José Manuel Martínez Martínez and Mihaela Vela
1245	Building Tempo-IndoWordNet:A resource for effective temporal information access in Hindi	Dipawesh Pawar, Mohammed Hasanuzzaman and Asif Ekbal
1248	Multi-language Speech Collection for NIST LRE	Karen Jones, David Graff, Jonathan Wright, Kevin Walker and Stephanie Strassel
1250	ELRA's Activities and Services: 20 Years for the HLT Community	Khalid Choukri, Valérie Mapelli and Hélène Mazo
1251	Language Resource Citation: the ISLRN Dissemination and Further Developments	Khalid Choukri, Valérie Mapelli, Lin Liu and Vladimir Popescu
1252	The ELRA License Wizard	Khalid Choukri, Valérie Mapelli, Lin Liu and Vladimir Popescu
1253	Review on the Existing Language Resources for Languages of France	Thibault Grouas, Valérie Mapelli and Quentin Samier
1254	Selection Criteria for Low Resource Language Programs	Christopher Cieri, Mike Maxwell, Stephanie Strassel and Jennifer Tracey
1256	New Developments in the LRE Map	Vladimir Popescu, Lin Liu, Riccardo Del Gratta, Khalid Choukri and Nicoletta Calzolari
1257	CASSAurus: A Resource of Simpler Spanish Synonyms	Ricardo Baeza-Yates, Luz Rello and Julia Dembowski
1258	Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets (Abstract)	Eduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth and Björn Schuller
1259	Enhancing Cross-border EU e-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities	Meritxell Fernández Barrera and Vladimir Popescu

Important Dates

25 October 2015: Submission of proposals for oral and poster papers
25 October 2015: Submission of proposals for panels, workshops and tutorials
27 November 2015: Notification of acceptance of worksohps and tutorials
1st February 2016: Notification of accepted papers
18th February 2016: Online Registration
17th March 2016: Final Submission Deadline
23-24 May 2016: Pre-conference Workshops & Tutorials
25-26-27 May 2016: Main Conference
28 May 2016: Workshops & Tutorials

Links

ELRAILCPhotos Credits

Latest Tweets

Tweets by @LREC2016

Share this page!

Important Dates

Links

Latest Tweets