CICLing 2014 Accepted Papers

Notes:
Dat Huynh, Dat Tran and Wanli Ma. Semantic Similarity Measure Using Relational and Latent Topic Features
Alexey An, Bakytkan Dauletbakov and Eugene Levner. Multi-attribute classification of text documents as a tool for ranking and categorization of educational innovation projects
Su Fei, Gang Chen and Xinyan Xiao. Beam-Width Adaptation for Hierarchical Phrase-Based Translation
Sebastian Schmidt, Steffen Schnitzer and Christoph Rensing. Effective Classification of Ambiguous Web Documents Incorporating Human Feedback Efficiently
Caio Teixeira, Ivandré Paraboni, Adriano Silva and Alan Yamasaki. Generating Relational Descriptions involving Mutual Disambiguation
Thiago Ferreira and Ivandre Paraboni. Classification-based Referring Expression Generation
Prasad Perera and Leila Kosseim. Evaluation of Sentence Compression Techniques Against Human Performance
Eric Kergosien, Cédric Lopez, Mathieu Roche and Maguelonne Teisseire. Looking for Opinion in Land-use Planning Corpora
Anselmo Peñas, Bernardo Cabaleiro and Mirella Lapata. Unsupervised Interpretation of Eventive Propositions
David Bracewell, Marc Tomlinson, Michael Mohler and Bryan Rink. A Tiered Approach to the Recognition of Metaphor
Marie Duzi. Isomorphism of structured meanings and synonymy
Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi. Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge
Tiansi Dong and Armin B. Cremers. A Novel Machine Translation Method for Learning Chinese as a Foreign Language
Amir Hazem. Improving Bilingual Lexicon Extraction from Comparable Corpora using Window-based and Syntax-based Models
Henning Wachsmuth, Martin Trenkmann, Benno Stein, Gregor Engels and Tsvetomira Palakarska. A Review Corpus for Argumentation Analysis
Kfir Bar and Nachum Dershowitz. Inferring Paraphrases for a Highly Inflected Language from a Monolingual Corpus
Mohamed Farouk Abdel Hady and Abubakrelsedik Karali. Unsupervised Active Learning for Cross-Lingual Information Extraction
David Mareček and Zdeněk Žabokrtský. Dealing with Function Words in Unsupervised Dependency Parsing
Lionel Ramadier, Manel Zarrouk, Mathieu Lafourcade and Antoine Micheau. Spreading Relation Annotations in a Lexical Semantic Network Applied to Radiology
Suraj Maharjan, Prasha Shrestha, Gabriela Ramirez, Alan Sprague, Thamar Solorio and Gary Warner. Using String Information for Malware Family Identification
Sandra Bringay, Eric Kergosien, Pierre Pompidor and Pascal Poncelet. Emotions target in health forums
Roris Victor, Juan M. Santos, Roberto Perez-Rodriguez, Carlos Rivas, Miguel Gomez and Luis Anido Rifon. Information Extraction in Semantic and Semi-Structured Web Sources
Elizaveta Clouet and Beatrice Daille. Compound Terms and their Multi-Word Variants: Case of German and Russian Languages
Meishan Zhang, Yue Zhang, Wanxiang Che and Ting Liu. A Semantic-Oriented Grammar for Chinese Treebanking
Santanu Pal, Pintu Lohar and Sudip Kumar Naskar. Role of Paraphrases in PB-SMT
Lior Wolf, Yair Hanani, Kfir Bar and Nachum Dershowitz. Joint word2vec Networks for Bilingual Semantic Representations
Imene Bensalem, Paolo Rosso and Salim Chikhi. Intrinsic Plagiarism Detection using N-grams Frequency Classes
Aibek Makazhanov, Olzhas Makhambetov, Islam Sabyrgaliyev and Zhandos Yessenbayev. Spelling Correction for Kazakh
Gurpreet Singh Lehal, Tejinder Singh Saini and Preetpal Kaur Buttar. Automatic Bilingual Legacy-Fonts Identification and Conversion System
Abeba Ibrahim and Yaregal Assabie. Amharic Sentence Parsing Using Base Phrase Chunking
Niraj Kumar. A Graph Based Automatic Plagiarism Detection Technique to Handle The Artificial Word Reordering and Paraphrasing
György Orosz, Attila Novák and Gábor Prószéky. Lessons learned from tagging clinical Hungarian
Kanako Komiya, Shohei Shibata and Yoshiyuki Kotani. Cross-lingual Product Recommendation Using Collaborative Filtering With Translation Pairs
Apurbalal Senapati and Utpal Garain. A Maximum Entropy based Honorificity Identification for Bengali Pronominal Anaphora Resolution
Vivek Datla, King-Ip Lin and Max Louwerse. Linguistic features predict the truthfulness of short political statements
Reinhard Rapp. Using Word Association Norms to Measure Corpus Representativeness
Ophélie Lacroix, Denis Bechet and Florian Boudin. Label Pre-annotation for Building Non-projective Dependency Treebanks for French
Diana Inkpen and Amir Hossein Razavi. Topic Classification using Latent Dirichlet Allocation at Multiple Levels
Daiga Deksne, Raivis Skadiņš and Inguna Skadiņa. Extended CFG formalism for grammar checker and parser development
Nir Ofek, Lior Rokach and Prasenjit Mitra. Methodology for Connecting Nouns to their Modifying Adjectives
Cyrine Nasri, Kamel Smaili and Chiraz Latiri. Statistical Machine Translation Without Alignments
Zuzana Neverilova. Acquiring annotated data for recognizing textual entailment by means of a game
Jessica Perrie, Aminul Islam and Evangelos Milios. How Document Properties Affect Document Relatedness Measures
Bassam Hammo, Asma Moubaiddin, Nadim Obeid and Abeer Tuffaha. Understanding Arabic Syntactic Structure in Light of the Government and Binding Theory
Miguel Rios and Lucia Specia. Statistical Relational Learning to Recognise Textual Entailment
Amit Mishra and Sanjay Kumar Jain. An Approach for Computing Sentiment Polarity of Complex Why Type Opinion Questions Asked on Product Review Sites
Nobal B. Niraula and Vasile Rus. A Machine Learning Approach to Anaphora Resolution in Dialogue based Intelligent Tutoring Systems
Md. Abdullah Al Mumin, Abu Awal Md. Shoeb, Mohammad Reza Selim and Muhammed Zafar Iqbal. A Representative Bengali Corpus for Intelligent Text Processing
Ruifeng Xu, Jun Xu, Bin Liu and Lin Yao. News Reader’s Emotion Prediction Using Concept and Concept Sequence Features in Headline
Yuta Kikuchi, Hiroya Takamura, Manabu Okumura and Satoshi Nakazawa. Identifying a Demand towards a Company in CGM
Bayar Tsolmon and Kyung-Soon Lee. Extracting Social Events based on Timeline and User Reliability Analysis on Twitter
Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer and Anna Korhonen. Verb clustering for Brazilian Portuguese
Pranay Kumar Venkata Sowdaboina, Sutanu Chakraborti and Sripada Somayajulu G. Learning to summarize time series data
Pat Hall, Bal Krishna Bal, Sagun Dhakwa and Bhim Narayan Regmi. Issues in Encoding the Writing of Nepal’s Languages
Marina Litvak and Natalia Vanetik. Multi-document Summarization using Tensor Decomposition
D Indumathi, A Chitra and J Bineeshia. Search Query Expansion using Concept based Clustering for Improved Personalized Search
Nibaran Das, Swarnendu Ghosh, Teresa Goncalves and Paulo Queresma. Comparison of different graph distance metrics for semantic text based classification
Nikola Ljubešić, Tomaž Erjavec and Darja Fišer. Standardizing Tweets with Character-level Machine Translation
Lakshmi S and Sobha Lalitha Devi. Rule Based Case Transfer in Tamil-Malayalam MT
Rabeb Mbarek, Mohamed Tmar and Hawete Hattab. A New Relevance Feedback Algorithm Based on Vector Space Basis Change
Sobha Lalitha Devi, Lakshmi S and Sindhuja Gopalan. Discourse Tagging for Indian Languages
Tuan Dinh, Hung Phan and Quan Tran. Evaluating prosodic characteristics for Vietnamese Aviation announcements
Vijay Sundar Ram, Efstathios Stamatatos and Sobha Lalitha Devi. Identification of Plagiarism using Syntactic and Semantic Filters
Jörg Tiedemann. Improved Text Extraction from PDF Documents for Large-Scale Natural Language Processing
Hatem Haddad and Bechikh Ali Chedi. Turkish Information Retrieval Performances: Evaluating the Impact of Linguistic Parameters and Compound Nouns
Eric Wehrli and Luka Nerima. When rules meet bigrams
Changliang Li, Bo Xu, Gaowei Wu, Xiuying Wang, Wendong Ge and Yan Li. Obtaining Better Word Representations via Language Transfer
Tao Chen, Ruifeng Xu, Jun Xu, Bin Liu and Lin Yao. A Sentence Vector based Over-sampling Method for Imbalanced Emotion Classification
Anjan Nepal and Alexander Yates. Exploring Applications of Representation Learning in Nepali
Iria Da Cunha, Jorge Vivaldi, Juan-Manuel Torres-Moreno and Gerardo Sierra. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics
Boris Galitsky, Dmitry Ilvovsky and Sergei O. Kuznetsov. Extending tree kernels towards paragraphs
Oldrich Kruza and Vladislav Kubon. Automatic Recognition of Clauses
Marc Tomlinson, Wayne Krug, David Hinote and David Bracewell. #impressme: The Language of Motivation in User Generated Content
Arpita Batra. Constituency Parsing of Complex Noun Sequences in Hindi
Clara Vania, Mochamad Ibrahim and Mirna Adriani. Sentiment Lexicon Generation for Under-Resourced Language
Zelalem Mekuria and Yaregal Assabie. A hybrid approach to the development of part-of-speech tagger Kafi-noonoo language
Calkin Suero Montero, Tuomo Kakkonen and Myriam Munezero. Investigating the Role of Emotion-based Features in Author Gender Classification of Text
Miao Fan, Qiang Zhou and Thomas Fang Zheng. Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge
Marina Boia, Claudiu Cristian Musat and Boi Faltings. Constructing Context-aware Sentiment Lexicons with an Asynchronous Game with a Purpose
Felix-Herve Bachand, Elnaz Davoodi and Leila Kosseim. An Investigation on the Influence of Genres and Textual Organizations on the Use of Discourse Relations
Tommi Pirinen and Krister Lindén. State-of-the-Art in Weighted Finite-State Spell-Checking
Guoyu Tang, Yunqing Xia, Jun Sun, Min Zhang and Thomas Fang Zheng. Topic Models Incorporating Statistical Word Senses
Lidong Bing, Chunliang Lu and Wai Lam. Website Community Mining from Query Logs with Two-phase Clustering
Utpal Sikdar, Asif Ekbal and Sriparna Saha. Modified Differential Evolution for Biochemical Name Recognizer
Rui Wang, Wei Liu and Chris McDonald. How Candidate Selection Affects the Ranking in Unsupervised Keyphrase Extraction
Sanghamitra Nath, Himangshu Sarma and Utpal Sharma. A preliminary study on the VOT patterns of the Assamese language and its Nalbaria variety
Yaakov Hacohen-Kerner and Orr Margaliot. Authorship Attribution of Responsa using Clustering
Pintu Lohar, Pinaki Bhaskar, Santanu Pal and Sivaji Bandyopadhyay. Cross Lingual Snippet Generation Using Snippet Translation System
Vincent Claveau and Abir Ncibi. Knowledge discovery with CRF-based clustering of named entities without a priori classes
Savas Yildirim. A Knowledge-poor Approach to Turkish Text Categorization with a Comparative Analysis
Hady Elsahar and Samhaa El-Beltagy. A Fully Automated Approach for Arabic Polarity Lexicon Extraction from Microblogs
Zahrul Islam, Md. Rashedur Rahman and Alexander Mehler. Text Readability Classification of Bangla Texts
Rajesh Piryani, Jagadesha H and Vivek Kumar Singh. An Algorithmic Approach for Learning Concept Identification and Relevant Resource Retrieval in Focused Subject Domains
Zvi Ben-Ami, Ronen Feldman and Binyamin Rosenfeld. Using Multi-View Learning to Improve Detection of Investor Sentiments on Twitter
Ruijing Li, Shumin Shi, Heyan Huang, Chao Su and Tianhang Wang. A Method of Polarity Computation of Chinese Sentiment Words Based on Gaussian Distribution
Lina Rojas and Christophe Cerisara. Bayesian Inverse Reinforcement Learning for Modeling Conversational Virtual Characters in a Situated Environment.
Marco Guerini and Carlo Strapparava. Credible or Incredible? Dissecting Urban Legends
Xiangdong An. How Complementary Are Different Information Retrieval Techniques? - A Study in Biomedicine Domain
Gerard de Melo and Valeria de Paiva. Sense-Specific Implicative Commitments
Alejandra Lorenzo and Christophe Cerisara. Semi-supervised SRL system with Bayesian inference
Phillip Smith and Mark Lee. Acknowledging Discourse Function for Sentiment Analysis
Basanta Joshi, Manoj Ghimire and Umanga Bista. Intelligent clustering scheme for log data streams
Nayan Jyoti Kalita, Navanath Saharia and Smriti Kumar Sinha. Morphological Analysis of the Bishnupriya Manipuri Language using Finite State Transducers
Nadir Durrani and Yaser Al-Onaizan. Improving Egyptian-to-English SMT by mapping Egyptian into MSA
Martha Ruiz Costa-Jussà, Rafael E. Banchs and Alexander Gelbukh. An IR-based strategy for supporting Chinese-Portuguese translation services in off-line mode
Soujanya Poria, Erik Cambria and Alexander Gelbukh. Sentic Parser: A Dependency Relation Based Concept Parser for Concept Level Text Analysis
Thierry Poibeau. Optimality Theory as a Framework for Lexical Acquisition
Nelly Moreno, Sergio Jimenez and Julia Baquero. Automatically Assessing Children Written Skills Based on Age-supervised Dataset
Dan Ştefănescu, Rajendra Banjade and Vasile Rus. A Sentence Similarity Method based on Parsing and Information Content
Sirine Boukedi and Kais Haddar. HPSG grammar treating different forms of Arabic coordination
Griselda Matias Mendoza, Yulia Ledeneva and Rene Arnulfo Garcia Hernandez. Evaluación de Herramientas Comerciales, Herramientas en Línea y Métodos del Estado del Arte para la Generación de Resúmenes de Textos para un solo Documento
Yulia Ledeneva, René Arnulfo García-Hernández and Alexander Gelbukh. Graph Ranking on Maximal Frequent Sequences for Single Extractive Text Summarization
Joao Casteleiro, Joaquim Silva and Gabriel Pereira Lopes. Bilingually Learning Word Senses for Translation
Andrea Segura-Olivares, Alejandro García and Hiram Calvo. Feature Analysis for Paraphrase Recognition and Textual Entailment
Pavel Kral. Named Entities as new Features for Czech Document Classification
Hiram Calvo. Simple TF·IDF is not the Best you can get for Regionalism Classification
Arun K. Timalsina and Dinesh Dangol. Nepali Language Feature Enhanced Vector Space Model for News Classification