CICLing 2014 Accepted Papers

Dat Huynh, Dat Tran and Wanli Ma. Semantic Similarity Measure Using Relational and Latent Topic Features

Alexey An, Bakytkan Dauletbakov and Eugene Levner. Multi-attribute classification of text documents as a tool for ranking and categorization of educational innovation projects

Su Fei, Gang Chen and Xinyan Xiao. Beam-Width Adaptation for Hierarchical Phrase-Based Translation

Sebastian Schmidt, Steffen Schnitzer and Christoph Rensing. Effective Classification of Ambiguous Web Documents Incorporating Human Feedback Efficiently

Caio Teixeira, Ivandré Paraboni, Adriano Silva and Alan Yamasaki. Generating Relational Descriptions involving Mutual Disambiguation

Thiago Ferreira and Ivandré Paraboni. Classification-based Referring Expression Generation

Prasad Perera and Leila Kosseim. Evaluation of Sentence Compression Techniques Against Human Performance

Eric Kergosien, Cédric Lopez, Mathieu Roche and Maguelonne Teisseire. Looking for Opinion in Land-use Planning Corpora

Anselmo Peñas, Bernardo Cabaleiro and Mirella Lapata. Unsupervised Interpretation of Eventive Propositions

David Bracewell, Marc Tomlinson, Michael Mohler and Bryan Rink. A Tiered Approach to the Recognition of Metaphor

Marie Duzi. Isomorphism of structured meanings and synonymy

Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi. Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge

Tiansi Dong and Armin B. Cremers. A Novel Machine Translation Method for Learning Chinese as a Foreign Language

Amir Hazem. Improving Bilingual Lexicon Extraction from Comparable Corpora using Window-based and Syntax-based Models

Henning Wachsmuth, Martin Trenkmann, Benno Stein, Gregor Engels and Tsvetomira Palakarska. A Review Corpus for Argumentation Analysis

Kfir Bar and Nachum Dershowitz. Inferring Paraphrases for a Highly Inflected Language from a Monolingual Corpus

Mohamed Farouk Abdel Hady and Abubakrelsedik Karali. Unsupervised Active Learning for Cross-Lingual Information Extraction

David Mareček and Zdeněk Žabokrtský. Dealing with Function Words in Unsupervised Dependency Parsing

Lionel Ramadier, Manel Zarrouk, Mathieu Lafourcade and Antoine Micheau. Spreading Relation Annotations in a Lexical Semantic Network Applied to Radiology

Suraj Maharjan, Prasha Shrestha, Gabriela Ramirez, Alan Sprague, Thamar Solorio and Gary Warner. Using String Information for Malware Family Identification

Sandra Bringay, Eric Kergosien, Pierre Pompidor and Pascal Poncelet. Emotions target in health forums

Roris Victor, Juan M. Santos, Roberto Perez-Rodriguez, Carlos Rivas, Miguel Gomez and Luis Anido Rifon. Information Extraction in Semantic and Semi-Structured Web Sources

Elizaveta Clouet and Beatrice Daille. Compound Terms and their Multi-Word Variants: Case of German and Russian Languages

Meishan Zhang, Yue Zhang, Wanxiang Che and Ting Liu. A Semantic-Oriented Grammar for Chinese Treebanking

Santanu Pal, Pintu Lohar and Sudip Kumar Naskar. Role of Paraphrases in PB-SMT

Lior Wolf, Yair Hanani, Kfir Bar and Nachum Dershowitz. Joint word2vec Networks for Bilingual Semantic Representations

Imene Bensalem, Paolo Rosso and Salim Chikhi. Intrinsic Plagiarism Detection using N-grams Frequency Classes

Aibek Makazhanov, Olzhas Makhambetov, Islam Sabyrgaliyev and Zhandos Yessenbayev. Spelling Correction for Kazakh

Gurpreet Singh Lehal, Tejinder Singh Saini and Preetpal Kaur Buttar. Automatic Bilingual Legacy-Fonts Identification and Conversion System

Abeba Ibrahim and Yaregal Assabie. Amharic Sentence Parsing Using Base Phrase Chunking

Niraj Kumar. A Graph Based Automatic Plagiarism Detection Technique to Handle The Artificial Word Reordering and Paraphrasing

György Orosz, Attila Novák and Gábor Prószéky. Lessons learned from tagging clinical Hungarian

Kanako Komiya, Shohei Shibata and Yoshiyuki Kotani. Cross-lingual Product Recommendation Using Collaborative Filtering With Translation Pairs

Apurbalal Senapati and Utpal Garain. A Maximum Entropy based Honorificity Identification for Bengali Pronominal Anaphora Resolution

Vivek Datla, King-Ip Lin and Max Louwerse. Linguistic features predict the truthfulness of short political statements

Reinhard Rapp. Using Word Association Norms to Measure Corpus Representativeness

Ophélie Lacroix, Denis Bechet and Florian Boudin. Label Pre-annotation for Building Non-projective Dependency Treebanks for French

Diana Inkpen and Amir Hossein Razavi. Topic Classification using Latent Dirichlet Allocation at Multiple Levels

Daiga Deksne, Raivis Skadiņš and Inguna Skadiņa. Extended CFG formalism for grammar checker and parser development

Nir Ofek, Lior Rokach and Prasenjit Mitra. Methodology for Connecting Nouns to their Modifying Adjectives

Cyrine Nasri, Kamel Smaili and Chiraz Latiri. Statistical Machine Translation Without Alignments

Zuzana Neverilova. Acquiring annotated data for recognizing textual entailment by means of a game

Jessica Perrie, Aminul Islam and Evangelos Milios. How Document Properties Affect Document Relatedness Measures

Bassam Hammo, Asma Moubaiddin, Nadim Obeid and Abeer Tuffaha. Understanding Arabic Syntactic Structure in Light of the Government and Binding Theory

Miguel Rios and Lucia Specia. Statistical Relational Learning to Recognise Textual Entailment

Amit Mishra and Sanjay Kumar Jain. An Approach for Computing Sentiment Polarity of Complex Why Type Opinion Questions Asked on Product Review Sites

Nobal B. Niraula and Vasile Rus. A Machine Learning Approach to Anaphora Resolution in Dialogue based Intelligent Tutoring Systems

Md. Abdullah Al Mumin, Abu Awal Md. Shoeb, Mohammad Reza Selim and Muhammed Zafar Iqbal. A Representative Bengali Corpus for Intelligent Text Processing

Ruifeng Xu, Jun Xu, Bin Liu and Lin Yao. News Reader’s Emotion Prediction Using Concept and Concept Sequence Features in Headline

Yuta Kikuchi, Hiroya Takamura, Manabu Okumura and Satoshi Nakazawa. Identifying a Demand towards a Company in CGM

Bayar Tsolmon and Kyung-Soon Lee. Extracting Social Events based on Timeline and User Reliability Analysis on Twitter

Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer and Anna Korhonen. Verb clustering for Brazilian Portuguese

Pranay Kumar Venkata Sowdaboina, Sutanu Chakraborti and Sripada Somayajulu G. Learning to summarize time series data

Pat Hall, Bal Krishna Bal, Sagun Dhakwa and Bhim Narayan Regmi. Issues in Encoding the Writing of Nepal’s Languages

Marina Litvak and Natalia Vanetik. Multi-document Summarization using Tensor Decomposition

D Indumathi, A Chitra and J Bineeshia. Search Query Expansion using Concept based Clustering for Improved Personalized Search

Nibaran Das, Swarnendu Ghosh, Teresa Goncalves and Paulo Quaresma. Comparison of different graph distance metrics for semantic text based classification

Nikola Ljubešić, Tomaž Erjavec and Darja Fišer. Standardizing Tweets with Character-level Machine Translation

Lakshmi S and Sobha Lalitha Devi. Rule Based Case Transfer in Tamil-Malayalam MT

Rabeb Mbarek, Mohamed Tmar and Hawete Hattab. A New Relevance Feedback Algorithm Based on Vector Space Basis Change

Sobha Lalitha Devi, Lakshmi S and Sindhuja Gopalan. Discourse Tagging for Indian Languages

Tuan Dinh, Hung Phan and Quan Tran. Evaluating prosodic characteristics for Vietnamese Aviation announcements

Vijay Sundar Ram, Efstathios Stamatatos and Sobha Lalitha Devi. Identification of Plagiarism using Syntactic and Semantic Filters

Jörg Tiedemann. Improved Text Extraction from PDF Documents for Large-Scale Natural Language Processing

Hatem Haddad and Bechikh Ali Chedi. Turkish Information Retrieval Performances: Evaluating the Impact of Linguistic Parameters and Compound Nouns

Eric Wehrli and Luka Nerima. When rules meet bigrams

Changliang Li, Bo Xu, Gaowei Wu, Xiuying Wang, Wendong Ge and Yan Li. Obtaining Better Word Representations via Language Transfer

Tao Chen, Ruifeng Xu, Jun Xu, Bin Liu and Lin Yao. A Sentence Vector based Over-sampling Method for Imbalanced Emotion Classification

Anjan Nepal and Alexander Yates. Exploring Applications of Representation Learning in Nepali

Iria Da Cunha, Jorge Vivaldi, Juan-Manuel Torres-Moreno and Gerardo Sierra. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics

Boris Galitsky, Dmitry Ilvovsky and Sergei O. Kuznetsov. Extending tree kernels towards paragraphs

Oldrich Kruza and Vladislav Kubon. Automatic Recognition of Clauses

Marc Tomlinson, Wayne Krug, David Hinote and David Bracewell. #impressme: The Language of Motivation in User Generated Content

Arpita Batra. Constituency Parsing of Complex Noun Sequences in Hindi

Clara Vania, Mochamad Ibrahim and Mirna Adriani. Sentiment Lexicon Generation for Under-Resourced Language

Zelalem Mekuria and Yaregal Assabie. A hybrid approach to the development of part-of-speech tagger for Kafi-noonoo language

Calkin Suero Montero, Tuomo Kakkonen and Myriam Munezero. Investigating the Role of Emotion-based Features in Author Gender Classification of Text

Miao Fan, Qiang Zhou and Thomas Fang Zheng. Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge

Marina Boia, Claudiu Cristian Musat and Boi Faltings. Constructing Context-aware Sentiment Lexicons with an Asynchronous Game with a Purpose

Felix-Herve Bachand, Elnaz Davoodi and Leila Kosseim. An Investigation on the Influence of Genres and Textual Organizations on the Use of Discourse Relations

Tommi Pirinen and Krister Lindén. State-of-the-Art in Weighted Finite-State Spell-Checking

Guoyu Tang, Yunqing Xia, Jun Sun, Min Zhang and Thomas Fang Zheng. Topic Models Incorporating Statistical Word Senses

Lidong Bing, Chunliang Lu and Wai Lam. Website Community Mining from Query Logs with Two-phase Clustering

Utpal Sikdar, Asif Ekbal and Sriparna Saha. Modified Differential Evolution for Biochemical Name Recognizer

Rui Wang, Wei Liu and Chris McDonald. How Candidate Selection Affects the Ranking in Unsupervised Keyphrase Extraction

Sanghamitra Nath, Himangshu Sarma and Utpal Sharma. A preliminary study on the VOT patterns of the Assamese language and its Nalbaria variety

Yaakov Hacohen-Kerner and Orr Margaliot. Authorship Attribution of Responsa using Clustering

Pintu Lohar, Pinaki Bhaskar, Santanu Pal and Sivaji Bandyopadhyay. Cross Lingual Snippet Generation Using Snippet Translation System

Vincent Claveau and Abir Ncibi. Knowledge discovery with CRF-based clustering of named entities without a priori classes

Savas Yildirim. A Knowledge-poor Approach to Turkish Text Categorization with a Comparative Analysis

Hady Elsahar and Samhaa El-Beltagy. A Fully Automated Approach for Arabic Polarity Lexicon Extraction from Microblogs

Zahrul Islam, Md. Rashedur Rahman and Alexander Mehler. Text Readability Classification of Bangla Texts

Rajesh Piryani, Jagadesha H and Vivek Kumar Singh. An Algorithmic Approach for Learning Concept Identification and Relevant Resource Retrieval in Focused Subject Domains

Zvi Ben-Ami, Ronen Feldman and Binyamin Rosenfeld. Using Multi-View Learning to Improve Detection of Investor Sentiments on Twitter

Ruijing Li, Shumin Shi, Heyan Huang, Chao Su and Tianhang Wang. A Method of Polarity Computation of Chinese Sentiment Words Based on Gaussian Distribution

Lina Rojas and Christophe Cerisara. Bayesian Inverse Reinforcement Learning for Modeling Conversational Virtual Characters in a Situated Environment

Marco Guerini and Carlo Strapparava. Credible or Incredible? Dissecting Urban Legends

Xiangdong An. How Complementary Are Different Information Retrieval Techniques? - A Study in Biomedicine Domain

Gerard de Melo and Valeria de Paiva. Sense-Specific Implicative Commitments

Alejandra Lorenzo and Christophe Cerisara. Semi-supervised SRL system with Bayesian inference

Phillip Smith and Mark Lee. Acknowledging Discourse Function for Sentiment Analysis

Basanta Joshi, Manoj Ghimire and Umanga Bista. Intelligent clustering scheme for log data streams

Nayan Jyoti Kalita, Navanath Saharia and Smriti Kumar Sinha. Morphological Analysis of the Bishnupriya Manipuri Language using Finite State Transducers

Nadir Durrani and Yaser Al-Onaizan. Improving Egyptian-to-English SMT by mapping Egyptian into MSA

Martha Ruiz Costa-Jussà, Rafael E. Banchs and Alexander Gelbukh. An IR-based strategy for supporting Chinese-Portuguese translation services in off-line mode

Soujanya Poria, Erik Cambria and Alexander Gelbukh. Sentic Parser: A Dependency Relation Based Concept Parser for Concept Level Text Analysis

Thierry Poibeau. Optimality Theory as a Framework for Lexical Acquisition

Nelly Moreno, Sergio Jimenez and Julia Baquero. Automatically Assessing Children's Written Skills Based on Age-supervised Dataset

Dan Ştefănescu, Rajendra Banjade and Vasile Rus. A Sentence Similarity Method based on Parsing and Information Content

Sirine Boukedi and Kais Haddar. HPSG grammar treating different forms of Arabic coordination

Griselda Matias Mendoza, Yulia Ledeneva and Rene Arnulfo Garcia Hernandez. Evaluación de Herramientas Comerciales, Herramientas en Línea y Métodos del Estado del Arte para la Generación de Resúmenes de Textos para un solo Documento

Yulia Ledeneva, René Arnulfo García-Hernández and Alexander Gelbukh. Graph Ranking on Maximal Frequent Sequences for Single Extractive Text Summarization

Joao Casteleiro, Joaquim Silva and Gabriel Pereira Lopes. Bilingually Learning Word Senses for Translation

Andrea Segura-Olivares, Alejandro García and Hiram Calvo. Feature Analysis for Paraphrase Recognition and Textual Entailment

Pavel Kral. Named Entities as new Features for Czech Document Classification

Hiram Calvo. Simple TF·IDF is not the Best you can get for Regionalism Classification

Arun K. Timalsina and Dinesh Dangol. Nepali Language Feature Enhanced Vector Space Model for News Classification