CICLing-2005: Accepted Papers

Click on a paper to see its abstract

ID

Title

Authors

Page

 

 

 

 

 

Computational Linguistics Research

 

 

 

 

 

 

Computational Linguistics Formalisms

 

K

Invited paper:
An Overview of Probabilistic Tree Transducers for Natural Language Processing

Kevin Knight and Jonathan Graehl

1

155

A Modular Account of Information Structure in Extensible Dependency Grammar

Ralph Debusmann, Oana Postolache, and Maarika Traat

26

151

Modelling Grammatical and Lexical Knowledge: A Declarative Approach

Palmira Marrafa

38

133

Constructing a Parser for Latin

Cornelis H.A. Koster

49

127

Parsing Korean Case Phenomena in a Type-Feature Structure Grammar

Jong-Bok Kim and Jaehyung Yang

61

138

A Computational Model of the Spanish Clitic System

Luis A. Pineda, Ivan V. Meza

73

337

A Parallel Approach to Syllabification

Anca Dinu and Liviu P. Dinu

83

 

 

 

 

 

Semantics and Discourse

 

M

Invited paper:
Towards Developing Probabilistic Generative Models for Reasoning with Natural Language Representations

Daniel Marcu and Ana-Maria Popescu

87

153

Putting Pieces Together: Combining FrameNet, VerbNet and WordNet for Robust Semantic Parsing

Lei Shi and Rada Mihalcea

99

207

Assigning Function Tags with a Simple Model

Vasile Rus and Kirtan Desai

111

217

Finding Discourse Relations in Student Essays

Rohana Mahmud and Allan Ramsay

115

 

 

 

 

 

Parsing and Syntactic Disambiguation

 

142

Regional vs. Global Finite-State Error Repair

Manuel Vilares, Juan Otero, and Jorge Graña

119

117

Lexicalized Beam Thresholding Parsing with Prior and Boundary Estimates

Deyi Xiong, Qun Liu, and Shouxun Lin

131

126

Unsupervised Evaluation of Parser Robustness

Johnny Bigert, Jonas Sjöbergh, Ola Knutsson, and Magnus Sahlgren

141

181

Mutual Information Independence Model using Kernel Density Estimation for Segmenting and Labeling Sequential Data

ZHOU GuoDong, YANG LingPeng, SU Jian, JI DongHong

153

175

Applying Conditional Random Fields to Chinese Shallow Parsing

Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu

165

233

Distributional Thesaurus vs. WordNet: A Comparison of Backoff Techniques for Unsupervised PP Attachment

Hiram Calvo, Alexander Gelbukh , Adam Kilgarriff

172

 

 

 

 

 

Morphology

 

110

Automatic Recognition of Czech Derivational Prefixes

Alfonso Medina Urrea and Jaroslava Hlaváčová

184

237

Korma 2003: Newly Improved Korean Morpheme Analysis Module for Reducing Terminological and Spacing Errors in Document Analysis

Ho-cheol Choi and Sang-yong Han

193

160

Word Extraction Based on Semantic Constraints in Chinese Word-formation

Maosong Sun, Shengfen Luo, and Benjamin K T’sou

197

395

Using Directed Graph based BDMM Algorithm for Chinese Word Segmentation

Yaodong Chen, Ting Wang, and Huowang Chen

209

 

 

 

 

 

Anaphora and Coreference

 

227

Entity-Based Noun Phrase Coreference Resolution

Xiaofeng Yang, Jian Su, and Lingpeng Yang

213

216

The Right Frontier Constraint as Conditional

Claudia Sassen and Peter Kühnlein

217

 

 

 

 

 

Word Sense Disambiguation

 

162

Name Discrimination by Clustering Similar Contexts

Ted Pedersen, Amruta Purandare, and Anagha Kulkarni

221

388

Word Sense Disambiguation by Semi-Supervised Learning

Zheng-Yu Niu, Dong-Hong Ji, Chew-Lim Tan, and Ling-Peng Yang

233

211

Crossing Parallel Corpora and Multilingual Lexical Databases for WSD

Alfio Massimiliano Gliozzo, Marcello Ranieri, and Carlo Strapparava

237

212

A Mapping between Classifiers and Training Conditions for WSD

Aarón Pancardo-Rodríguez, Manuel Montes-y-Gómez, Luis Villaseñor-Pineda, Paolo Rosso

241

128

Multiwords and Word Sense Disambiguation

Victoria Arranz, Jordi Atserias and Mauro Castillo

245

206

Context Expansion with Global Keywords for a Conceptual Density-Based WSD

Davide Buscaldi, Paolo Rosso, and Manuel Montes y Gómez

257

182

Two Web-based Approaches for Noun Sense Disambiguation

Paolo Rosso, Manuel Montes-y-Gómez, Davide Buscaldi, Aarón Pancardo-Rodríguez, and Luis Villaseñor Pineda

261

 

 

 

 

 

Lexical Resources

 

198

Finding Instance Names and Alternative Glosses on the Web: WordNet Reloaded

Marius Paşca

273

145

Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries

Masaki Murata, Toshiyuki Kanamaru, and Hitoshi Isahara

285

354

Enriching WordNet with Derivational Subnets

Karel Pala and Radek Sedláček

297

159

Customisable Semantic Analysis of Texts

Vivi Nastase and Stan Szpakowicz

304

190

ITOLDU, a Web Service to Pool Technical Lexical Terms in a Learning Environment and Contribute to Multilingual Lexical Databases

Valérie Bellynck, Christian Boitet, John Kenwright

316

352

Building a Situation-based Language Knowledge Base

Qiang ZHOU, Zushun CHEN

325

222

Unsupervised Learning of P NP P Word Combinations

Sofía N. Galicia-Haro, Alexander Gelbukh

329

 

 

 

 

 

Natural Language Generation

 

111

Evaluating Evaluation Methods for Generation in the Presence of Variation

Amanda Stent, Matthew Marge, and Mohit Singhai

333

210

Reconciling Parameterization, Configurability and Optimality in Natural Language Generation via Multiparadigm Programming

Jorge Marques Pelizzoni, Maria das Graças Volpe Nunes

344

 

 

 

 

 

Machine Translation

 

B

Invited paper:
Message Automata for Messages with Variants, and Methods for their Translation

Christian Boitet

349

123

The UNL Initiative: An Overview

Igor Boguslavsky; Jesús Cardeñosa; Carolina Gallardo; Luis Iraola

369

196

Interactive Resolution of Intrinsic and Translational Ambiguity in a Machine Translation System

Igor M. Boguslavsky, Leonid L. Iomdin, Alexander V. Lazursky, Leonid G. Mityushin, Victor G. Sizov, Leonid G. Kreydlin, Alexander S. Berdichevsky

380

166

Chinese-Japanese Clause Alignment

Xiaojie Wang, Fuji Ren

392

157

Direct Combination of Spelling and Pronunciation Information for Robust Back-Transliteration

Slaven Bilac and Hozumi Tanaka

403

 

 

 

 

 

Speech and Natural Language Interfaces

 

213

A Prosodic Diphone Database for Korean Text-to-Speech Synthesis System

Kyuchul Yoon

415

205

On a Pitch Detection Method Using Noise Reduction

Jong Kuk Kim, Ki Young Lee, Myung Jin Bae

419

202

Toward Acoustic Models for Languages with Limited Linguistic Resources

Luis Villaseñor-Pineda, Viet Bac Le, Manuel Montes-y-Gómez, and Manuel Pérez-Coutiño

423

201

A Study On Pitch Detection in Time-Frequency Hybrid Domain

Wangrae Jo, Jongkuk Kim, Myungjin Bae

427

192

VoiceUNL: a Semantic Representation of Emotions within Universal Networking Language Formalism Based on a Dialogue Corpus Analysis

Mutsuko Tomokiyo, Gérard Chollet

431

169

Combining Multiple Statistical Classifiers to Improve the Accuracy of Task Classification

Wei-Lin Wu, Ru-Zhan Lu, Feng Gao, Yan Yuan

442

 

 

 

 

 

Language Documentation

 

119

A Finite State Network for Phonetic Text Processing

Edward John Garrett

453

144

Language Documentation: the Nahuatl Grammar

Mike Maxwell, Jonathan D. Amith

464

 

 

 

 

 

Intelligent Text Processing Applications

 

 

 

 

 

 

Information Extraction

 

R

Invited paper:
Creating Subjective and Objective Sentence Classifiers from Unannotated Texts

Janyce Wiebe and Ellen Riloff

476

109

Instance Pruning by Filtering Uninformative Words: an Information Extraction Case Study

Alfio Massimiliano Gliozzo, Claudio Giuliano, and Raffaella Rinaldi

488

114

Incremental Information Extraction Using Tree-based Context Representations

Christian Siefkes

500

134

Learning Information Extraction Rules for Protein Annotation from Unannotated Corpora

Jee-Hyub Kim and Melanie Hilario

512

204

Transformation-Based Information Extraction Using Learned Meta-Rules

Un Yong Nahm

524

141

A Machine Learning Approach to Information Extraction

Alberto Téllez-Valero, Manuel Montes-y-Gómez, Luis Villaseñor-Pineda

528

130

Automatic Time Expression Labeling for English and Chinese Text

Kadri Hacioglu, Ying Chen, Benjamin Douglas

537

185

Integrating Natural Language Techniques in OO-Method

Isabel Díaz, Lidia Moreno, Inmaculada Fuentes, and Oscar Pastor

549

 

 

 

 

 

Information Retrieval

 

174

Document Re-ordering Based on Key Terms in Top Retrieved Documents

Yang Lingpeng, Ji Donghong, Nie Yu, Zhou Guodong

561

167

Merging Case Relations into VSM to Improve Information Retrieval Precision

Wang Hongtao, Sun Maosong, Liu Shaoming

573

171

Evaluating Document-to-document Relevance based on Document Language Model: Modeling, Implementation and Performance Evaluation

Ge Yu, Xiaoguang Li, Yubin Bao, Daling Wang

582

358

Retrieval Efficiency of Normalized Query Expansion

Sofia Stamou and Dimitris Christodoulakis

593

325

Selecting Interesting Articles Using Their Similarity Based Only on Positive Examples

Jiří Hroza and Jan Žižka

597

 

 

 

 

 

Question Answering

 

163

Question Classification in Spanish and Portuguese

Thamar Solorio, Manuel Pérez-Coutiño, Manuel Montes-y-Gómez, Luis Villaseñor-Pineda, and Aurelio López-López

601

218

Learning the Query Generation Patterns

Marcin Skowron and Kenji Araki

609

221

Exploiting Question Concepts for Query Expansion

Hae-Jung Kim, Ki-Dong Bu, Junghyun Kim and Sang-Jo Lee

613

238

Experiment on Combining Sources of Evidence for Passage Retrieval

Alexander Gelbukh, NamO Kang, SangYong Han

617

 

 

 

 

 

Summarization

 

149

Summarisation through Discourse Structure

Dan Cristea, Oana Postolache, and Ionuţ Pistol

621

215

LexTrim: A Lexical Cohesion based Approach to Parse-and-Trim Style Headline Generation

Ruichao Wang, Nicola Stokes, William Doran, Eamonn Newman, John Dunnion, Joe Carthy

633

331

Generating Headline Summary from a Document Set

Kamal Sarkar, Sivaji Bandyopadhyay

637

209

Extractive Summarization Based on Word Information and Sentence Position

Carlos MÉNDEZ Cruz and Alfonso MEDINA Urrea

641

172

Automatic Extraction and Learning of Keyphrases from Scientific Articles

Yaakov HaCohen-Kerner, Zuriel Gross, Asaf Masa

645

183

Automatic Annotation of Corpora for Text Summarisation: A Comparative Study

Constantin Orăsan

658

 

 

 

 

 

Text Classification, Categorization, and Clustering

 

132

Techniques for Improving the Performance of Naive Bayes for Text Classification

Karl-Michael Schneider

670

136

Efficient Modeling of Analogy

Lars G. Johnsen and Christer Johansson

682

187

A Supervised Clustering Method for Text Classification

Umarani Pappuswamy, Dumisizwe Bhembe, Pamela W. Jordan and Kurt VanLehn

692

348

Unsupervised Text Classification using Kohonen’s Self Organizing Network

Nirmalya Chowdhury and Diganta Saha

703

234

Enhancement of DTP Feature Selection Method for Text Categorization

Edgar Moyotl-Hernández, Héctor Jiménez-Salazar

707

176

FASiL Adaptive Email Categorization System

Yunqing Xia, Angelo Dalli, Yorick Wilks, Louise Guthrie

711

170

ESPClust: An Effective Skew Prevention Method for Model-based Document Clustering

Xiaoguang Li, Ge Yu, Daling Wang, Yubin Bao

723

226

A Method of Rapid Prototyping of Evolving Ontologies

Pavel Makagonov, Alejandro Ruiz Figueroa

735

 

 

 

 

 

Named Entity Recognition

 

178

Resolution of Data Sparseness in Named Entity Recognition using Hierarchical Features and Feature Relaxation Principle

ZHOU GuoDong, SU Jian, YANG LingPeng

739

164

Learning Named Entity Recognition in Portuguese from Spanish

Thamar Solorio and Aurelio López López

751

223

A Simple Rule-based Approach to Organization Name Recognition in Chinese Text

Houfeng Wang and Wuguang Shi

758

 

 

 

 

 

Language Identification

 

113

Disentangling from Babylonian Confusion – Unsupervised Language Identification

Chris Biemann, Sven Teresniak

762

203

On the Syllabic Similarities of Romance Languages

Anca Dinu and Liviu P. Dinu

774

214

Automatic Language Identification Using Multivariate Analysis

Vinosh Babu James J. and Baskaran Sankaran

778

 

 

 

 

 

Spelling and Style Checking

 

140

Design and Development of a System for the Detection of Agreement Errors in Basque

Arantza Díaz de Ilarraza, Koldo Gojenola and Maite Oronoz

782

122

An Experiment in Detection and Correction of Malapropisms through the Web

Igor A. Bolshakov

792

118

A Paragraph Boundary Detection System

Dmitriy Genzel

805

 

 

 

 

A

Author Index

817