CICLing-2003

Computational Linguistics and Intelligent Text Processing

Lecture Notes in Computer Science, vol. 2588, Springer-Verlag, 2003

Table of Contents

 

Computational Linguistics

 

 

 

Computational Linguistics Formalisms

 

Keynote talk:

 

Starting with Complex Primitives Pays Off

1

Aravind K. Joshi

 

Things Are Not Always Equal

12

Ronald M. Kaplan and Annie Zaenen

 

GIGs: Restricted Context-sensitive Descriptive Power in Bounded Polynomial-time

23

José M. Castaño

 

Total Lexicalism and GASGrammars: A Direct Way to Semantics

37

Gábor Alberti and Katalin Balogh and Judit Kleiber and Anita Viszket

 

Pseudo Context-Sensitive Models for Parsing Isolating Languages: Classical Chinese – A Case Study

49

Liang Huang, Yinan Peng, Zhenyu Wu, Zhihao Yuan, Huan Wang, and Hui Liu

 

Semantics and Discourse

 

Imperatives as Obligatory and Permitted Actions

53

Miguel Pérez-Ramírez, Chris Fox

 

Formal Representation and Semantics of Modern Chinese Interrogative Sentences

66

Jia-ju Mao, Qiu-lin Chen, Ru-zhan Lu

 

Analyzing V+Adj in Situation Semantics

76

Jia-ju Mao, Qiu-lin Chen, Ru-zhan Lu

 

Diagnostics for Determining Compatibility in English Support-verb-nominalization Pairs

87

Leslie Barrett and Anthony R. Davis

 

A Maximum Entropy Approach for Spoken Chinese Understanding

94

Guodong Xie, Chengqing Zong, Bo Xu

 

A Study to Improve the Efficiency of a Discourse Parsing System

104

Huong T. Le, Geetha Abeysinghe

 

Syntax and POS tagging

 

Conversion of Japanese Passive/Causative Sentences into Active Sentences Using Machine Learning

118

Masaki Murata and Hitoshi Isahara

 

From Czech Morphology through Partial Parsing to Disambiguation

129

Eva Mráková and Radek Sedláček

 

Fast Base NP Chunking with Decision Trees — Experiments on Different POS Tag Settings

139

Dirk Lüdtke and Satoshi Sato

 

Guaranteed Pre-Tagging for the Brill Tagger

151

Saif Mohammad and Ted Pedersen

 

Performance Analysis of a Part of Speech Tagging Task

161

Rada Mihalcea

 

Parsing Techniques

 

An Efficient Online Parser for Contextual Grammars with at Most Context-Free Selectors

171

Karin Harbusch

 

Off-line Compilation of Chains for Head-driven Generation with Constraint-based Grammars

183

Toni Tuells, German Rigau, Horacio Rodríguez

 

Generation of Incremental Parsers

193

Manuel Vilares, Miguel A. Alonso, and Victor M. Darriba

 

Morphology

 

Computing with Realizational Morphology

205

Lauri Karttunen

 

Approach to Construction of Automatic Morphological Analysis Systems for Inflective Languages with Little Effort

217

Alexander Gelbukh and Grigori Sidorov

 

Per-Node Optimization of Finite-State Mechanisms for Natural Language Processing

223

Alexander Troussov, Brian O'Donovan, Seppo Koskenniemi, and Nikolay Glushnev

 

Word Sense Disambiguation

 

Keynote talk:

 

An Evaluation of a Lexicographer's Workbench Incorporating Word Sense Disambiguation

227

Adam Kilgarriff and Rob Koeling

 

Keynote talk:

 

Using Measures of Semantic Relatedness for Word Sense Disambiguation

243

Siddharth Patwardhan, Satanjeev Banerjee and Ted Pedersen

 

Automatic Sense Disambiguation of the Near-Synonyms in a Dictionary Entry

260

Diana Zaiu Inkpen and Graeme Hirst

 

Word Sense Disambiguation for Untagged Corpus: Application to Romanian Language

270

Gabriela Şerban and Doina Tătar

 

Automatic Noun Sense Disambiguation

275

Paolo Rosso, Francesco Masulli, Davide Buscaldi, Ferran Pla, and Antonio Molina

 

Tool for Computer-Aided Spanish Word Sense Disambiguation

279

Yoel Ledo Mezquita, Grigori Sidorov, and Alexander Gelbukh

 

Dictionary, Lexicon, Ontology

 

Augmenting WordNet's Structure Using LDOCE

283

Vivi Nastase and Stan Szpakowicz

 

Building Consistent Dictionary Definitions

297

Karel Pala and Eva Mráková

 

Is Shallow Parsing Useful for Unsupervised Learning of Semantic Clusters?

306

Marie-Laure Reinberger and Walter Daelemans

 

Experiments on Extracting Semantic Relations from Syntactic Relations

316

Caroline Varaschin Gasperin and Vera Lúcia Strube de Lima

 

A Method of Automatic Detection of Lexical Relationships using a Raw Corpus

327

Héctor Jiménez-Salazar

 

Sentence Co-occurrences as Small-world Graphs: A Solution to Automatic Lexical Disambiguation

331

Stefan Bordag

 

Dimensional Analysis to Clarify Relations among the Top-Level Concepts of an Upper Ontology: Process, Event, Substance, Object

335

Patrick Cassidy

 

Classifying Functional Relations in Factotum via WordNet Hypernym Associations

349

Tom O'Hara and Janyce Wiebe

 

Corpus and Language Statistics

 

Keynote talk:

 

Processing Natural Language without Natural Language Processing

362

Eric Brill

 

The Design, Implementation and Use of the Ngram Statistics Package

372

Satanjeev Banerjee and Ted Pedersen

 

An Estimate Method of the Minimum Entropy of Natural Languages

384

Fuji Ren, Shunji Mitsuyoshi, Kang Yen, Chengqing Zong, Hongbing Zhu

 

A Corpus Balancing Method for Language Model Construction

395

Luis Villaseñor-Pineda, Manuel Montes-y-Gómez, Manuel Alberto Pérez-Coutiño, and Dominique Vaufreydaz

 

Building a Chinese Shallow Parsed TreeBank for Collocation Extraction

404

Li Baoli, Lu Qin, Li Yin

 

Corpus Construction within Linguistic Module of City Information Dialogue System

408

Roman Mouček, Kamil Ekštein

 

Diachronic Stemmed Corpus and Dictionary of Galician Language

412

Nieves R. Brisaboa, Juan-Ramón López, Miguel R. Penabad, Ángeles S. Places

 

Can We Correctly Estimate the Total Number of Pages in Google for a Specific Language?

417

Igor A. Bolshakov and Sofia N. Galicia-Haro

 

Machine Translation and Bilingual Corpora

 

The Word is Mightier than the Count: Accumulating Translation Resources from Parsed Parallel Corpora

422

Stephen Nightingale and Hideki Tanaka

 

Identifying Complex Sound Correspondences in Bilingual Wordlists

434

Grzegorz Kondrak

 

Text Generation

 

Generating Texts with Style

446

Richard Power, Donia Scott, and Nadjet Bouayad-Agha

 

Multilingual Syntax Editing in GF

455

Janna Khegai, Bengt Nordström, and Aarne Ranta

 

QGen — Generation Module for the Register Restricted InBASE System

467

Michael V. Boldasov and Elena G. Sokolova

 

Natural Language Interfaces

 

Towards Designing Natural Language Interfaces

479

Svetlana Sheremetyeva

 

A Discourse System for Conversational Characters

492

Ron Zacharski

 

A Portable Natural Language Interface for Diverse Databases Using Ontologies

496

J. Antonio Zárate M., Rodolfo A. Pazos R., Alexander Gelbukh, and J. Isabel Padrón C.

 

Speech Processing

 

Time-Domain Structural Analysis of Speech

508

Kamil Ekštein and Roman Mouček

 

Experiments with Linguistic Categories for Language Model Optimization

513

Arantza Casillas, Amparo Varona, Ines Torres

 

Chinese Utterance Segmentation in Spoken Language Translation

518

Chengqing Zong and Fuji Ren

 

 

 

Intelligent Text Processing

 

 

 

Information Retrieval and Information Extraction

 

Using Natural Language Processing for Semantic Indexing of Scene-of-Crime Photographs

528

Horacio Saggion and Katerina Pastra and Yorick Wilks

 

Natural Language in Information Retrieval

539

Elżbieta Dura

 

Natural Language System for Terminological Information Retrieval

543

Gerardo Sierra and John McNaught

 

Query Expansion based on Thesaurus Relations: Evaluation over Internet

555

Luiz Augusto Sangoi Pizzato and Vera Lúcia Strube de Lima

 

Suggesting Named Entities for Information Access

559

Enrique Amigó, Anselmo Peñas, Julio Gonzalo, and Felisa Verdejo

 

Probabilistic Word Vector and Similarity based on Dictionaries

564

Satoshi Suzuki

 

Web Document Indexing and Retrieval

575

Byurhan Hyusein and Ahmed Patel

 

Event Sentence Extraction in Korean Newspapers

582

Bo-Hyun Yun, Tae-Hyun Kim, Yi-Gyu Hwang, Pal-Jin Lee, and Seung-Shik Kang

 

Text Categorization and Clustering

 

Searching for Significant Word Associations in Text Documents Using Genetic Algorithms

586

Jan Žižka and Michal Šrédl and Aleš Bourek

 

Cascaded Feature Selection in SVMs Text Categorization

590

Takeshi Masuyama and Hiroshi Nakagawa

 

A Study on Feature Weighting in Chinese Text Categorization

594

Xue Dejun, Sun Maosong

 

Experimental Study on Representing Units in Chinese Text Categorization

604

Li Baoli, Chen Yuzhong, Bai Xiaojing, Yu Shiwen

 

Partitional Clustering Experiments with News Documents

617

Arantza Casillas and Mayte González de Lena and Raquel Martínez

 

Fast Clustering Algorithm for Information Organization

621

Kwangcheol Shin and Sangyong Han

 

Summarization

 

Automatic Text Summarization of Scientific Articles Based on Classification of Extract's Population

625

Maher Jaoua and Abdelmajid Ben Hamadou

 

Spell-Checking

 

Positive Grammar Checking: A Finite State Approach

637

Sylvana Sofkova Hashemi, Robin Cooper and Robert Andersson

 

Author Index

649

 

 
