International Conference on Recent Advances in Natural
Language Processing (RANLP 2013)
Hissar, Bulgaria
9-11 September 2013
Editors:
G. Angelova
K. Bontcheva R. Mitkov
ISBN:
978-1-62993-555-3
Table of Contents
ASMA: A
System for
AutomaticSegmentation
andMorpho-Syntactic Disambiguation of
Modern Stan¬dard Arabic
Muhammad
Abdul-Mageed,
Mona Diab and Sandra Kiibler 1Optimising
Tree Edit Distance withSubtreesfor
Textual EntailmentMaytham
Alabbas and AllanRamsay
9Opinion Learning from
Medical ForumsTanveerAli,MarinaSokolova,David Schramm and Diana
Inkpen
18Annotating
events, Time and PlaceExpressions
inArabicTextsHassinaAliane,Wassila Guendouzi and Amina Mokrani 25
A
Semi-supervised Learning Approach
toArabicNamedEntity Recognition
MahaAlthobaili, Udo Kruschwitz and MassimoPoesio 32
An NLP-based
Reading
Toolfor Aiding
Non-nativeEnglish
ReadersMahmoudAzab,AhmedSalama,Kemal Oflazer,HidekiShima, Jun Araki
and Teruko Mitamura 41
ImprovingSentiment
Analysis
in TwitterUsing Multilingual
Machine Translated DataAlexandra Balahur and Marco Turchi 49
Domain
Adaptation for Parsing
Eric
Baucom,
LeviKing
andSandraKiibler 56Towardsa Structured
Representation of
GenericConcepts
and RelationsinLarge
TextCorpora
Archana Bhattarai and Vasile Rus 65
Authorship
Attribution in Health ForumsVictoriaBobicev,MarinaSokolova, Khaled El Emam andStanMatwin 74
TwitlE: An
Open-Source Information
ExtractionPipeline for Microblog
TextKalina
Bontcheva,
LeonDerczynski,
Adam Funk, Mark Greenwood, DianaMaynard
andNiraj
Aswani 83
A
unified
lexicalprocessing framework
basedontheMargin
Infused RelaxedAlgorithm.
Acasestudy
ontheRomanian Language
Tiberiu Boros 91
Automatic extraction
of
contextual valenceshifters.
Noemi
Boubel,
ThomasFrancoisandHubert Naets 98Grammar-Based Lexicon Extension
for Aligning
GermanRadiology
Text andlinages
ClaudiaBretschneider,
Sonja
Zillner and Matthias Hammon 105Recognising andInterpreting Named
Temporal Expressions
MatteoBrucato,Leon
Derczynski,
Hector Llorens, Kalina Bontcheva and Christian S. Jensen .113Unsupervised Improving of
SentimentAnalysis
Using GlobalTarget
ContextTomas
Brychcm
and Ivan Habernal 122Aii
Agglomerative
HierarchicalClustering Algorithm for Labelling Morphs
Burcu Can and Suresh Manandhar 129
Temporal
TextClassification for
Romanian Novelssetin the PastAlina Maria Ciobanu, Liviu P. Dinu,Octavia-Maria
§ulea,
ancadinu and Vlad Niculae 136,4
Dictionary-Based Approach for Evaluating
OrthographicMethods inCognates Identification
Alina Maria Ciobanuand Liviu Petrisor Dinu 14!
A PilotStudxonthe Semantic
Classification of
Two GermanPrepositions: Combining Monolingual
andMultilingual
EvidenceSimon Clematide and Manfred Klenner 148
Semantic Relations between Events and their Time. Locations and
Participants for
EventConference
Resolution
Agata Cybulska
and Piek Vossen 156Sense
Clustering Using Wikipedia
BharathDandala, Chris
Hokamp,
Rada Mihalcea and Razvan Bunescu 164Effective Spell Checking
MethodsUsing Clustering Algorithms
Renato Cordeiro deAmorimandMarcos
Zampieri
172Normalization
of
Dutch User-Generated ContentOrphee
DeClercq,
SarahSchulz,BartDesmet, Els Lefever andVeronique
Hoste 179Linguistic Profiling of
TextsAcross Textual Genres andReadability
Levels. AnExploratory Study
onItalian Fictional Prose
Felice
Dell'Orletta,
SimonettaMontemagni
and Giulia Venturi 189Part-of-Speech Tagging for
All:Overcoming Sparse
andNoisy
DataLeon
Derczynski,
AlanRitter, Sam Clark and Kalina Bontcheva 198Weighted
maximum likelihoodloss as aconvenient shortcut tooptimizing
the F-measureof
maximum entropyclassifiers
Georgi Dimitroff,
Laura Tolosi,BorislavPopov
andGeorgi Georgiev
207Sequence Tagging for
VerbConjugation
in RomanianLiviuDinu,Octavia-Maria Sulea and Vlad Niculae 215
A
Tagging Approach
toIdentify Complex
Constituentsfor
TextSimplification
IustinDornescu, RichardEvans andConstantin Orasan 221
AutomaticEvaluationMetric
for
Machine Translation that isIndependent of
SentenceLength
Hiroshi
Echizen'ya, Kenji
Araki and EduardHovy
230Acronym recognition
andprocessing
in 22languages
Maud
Ehrmann,
Leo dellaRocca,
RalfSteinberger
and Hristo Tanev 237An Evaluation
Summary
MethodBasedon a Combinationof
Content andLinguistic
MetricsSamiraEllouze,Maher Jaoua and LamiaHadrich
Belguith
245Hierarchy Identification for Automatically Generating Table-of-Contents
Nicolai
Erbs, Iryna Gurevych
and Torsten Zesch 252Temporal
RelationClassificationin PersianandEnglish
contextsMahbaneh
Eshaghzadeh
Torbati, Gholamreza Ghassem-sani,Seyed Abolghasem
Mirroshandel,Yadollah
Yaghoobzadeh
andNegin
Karimi Hosseini 261The Extended Lexicon: Language ProcessingasLexical
Description
RogerEvans 270
Did I
really
meanthat?Applying
automatic summarisationtechniques
toformative feedback
Debora
Field, Stephen Pulman,
Nicolas VanLabeke,Denise Whitelock and John Richardson . 277Matching
setsof
parsetreesfor answering
multi-sentencequestions
Boris
Galitsky, Dmitry Ilvovsky, Sergei
O. Kuznetsov and Fedor Strok 285 Realizationofcommonstatistical methodsincomputational linguistics
withfunctional
automataStefan
Gerdjikov,
Petar Mitankin and Vladislav Nenchev 294Mining Fine-grained Opinion Expressions
with ShallowParsingSucheta
Ghosh,
SaraTonelli and Richard Johansson 302Justifying Corpus-Based
Choices inReferring Expression
GenerationHelmut Horacek 311
A
Boosting-based Algorithm for
Classificationof
Semi-Structured Textusing
theFrequency of
Substruc¬tures
Tomoya
Iwakura 319Headerless,
Quoteless,
butnotHopeless? Using
Pairwise Email Classification toDisentangle
EmailThreads
Emily
Jamison andIryna Gurevych
327Using
ParallelCorpora for
Word SenseDisambiguation
Dimitar Kazakov and Ahmad R. Shahid 336
Recogiuzing
semantic relations within Polishnounphrase:
A rule-basedapproach
Pawel Kedzia and Marek Maziarz 342
Unsupervised
Inductionof
Arabic RootandPattern LexiconsusingMachineLearning
Bilal
Khaliq
and John Carroll 350'Towards Domain
Adaptation for Parsing
Web DataMohammadKhan,MarkusDickinson andSandra Kiibler 357
CapturingAnomalies in the Choice
of
Content Words inCompositional
Distributional SemanticSpace
Ekaterina Kochmar and Ted Briscoe 365
Incremental and Predictive
Dependency Parsing
under Real-Time ConditionsArneKohn and
Wolfgang
Menzel 373Rationale,
Concepts,
and Current Outcomeof
the UnitGraphs
FrameworkMaxime Lefrancois and Fabien Gandon 382
The Unit
Graphs
Framework: FoundationalConcepts
and SemanticConsequence
Maxime Lefrancois and Fabien Gandon 389
Confidence
Estimationfor Knowledge
BasePopulation
Xiang
Li andRalph
Grishman 396Towards
Fine-grained
Citation FunctionClassification
Xiang
Li, YifanHe,
AdamMeyers
andRalph
Grishman 402Supervised Morphology
GenerationUsing
ParallelCorpus
Alireza Mahmoudi, MohsenArabsorkhi and Heshaam Faili 408
Sentiment Anahsis
of
Reviews: Shouldweanalyze
writer intentionsorreaderperceptions?
Isa Maks and Piek Vossen 415
Revisiting
theOld Kitchen Sink: Do weNeed Sentiment DomainAdaptation
?RihamMansour, NesmaRefaei, Michael Gamon, Ahmed Abdul-Hamid and Khaled Sami 420
Evaluation
of
baselineinformation
retrievalfor
Polishopen-domain Question
Answeringsystem MichalMarcinczuk, Adam Radziszewski,Maciej
Piasecki, Dominik Piaseckiand Marcin Ptak 428
WCCL Relation—aToolset
for
Rule-basedRecognition of
SemanticRelationsBetween Named EntitiesMichal Marcinczuk 436
Bexond the
Transfer-and-Merge
Wordnet Construction:plWordNet
andaComparison
with WordNetMarekMaziarz,
Maciej
Piasecki,Ewa Rudnicka and StanSzpakowicz
443History
BasedUnsupervised
Data OrientedParsing
Mohsen
Mesgar
and Gholamreza Ghasem-Sani 453Contrasting
andCorroborating
Citations in Journal ArticlesAdam
Meyers
460CCG
Categoriesfor
Distributional Semantic ModelsParamita Mirza and Raffaella Bernardi 467
Discourse-awareStatisticalMachine Translationas aContext-sensitive
Spell
CheckerBehzadMirzababaei, Heshaam Faili and Nava Ehsan 475
Cross-Lingual Information
Retrieval and SemanticInteroperabilityfor
CulturalHeritage
Repositories JohannaMonti,
MarioMonteleone,Maria Pia di Buono and Federica Marano 483Improving
Web 2.0Opinion Mining Systems
Using Text NormalisationTechniques
Alejandro
Mosquera
and Paloma Moreda Pozo 491Identifying
SocialandExpressive
Factors inRequest
TextsUsing Transaction/Sequence
ModelDasaMunkova, Michal Munk andZuzanaFraterova 496
Parameter
Optimization for
Statistical Machine Translation: ItPays
toLearnfrom
HardExamples
Preslav
Nakov,
Fahad AlObaidli,
Francisco Guzman andStephan Vogel
504Automatic
Cloze-Questions
GenerationAnnamaneniNarendra,Manish
Agarwal
and Rakshit shah 511High-Accuracy
PhraseTranslationAcquisition Through Battle-Royale
SelectionLionel
Nicolas, Egon
W.Stemle,
Klara Kranebitter and VerenaLyding
516Enriching
Patent Search withExternalKeywords:
aFeasibility Study
Ivelina
Nikolova,
Irina Temnikovaand GaliaAngelova
525A
clustering approach
fortransJulkme.seidentification
Sergiu
Nisioi and Liviu P. Dinu 532PurePos 2.0: a
hybrid
toolfor
morphologicaldisambiguation
Gyorgy
Orosz andAttila Novak 539More than
Bag-of-Words:
Sentence-based DocumentRepresentation for
SentimentAnalysis
Georgios Paltoglou
andMikeThelwall 546Information
Spreading
inExpanding
WordnetHypemymy
StructureMaciej
Piasecki,Radosiaw Ramocki andMichalKaliriski 553Context
Independent
TermMapper
for European LanguagesMarcisPinnis 562
Semi-supervised
vs. Cross-domainGraphs for
SentimentAnalysis
Natalia Ponomareva and MikeThelwall 571
Towardsa
Hybrid
Rule-based and Statistical Arabic-French Machine TranslationSystem
fatiha sadat 579
Segmenting
vs.Chunking
Rules:Unsupen-ised
TTG Induction via Minimum Conditional DescriptionLx'iigtli
Markus Saers, Karteek Addankiand Dekai Wu 584
A Combined Pattern-basedandDistributional
Approach for
AutomaticHypernym
Detectionin Dutch.Gwendolyn Schropp,
Els Lefever andVeronique
Hoste 593Exploiting Synergies
BetweenOpen
Resourcesfor
GermanDependency Parsing, POS-tagging,
andMor¬phological Analysis
RicoSennrich,Martin Volk and GeroldSchneider 601
Using
aWeighted
SemanticNetworkfor
Lexical Semantic RelatednessReda Siblini and Leila Kosseim 610
A New
Approach
totlie POSTagging
Problem UsingEvolutionary Computation
Ana PaulaSilva, Arlindo Silva and Irene
Rodrigues
619How Joe and JaneTweetabout Their Health:
Mining for
PersonalHealthInformation
onTwitterMarinaSokolova, StanMatwin,Yasser JaferandDavid Schramm 626
What Sentiments Can BeFoundin Medical Forums?
Marina Sokolova and Victoria Bobicev 633
Automated
learning of everyday patients' language for
medicalblogs analytics
Giovanni Stilo,MorenoDeVincenzi,Alberto E. Tozzi and Paola Velardi 640
How
Symbolic
Learning CanHelp
StatisticalLearning
(and viceversa)Isabelle Tellier and Yoann
Dupont
649Measuring
ClosureProperties
ofPatentSublanguages
IrinaTemnikova,
Negacy
Hailu, GaliaAngelova
and K. Bretonnel Cohen 659Closure
Properties of Bulgarian
Clinical TextIrinaTemnikova,IvelinaNikolova,William A.
Baumgartner,
GaliaAngelova
and K. Bretonnel Cohen 667
Analyzingthe Use
of
Character-LevelTranslation withSparse
andNoisy
DatasetsJbrg
Tiedemann and Preslav Nakov 676A Feature Induction
Algorithm
withApplication
toNamedEntity Disambiguation
LauraTolosi,ValentinZhikov,
Georgi Georgiev
and BorislavPopov
685Introducing
aCorpus of
Human-AuthoredDialogue
Summaries inPortuguese
Norton TrevisanRoman, Paul Piwek,Ariadne M.B. Rizzoni Carvalho
andAlexandreRossiAlvares 692
Wikipedia
as anSMTTraining Corpus
Dan
Tufi§,
RaduIon,Stefan Dumitrescu and Dan Stefanescu 702DutchSemCor: inquest
of
the idea!sense-tagged
corpusPiek Vossen, Ruben
Izquierdo
andAttilaGSrbg
710Towards
detecting
anomaliesin the contentof
standardizedLMFdictionariesWafaWALI,Bilel
Gargouri
andAbdelmajid
BEN HAMADOU 719EditDistance:A New Data Selection Criterion
for
DomainAdaptation
in SMTLongyue Wang,
Derek F.Wong,
Lidia S. Chao,JunwenXing,Yi Lu and Isabel Trancoso 727Automatic Enhancement
of
LTAG TreebankFarzanehZarei,AliBasirat, HeshamFaili and
Maryam
S.Mirian 733Inductive and deductive
inferences
ina CrowdsourcedLexical-SemanticNetworkManelZarrouk,Mathieu LafourcadeandAlain Joubert 740
Machine
Learning for
Mention Head Detection inMultilingual Coreference
ResolutionDesislavaZhekova and Sandra Kiibler 747
Combining
POSTagging, Dependency
ParsingandCoreferential
Resolutionfor
BulgarianValentinZhikov,
Georgi Georgiev,
Kiril Simov andPetya
Osenova 755magyarlanc:
A Toolfor Morphological
andDependency Parsing of
HungarianJanosZsibrita, Veronika Vincze andRichardFarkas 763