Thursday, March 22, 2007
Wednesday, March 21, 2007
Internet Language Resources
....
Bafia (Mbam Cameroon) Wayumbe....
Bagesu (Central Africa) Watulire?
Bagesu (Central Africa) [answer] Natulire nili mlahi
Bajawa (Indonesia) ['where are you going'] Male de?
Bakitara (Central Africa) [morning] Oirwota?
Bakitara (Central Africa) [answer] Ndabanta
Bakitara (Central Africa) [after absense] Mirembe
Bakweri (Cameroon) [morning] O wusi
Balanta (Guinea-Bissau) Abala, lite utchole
Balinese (Bali) Om swastyastu
Balinese (Bali) [reply] Om shanti shanti shanti
Balti (India, Pakistan) Yang chi halyo?
Balti (India, Pakistan) [answer] Lyakhmo
That was "hello" in some languages. Jennifer Runner has this page with "Hello" and other pleasantries in a large number of languages. Don't forget to check her Internet Language Resources page.
Khau bulyghyz!
- Delip Rao at 8:12 PM 0 comments
Principal Components: language, linguistics
Sunday, March 18, 2007
Interesting papers from NAACL and WWW 2007
NAACL
Computing Semantic Similarity between Skill Statements for Approximate Matching
Feng Pan and Robert Farrell
Extracting Appraisal Expressions
Kenneth Bloom, Shlomo Argamon and Navendu Garg
Unsupervised Resolution of Objects and Relations on the Web
Alexander Yates and Oren Etzioni
Near-Synonym Choice in an Intelligent Thesaurus
Diana Inkpen
Using Wikipedia for Automatic Word Sense Disambiguation
Rada Mihalcea
An integrated approach to measuring Semantic Similarity between Words using Information available on the Web
Danushka Bollegala, Yutaka Matsuo and Mitsuru Ishizuka
Improving Relation Extraction Using Domain Information
Alfio Massimiliano Gliozzo, Marco Pennacchiotti and Patrick Pantel
High-Performance, Language-Independent Morphological Segmentation
Sajib Dasgupta and Vincent Ng
A Systematic Exploration of The Feature Space for Relation Extraction
Jing Jiang and ChengXiang Zhai
Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP
Andrei Alexandrescu and Katrin Kirchhoff
WWW
Towards Domain-Independent Information Extraction from Web Tables
Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Kroepl, Bernhard Pollak
Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds
Marius Pasca
A New Suffix Tree Similarity Measure for Document Clustering
Hung Chim, Xiaotie Deng
Scaling Up All-Pairs Similarity Search
Roberto Bayardo, Yiming Ma, Ramakrishnan Srikant
Wherefore Art Thou R3579X? Anonymized Social Networks, Hidden Patterns, and Structural Steganography
Lars Backstrom, Cynthia Dwork, Jon Kleinberg
Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs
Qiaozhu Mei, Xu Ling, Matthew Wondra, Hang Su, ChengXiang Zhai
Measuring Semantic Similarity between Words Using Web Search Engines
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka
Using Google Distance to weight approximate ontology matches
Risto Risto Gligorov, Zharko Aleksovski, Warner ten Kate, Frank van Harmelen
NLP on VVLC
Papers to read
1. Banko and Brill, ACL 2001
2. Deepak Ravichandran, ACL 2005
- Delip Rao at 8:38 PM 0 comments