18 Sep 10

A set of ontology matching algorithms (for finding correspondences between concepts) is based on a thesaurus that provides the source data for the semantic distance calculations. In this wiki era, new resources may spring up and improve this kind of semantic search. In the paper a solution of this task based on Russian Wiktionary is compared to WordNet based algorithms. Metrics are estimated using the test collection, containing 353 English word pairs with a relatedness score assigned by human evaluators. The experiment shows that the proposed method is capable in principle of calculating a semantic distance between pair of words in any language presented in Russian Wiktionary. The calculation of Wiktionary based metric had required the development of the open-source Wiktionary parser software.

by mlb

We introduce Wiktionary as an emerging lexical semantic resource that can be used as a substitute for expert-made resources in AI applications. We evaluate Wiktionary on the pervasive task of computing semantic relatedness for English and German by means of correlation with human rankings and solving word choice problems. For the first time, we apply a concept vector based measure to a set of different concept representations like Wiktionary pseudo glosses, the first paragraph of Wikipedia articles, English WordNet glosses, and GermaNet pseudo glosses. We show that: (i) Wiktionary is the best lexical semantic resource in the ranking task and performs comparably to other resources in the word choice task, and (ii) the concept vector based approach yields the best results on all datasets in both evaluations.

by mlb

02 Jun 10

Bueda increases the value of rich media and user-generated content via its innovative semantic analysis engine. Leveraging research from Carnegie Mellon University’s Language Technologies Institute, Bueda combines user-generated tags, existing ontologies, and semantic analysis in order to provide publishers with actionable information that can be used for content categorization, targeted advertising, content recommendation and search engine optimization. Via Bueda’s API, any user generated content website can have access to the latest technology for tag disambiguation and cleanup with minimal integration hurdles.

by mlb

13 Apr 10

The Social Graph API makes information about the public connections between people on the Web, expressed by XFN and FOAF markup and other publicly declared connections, easily available and useful for developer.

by mlb