Friday, May 28, 2010

HAADS: A Hebrew Aramaic abbreviation disambiguation system

First, I promise that I'll blog about the Hugoye Symposium soon. I've just been so busy. :-P

In the meantime, for all of you NLP enthusiasts who are also into Aramaic like me:

In many languages abbreviations are very common and are widely used in both written and spoken language. However, they are not always explicitly defined and in many cases they are ambiguous. This research presents a process that attempts to solve the problem of abbreviation ambiguity using modern machine learning (ML) techniques. Various baseline features are explored, including context-related methods and statistical methods. The application domain is Jewish Law documents written in Hebrew and Aramaic, which are known to be rich in ambiguous abbreviations. Two research approaches were implemented and tested: general and individual. Our system applied four common ML methods to find a successful integration of the various baseline features. The best result was achieved by the SVM ML method in the individual research, with 98.07% accuracy.

http://sciencia.org/stories/758882/HAADS_A_Hebrew_Aramaic_abbreviation_disambiguation_system.html?utm_source=twitterfeed&utm_medium=twitter&utm_campaign=Feed%3A+sciencia%2FfOkO+%28Sciencia+-+Informatics%29

Peace,
-Steve

No comments:

Post a Comment

There are several rules about commenting here:

1) All unsigned/anonymous comments will be temp-deleted. I would like the actual names of the people who comment here.

2) SPAM will be deleted outright and permanently.

3) If someone is obnoxious, I will temp-delete their comments until they become more civil.

4) By commenting here, you release the copyrights of your comments to me.

Other than that, have fun. :-)