Sawsaa, Ahlam, Lu, Joan, Newman, Christopher D and Ribchester, Helen (2013) Extracting Occupational Therapy Concepts to Develop Domain Ontology. In: The Seventh International Conference on Digital Society. ICDS 2013. IARIA, Nice, France, pp. 65-73. ISBN 978-1-61208-249-3

Abstract— Recently, unstructured data on the World Wide
Web has generated significant interest in the extraction of text,
emails, web pages, reports and research papers in their raw
form. Of greater interest, extracting information from a
specific domain using distributed corpora from the World
Wide Web is a vital step towards creating corpus annotation.
This paper describes a method of annotation, based on
Occupational Therapy (OT) concepts, to build a domain
ontology using Natural Language Processing (NLP)
technology. We used Java Annotation Patterns Engine
(JAPE) grammars to support regular expression matching and
thus annotate OT concepts using the GATE Developer tool. This
speeds up the time-consuming development of the ontology,
which is important for experts in the domain facing time
constraints and high workloads. The rules produced significant
results: pattern matching of OT concepts based on the
lookup list yielded 403 correct concepts, and accuracy
was generally high. Using NLP techniques is a good approach
to reducing the domain expert’s work, and the results can be

