A Linked Data-Based Model of WordNet Bahasa Indonesia (Model WordNet Bahasa Indonesia berbasis Linked Data)

Hendrik Hendrik, Andhik Budi Cahyono

Abstract


WordNet is an online lexical database. In computer science, it plays an important role in addressing semantic interoperability issues, and it supports much research in natural language processing. Because of its importance, WordNet has been developed for several languages, e.g., Japanese, Arabic, and Indonesian. However, these language-specific versions alone are not sufficient to address semantic interoperability issues. There have therefore been several attempts to publish WordNet in a machine-understandable format, i.e., the Resource Description Framework (RDF) model. So far, however, there has been no effort to convert WordNet Bahasa Indonesia into RDF. This paper presents the process of converting WordNet Bahasa Indonesia into Linked Data. The process involves several phases: identifying data sources, extracting the data, transforming the data, loading the data into a relational database, and mapping the database model to the RDF model. The last phase is carried out with the D2RQ framework and yields WordNet Bahasa Indonesia as Linked Data. The resulting data set is linked to the WordNet-RDF of Princeton University.
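
The last two phases described above (loading into a relational database, then mapping rows to RDF) can be illustrated with a minimal sketch. This is not the authors' implementation: the paper uses the D2RQ framework, which performs the mapping declaratively, whereas the Python below is a hand-written stand-in. The table layout, the `example.org` namespaces, the sample rows, and the use of `owl:sameAs` as the linking property are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical namespaces: the abstract does not give the real URIs.
BASE = "http://example.org/wordnet-id/"
TARGET = "http://example.org/princeton-wordnet/"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"  # assumed linking property

# Illustrative rows only: (synset identifier, Indonesian lemma).
SAMPLE_ROWS = [
    ("00001740-n", "entitas"),
    ("02084071-n", "anjing"),
]


def load(rows):
    """Load (synset_id, lemma) pairs into an in-memory relational table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sense (synset_id TEXT, lemma TEXT)")
    conn.executemany("INSERT INTO sense VALUES (?, ?)", rows)
    conn.commit()
    return conn


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(conn):
    """Map each table row to N-Triples lines, linking each synset once."""
    lines = []
    linked = set()
    query = "SELECT synset_id, lemma FROM sense ORDER BY synset_id, lemma"
    for synset_id, lemma in conn.execute(query):
        subject = f"<{BASE}{synset_id}>"
        # One language-tagged label per lemma.
        lines.append(f'{subject} <{RDFS_LABEL}> "{escape_literal(lemma)}"@id .')
        # One outgoing link per synset to the external data set.
        if synset_id not in linked:
            linked.add(synset_id)
            lines.append(f"{subject} <{OWL_SAME_AS}> <{TARGET}{synset_id}> .")
    return lines


if __name__ == "__main__":
    for line in to_ntriples(load(SAMPLE_ROWS)):
        print(line)
```

In this sketch, each row becomes a language-tagged label triple, and each synset gets one outgoing link to the external data set, which is what makes the output Linked Data and not a standalone RDF dump.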

DOI: http://dx.doi.org/10.22146/jnteti.v6i1.288


Copyright (c) 2017 Jurnal Nasional Teknik Elektro dan Teknologi Informasi (JNTETI)

JNTETI (Jurnal Nasional Teknik Elektro dan Teknologi Informasi)

Departemen Teknik Elektro dan Teknologi Informasi, Fakultas Teknik Universitas Gadjah Mada
Jl. Grafika No 2. Kampus UGM Yogyakarta 55281
+62 274 552305
jnteti@ugm.ac.id