Research Article | Open Access | Download PDF
Volume 17 | Number 1 | Year 2014 | Article Id. IJETT-V17P229 | DOI : https://doi.org/10.14445/22315381/IJETT-V17P229
Pre-processing of Domain Ontology Graph Generation System in Punjabi
Rajveer Kaur , Saurabh Sharma
Citation :
Rajveer Kaur , Saurabh Sharma, "Pre-processing of Domain Ontology Graph Generation System in Punjabi," International Journal of Engineering Trends and Technology (IJETT), vol. 17, no. 1, pp. 141-146, 2014. Crossref, https://doi.org/10.14445/22315381/IJETT-V17P229
Abstract
This paper describes pre-processing phase of ontology graph generation system from Punjabi text documents of different domains. This research paper focuses on pre-processing of Punjabi text documents. Pre-processing is structured representation of the input text. Pre-processing of ontology graph generation includes allowing input restrictions to the text, removal of special symbols and punctuation marks, removal of duplicate terms, removal of stop words, extract terms by matching input terms with dictionary and gazetteer lists terms.
Keywords
Ontology, Pre-processing phase, Ontology Graph, Knowledge Representation, Natural Language Processing.
References
[1] J. N. K. Liu, Y. He, E. H. Y. Lin, and X. Wang, “Domain ontology graph model and its application in Chinese text classification,” Neural Computing & Applications, Springer, London, vol. 24, pp. 779-798, March 2014.
[2] G.S. Lehal, “A Survey of the State of the Art in Punjabi Language Processing,” Language in India, vol. 9, pp. 9-23, Oct. 2009.
[3] Nidhi and V. Gupta, “Domain based classification of Punjabi text documents using ontology and hybrid based approach,” in Proc. of 3rd Workshop on South and Southeast Asian Natural Language Processing, SANLP, COLING, Mumbai, 2012, pp. 109-122.
[4] K. Kaur and V. Gupta, “Keyword Extraction for Punjabi Language,” Indian Journal of Computer Science and Engineering, vol. 2, pp. 364-370, July 2011.
[5] V. Gupta and G. S. Lehal, “Automatic Keyword Extraction for Punjabi Language,” International Journal of Computer Science Issues, vol. 8, pp. 327-331, September 2011.
[6] P. Talita, A. W. Yeo, and N. Kulathuramaiyer, “Challenges in building domain ontology for minority languages,” in Proc. of IEEE International Conference on Computer Applications and Industrial Electronics, 2010, pp. 574-578.