Journal of Engineering and Applied Sciences

Year: 2018
Volume: 13
Issue: 21
Page No. 8986 - 8992

Rule Based Sentence Segmentation of Indonesian Language

Authors: Suwanto Raharjo, Retantyo Wardoyo and Agfianto E. Putra

Abstract: Sentence detection, also known as sentence boundary detection or sentence boundary disambiguation (SBD), is a field of study in computational linguistics and an important stage in developing applications or research based on natural language processing. SBD for the Indonesian language has not received much attention from researchers and, as a result, few papers have been written on the topic. Indonesian sentence segmentation is considered a minor problem that can be handled with an English SBD method. These could be the reasons why the topic has received little attention. Existing studies do not specifically address Indonesian sentence segmentation but only mention it as one stage of the research. Two approaches, rule based and machine learning, are usually used for sentence segmentation in several languages. Other methods are statistics based, such as maximum entropy or regression trees, or use artificial neural networks. This study performs sentence segmentation on Indonesian text using a rule based method and compares the results with existing sentence segmentation software. Two experiments were conducted on the developed rules: the first used input sentences containing ambiguity problems, and the second used many sentences drawn from several kinds of input.
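To illustrate what a rule based segmenter of this kind might look like, the following Python sketch splits text on sentence-ending punctuation and applies three disambiguation rules. It is not the authors' rule set, which the abstract does not list: the abbreviation list, the three rules, and the names ABBREVIATIONS, CANDIDATE and split_sentences are all assumptions made for illustration.

```python
import re

# Hypothetical abbreviation list (titles and address terms), for illustration only.
ABBREVIATIONS = {"dr", "prof", "drs", "ir", "jl", "no", "tn", "ny"}

# A candidate boundary is a run of '.', '!' or '?' followed by whitespace.
# Periods inside numbers such as "1.500.000" are never candidates.
CANDIDATE = re.compile(r"[.!?]+(?=\s)")


def split_sentences(text):
    """Split text into sentences using a few hand-written rules."""
    sentences = []
    start = 0
    for match in CANDIDATE.finditer(text):
        end = match.end()
        tokens = text[start:match.start()].split()
        prev = tokens[-1] if tokens else ""
        rest = text[end:].lstrip()
        if match.group() == ".":
            # Rule 1: a period after a known abbreviation is not a boundary.
            if prev.lower() in ABBREVIATIONS:
                continue
            # Rule 2: a period after a single capital letter is treated as an initial.
            if len(prev) == 1 and prev.isalpha() and prev.isupper():
                continue
        # Rule 3: the next sentence must start with an uppercase letter,
        # a digit, or an opening quote or parenthesis.
        if rest and not (rest[0].isupper() or rest[0].isdigit() or rest[0] in "\"'("):
            continue
        sentences.append(text[start:end].strip())
        start = end
    # Whatever remains after the last accepted boundary is the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


if __name__ == "__main__":
    sample = ("Prof. Budi tinggal di Jl. Merdeka No. 5. "
              "Harganya Rp 1.500.000 saja! Apakah dr. Siti sudah datang?")
    for sentence in split_sentences(sample):
        print(sentence)
```

A fixed abbreviation list keeps the sketch short, but it cannot decide cases where an abbreviation really does end a sentence; ambiguous inputs of that kind are what the first experiment described in the abstract is meant to probe.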

How to cite this article:

Suwanto Raharjo, Retantyo Wardoyo and Agfianto E. Putra, 2018. Rule Based Sentence Segmentation of Indonesian Language. Journal of Engineering and Applied Sciences, 13: 8986-8992.
