Cite Article

Improving Stemming Algorithm Using Morphological Rules

Choose citation format

BibTeX

@article{IJASEIT1705,
   author = {Titin Winarti and Djati Kerami and Lussiana ETP and Sunny Arief Sudiro},
   title = {Improving Stemming Algorithm Using Morphological Rules},
   journal = {International Journal on Advanced Science, Engineering and Information Technology},
   volume = {7},
   number = {5},
   year = {2017},
   pages = {1758--1764},
   keywords = {stemming; information retrieval; morphological rule.},
   abstract = {Stemming words to remove suffixes has applications in text search, translation machine, summarization document, and text classification. For example, Indonesian stemming reduces the words “kebaikan”, “perbaikan”, “memperbaiki” and “sebaik-baiknya” to their common morphological root “baik”. In text search, this permits a search for player to find documents containing all words with the stem play. In the Indonesian language, stemming is of crucial importance: words have prefixes, suffixes, infixes, and confixes that make them match to relate difficult words. This research proposed a stemmer with more accurate word results by employing algorithm which gave more than one word candidate results and more than one affix combinations. New stemming algorithm is called CAT stemming algorithm. Here, the word results did not depend on the order of the morphological rule. All rules were checked and the word results were kept in a candidate list. To make an efficient stemmer, two kinds of word lists (vocabularies) were used: words that had more than one candidate words and list of root word as a candidate reference. The final word results were selected with several rules. This strategy was proved to have better result than the two most known about Indonesian stemmers. The experiments showed that the proposed approach gave higher accuracy than the compared systems known.},
   issn = {2088-5334},
   publisher = {INSIGHT - Indonesian Society for Knowledge and Human Development},
   url = {http://ijaseit.insightsociety.org/index.php?option=com_content&view=article&id=9&Itemid=1&article_id=1705},
   doi = {10.18517/ijaseit.7.5.1705}
}

EndNote

%A Winarti, Titin
%A Kerami, Djati
%A ETP, Lussiana
%A Sudiro, Sunny Arief
%D 2017
%T Improving Stemming Algorithm Using Morphological Rules
%B 2017
%9 stemming; information retrieval; morphological rule.
%! Improving Stemming Algorithm Using Morphological Rules
%K stemming; information retrieval; morphological rule.
%X Stemming words to remove suffixes has applications in text search, translation machine, summarization document, and text classification. For example, Indonesian stemming reduces the words “kebaikan”, “perbaikan”, “memperbaiki” and “sebaik-baiknya” to their common morphological root “baik”. In text search, this permits a search for player to find documents containing all words with the stem play. In the Indonesian language, stemming is of crucial importance: words have prefixes, suffixes, infixes, and confixes that make them match to relate difficult words. This research proposed a stemmer with more accurate word results by employing algorithm which gave more than one word candidate results and more than one affix combinations. New stemming algorithm is called CAT stemming algorithm. Here, the word results did not depend on the order of the morphological rule. All rules were checked and the word results were kept in a candidate list. To make an efficient stemmer, two kinds of word lists (vocabularies) were used: words that had more than one candidate words and list of root word as a candidate reference. The final word results were selected with several rules. This strategy was proved to have better result than the two most known about Indonesian stemmers. The experiments showed that the proposed approach gave higher accuracy than the compared systems known.
%U http://ijaseit.insightsociety.org/index.php?option=com_content&view=article&id=9&Itemid=1&article_id=1705
%R doi:10.18517/ijaseit.7.5.1705
%J International Journal on Advanced Science, Engineering and Information Technology
%V 7
%N 5
%@ 2088-5334

IEEE

Titin Winarti,Djati Kerami,Lussiana ETP and Sunny Arief Sudiro,"Improving Stemming Algorithm Using Morphological Rules," International Journal on Advanced Science, Engineering and Information Technology, vol. 7, no. 5, pp. 1758-1764, 2017. [Online]. Available: http://dx.doi.org/10.18517/ijaseit.7.5.1705.

RefMan/ProCite (RIS)

TY  - JOUR
AU  - Winarti, Titin
AU  - Kerami, Djati
AU  - ETP, Lussiana
AU  - Sudiro, Sunny Arief
PY  - 2017
TI  - Improving Stemming Algorithm Using Morphological Rules
JF  - International Journal on Advanced Science, Engineering and Information Technology; Vol. 7 (2017) No. 5
Y2  - 2017
SP  - 1758
EP  - 1764
SN  - 2088-5334
PB  - INSIGHT - Indonesian Society for Knowledge and Human Development
KW  - stemming; information retrieval; morphological rule.
N2  - Stemming words to remove suffixes has applications in text search, translation machine, summarization document, and text classification. For example, Indonesian stemming reduces the words “kebaikan”, “perbaikan”, “memperbaiki” and “sebaik-baiknya” to their common morphological root “baik”. In text search, this permits a search for player to find documents containing all words with the stem play. In the Indonesian language, stemming is of crucial importance: words have prefixes, suffixes, infixes, and confixes that make them match to relate difficult words. This research proposed a stemmer with more accurate word results by employing algorithm which gave more than one word candidate results and more than one affix combinations. New stemming algorithm is called CAT stemming algorithm. Here, the word results did not depend on the order of the morphological rule. All rules were checked and the word results were kept in a candidate list. To make an efficient stemmer, two kinds of word lists (vocabularies) were used: words that had more than one candidate words and list of root word as a candidate reference. The final word results were selected with several rules. This strategy was proved to have better result than the two most known about Indonesian stemmers. The experiments showed that the proposed approach gave higher accuracy than the compared systems known.
UR  - http://ijaseit.insightsociety.org/index.php?option=com_content&view=article&id=9&Itemid=1&article_id=1705
DO  - 10.18517/ijaseit.7.5.1705

RefWorks

RT Journal Article
ID 1705
A1 Winarti, Titin
A1 Kerami, Djati
A1 ETP, Lussiana
A1 Sudiro, Sunny Arief
T1 Improving Stemming Algorithm Using Morphological Rules
JF International Journal on Advanced Science, Engineering and Information Technology
VO 7
IS 5
YR 2017
SP 1758
OP 1764
SN 2088-5334
PB INSIGHT - Indonesian Society for Knowledge and Human Development
K1 stemming; information retrieval; morphological rule.
AB Stemming words to remove suffixes has applications in text search, translation machine, summarization document, and text classification. For example, Indonesian stemming reduces the words “kebaikan”, “perbaikan”, “memperbaiki” and “sebaik-baiknya” to their common morphological root “baik”. In text search, this permits a search for player to find documents containing all words with the stem play. In the Indonesian language, stemming is of crucial importance: words have prefixes, suffixes, infixes, and confixes that make them match to relate difficult words. This research proposed a stemmer with more accurate word results by employing algorithm which gave more than one word candidate results and more than one affix combinations. New stemming algorithm is called CAT stemming algorithm. Here, the word results did not depend on the order of the morphological rule. All rules were checked and the word results were kept in a candidate list. To make an efficient stemmer, two kinds of word lists (vocabularies) were used: words that had more than one candidate words and list of root word as a candidate reference. The final word results were selected with several rules. This strategy was proved to have better result than the two most known about Indonesian stemmers. The experiments showed that the proposed approach gave higher accuracy than the compared systems known.
LK http://ijaseit.insightsociety.org/index.php?option=com_content&view=article&id=9&Itemid=1&article_id=1705
DO  - 10.18517/ijaseit.7.5.1705