In text mining, some learning methods process text as vectors (the vector space model), such as Rocchio and kNN, while others represent it as scalars, such as Tree and SVM. With different representations, the way distance/difference is measured also differs: angle for vectors, and Euclidean distance for scalars.
Setting aside the strengths of each learning method, which document representation is better?
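To make the two measures concrete, here is a minimal sketch comparing an angle-based measure (cosine similarity) with Euclidean distance. The term-count vectors and the names `doc_short`, `doc_long`, and `doc_other` are made up for illustration; they do not come from any real corpus.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length term vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for empty documents
    return dot / (norm_a * norm_b)


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length term vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical term counts over a three-word vocabulary.
doc_short = [1, 2, 0]    # a short document
doc_long = [10, 20, 0]   # same word proportions, ten times longer
doc_other = [0, 1, 2]    # similar length, different word mix

for name, doc in [("doc_long", doc_long), ("doc_other", doc_other)]:
    print(name,
          "cosine:", round(cosine_similarity(doc_short, doc), 3),
          "euclidean:", round(euclidean_distance(doc_short, doc), 3))
```

With these toy numbers the two measures disagree: by angle, `doc_short` is closest to `doc_long` (same direction, so the cosine is 1 up to rounding), while by Euclidean distance it is closer to `doc_other`, because document length dominates the distance. This is only an illustration of how the choice of measure follows from the representation, not an answer to which one is better.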
Saturday, November 6, 2010
Saturday, February 27, 2010
A tool for knowledge management
I have been thinking about an idea (one that would help with the first phase of a Ph.D. student's work: the literature review):
While we are reading and marking up a paper, a tool automatically saves our markings. Our marking process is really a domain expert's information extraction and summarization, isn't it? The tool would then organize the saved items for later analysis or querying.