Видео автора el chief мне понравились, все кратко и по делу. Просмотр подбрки видео об обработке текстов (5 штук) помог вспомнить основные подходы и этапы tokenizing, stemming, stopwords, and n-grams, Text Association Rules... Кроме того здесь еще две важные ссылки Top 16 free software for text analysis Text Mining, кластеризация текста в RapidMiner
Part 2 of 5. This video discusses processing text in RapidMiner, including tokenizing, stemming, stopwords, and n-grams. This video describes how to find frequent item sets and association rules for text mining in RapidMiner
Part 3. Text Association Rules in RapidMiner This video describes how to find frequent item sets and association rules for text mining in RapidMiner
Top 16 free software for text analysis Text Mining, кластеризация текста в RapidMiner
#1 This video shows how to load text into RapidMiner from copy & paste, a single file,
a group of files in a group of folders, and from excel or a database
#2 Extract content,Tokenaze, Transform cases, Filter stopwords, Stem(Porter)
#3 Read Database, Process Document, Numerical to , FP Growth, Create Associacion
#4 This is part 4 of a 5 part video series on Text Mining using the free and open-source RapidMiner.
This video describes how to calculate a term's TF-IDF score, as well as how to find similar documents
using cosine similarity, and how to cluster documents using the K-Means algorithm.
#5 This is part 5 of a 5 part video series on Text Mining using the free and open-source RapidMiner.
This video describes how to automatically classify documents using the Nearest Neighbor algorithm,
and finding out which words are important to classification using the Naive Bayes learner.
Cross-Validation is also covered.
Посты чуть ниже также могут вас заинтересовать
Комментариев нет:
Отправить комментарий