DATA MINING –TEXT MINING
RETRIEVE DATA SET ROLE NOMINAL TO TEXT PROCESS DOCUMENT TO DATA TOKENIZE FITLER STOPWORDS FILTER TOKENS (Length) TRANSFORM CASE PROCESSES USED (MINING WORD COUNT):
TEXT MINING (LOCATING ALL WORDS WITHIN BALLOT QUESTIONS)
RESULTS
SAME BEGINNING PROCESS AS MINING WORD COUNT ADDITIONS FOR ASSOCIATIONS: 1.NUMERICAL TO BINOMINAL 2.FP-GROWTH 3.CREATE ASSOCIATIONS PROCESSES USED (MINING WORD ASSOCIATIONS):
TEXT MINING (CREATING ASSOCIATIONS)
RESULTS
SAME BEGINNING PROCESS AS MINING WORD COUNT ADDITIONS FOR CLUSTERING: K-Means PROCESSES USED (WORD CLUSTERING):
WORD CLUSTERING (CLUSTERING SIMILAR WORDS)
RESULTS
REFERENCES El Chief’s Youtube page - Auburnbigdata blogspot – association.html