Szuhui Wu, Finance Data Science

Menu

Skip to content
  • Bio
  • Resume
  • Portfolio
    • Visualization
    • R Programming
    • Python
    • Spark
    • Machine Learning
    • Matlab Programming
    • Shiny App
    • NLP
    • Tableau
    • Excel
    • VBA
  • Consulting

Natural Language Processing Data Exploration

Posted December 1, 2015 by szwu

This text analysis uses NLP functions from R tm{} package to explore and understand the corpus before the implementation of a n-gram prediction model.

The corpus is the “English-US” dataset obtained from HC Corpora.  See their readme file for details on the corpora available.

 

See the report here.

 

freqCharts-1

 

Share:

  • Twitter
  • Facebook
  • LinkedIn
  • Reddit
Posted in: NLP, R Programming, Visualization

Post navigation

Impact of Weather Events on Population Health and Economic Losses: An Analysis of NOAA’s Storm Events Database →
← MS Excel Visualization: NASA TLX Comparison

Connect with Szuhui Wu

  • LinkedIn

All materials © Szuhui Wu, 2016 unless otherwise specified

Szuhui Wu, Finance Data Science
  • Bio
  • Resume
  • Portfolio
    • Visualization
    • R Programming
    • Python
    • Spark
    • Machine Learning
    • Matlab Programming
    • Shiny App
    • NLP
    • Tableau
    • Excel
    • VBA
  • Consulting