================
Kara Belknap & Cassio Monti
2022-11-14
The purpose of this repository is to provide EDA and modeling for different data channels obtained between 2013 and 2014 in Marshable website. The goal is to perform prediction of the number of shares that the papers presented after publication by using some variables obtained before publication. In other words, the idea is to predict the number of shares a publication may have before it is published.
The packages listed in this section are used throughout the analysis. tidyverse
is used for data management and plotting through dplyr
and ggplot
packages. caret
package is used for data splitting and modeling. knitr
package is used for nice printing of tables. GGally
is used for nice correlation and exploratory plots assisting in the visualization.
library(tidyverse)
library(caret)
library(knitr)
library(GGally)
In this section you can access the reports generated for each data channel considered in this analysis.
The analysis for Lifestyle articles is available here.
The analysis for Entertainment articles is available here.
The analysis for Business articles is available here.
The analysis for Social Media articles is available here.
The analysis for Tech articles is available here.
The analysis for World articles is available here.
This section shows the R code used to run all analysis listed above.
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "lifestyle.md", params = list(channel = "lifestyle"))
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "entertainment.md", params = list(channel = "entertainment"))
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "bus.md", params = list(channel = "bus"))
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "socmed.md", params = list(channel = "socmed"))
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "tech.md", params = list(channel = "tech"))
rmarkdown::render("Belknap_Monti_project3_ST558.Rmd", output_file = "world.md", params = list(channel = "world"))