COVID-tweet-topic-sentiment-project

In this project we scraped 127,128 US tweets related to COVID and performed topic analysis and sentiment analysis over time. Furthermore, we showed that fluctuations in sentiment and topic proportions can be linked to related events. Check out the "Final_paper.pdf" file for the final results and our e-mail.

Run order:

API_scrape_test3.ipynb
Sorted_by_date.ipynb
LDA.ipynb
LDA_threshold_correction_FINAL.ipynb
Plotting.ipynb
Wordclouds_distinctive_words.ipynb
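
The run order above can also be executed from a script rather than by opening each notebook by hand. The following is a minimal sketch, not part of the repository: it assumes Jupyter with `nbconvert` is installed, that "LDA" in the list refers to `LDA.ipynb`, and that the helper names `build_commands` and `run_all` are hypothetical. The scraping notebook will probably need your own API credentials, so you may want to skip it.

```python
import subprocess

# Notebook order as listed in this README ("LDA" is assumed to mean LDA.ipynb).
RUN_ORDER = [
    "API_scrape_test3.ipynb",
    "Sorted_by_date.ipynb",
    "LDA.ipynb",
    "LDA_threshold_correction_FINAL.ipynb",
    "Plotting.ipynb",
    "Wordclouds_distinctive_words.ipynb",
]


def build_commands(notebooks, skip=()):
    """Return one nbconvert command (as an argv list) per notebook, in order."""
    return [
        ["jupyter", "nbconvert", "--to", "notebook", "--execute", "--inplace", nb]
        for nb in notebooks
        if nb not in skip
    ]


def run_all(notebooks=RUN_ORDER, skip=()):
    """Execute the notebooks one after another; stop at the first failure."""
    for cmd in build_commands(notebooks, skip):
        subprocess.run(cmd, check=True)
```

Calling `run_all(skip=("API_scrape_test3.ipynb",))` would re-run everything after the scraping step against data that is already present.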

Note:

Mallet was used to train the LDA models; you will need to download and install it yourself and add the Mallet path in your copy of the repository. The final LDA model uses 50 topics and 10000 iterations (the data and model filenames are named accordingly). Some of the CSVs in the Data directory are quite big, so some of these files need to be unzipped before all code can be run.
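
The setup described in this note could be sketched roughly as follows. This is an illustration under assumptions, not the code used in the notebooks: the helper names, the output filenames, and the idea of calling the Mallet command line directly are all hypothetical (the notebooks may use a Python wrapper instead), and the corpus is assumed to have been converted to Mallet's own format already. Only the 50 topics and 10000 iterations come from this README.

```python
import zipfile
from pathlib import Path


def unzip_archives(data_dir):
    """Extract every .zip found under data_dir next to the archive itself."""
    extracted = []
    for archive in sorted(Path(data_dir).rglob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(archive.parent)
        extracted.append(archive)
    return extracted


def mallet_train_command(mallet_path, corpus_file, model_dir,
                         num_topics=50, num_iterations=10000):
    """Build a Mallet `train-topics` command line as an argv list.

    corpus_file is assumed to be a file produced by Mallet's import step;
    the output filenames below are hypothetical.
    """
    out = Path(model_dir)
    return [
        str(mallet_path), "train-topics",
        "--input", str(corpus_file),
        "--num-topics", str(num_topics),
        "--num-iterations", str(num_iterations),
        "--output-topic-keys", str(out / f"topic_keys_{num_topics}.txt"),
        "--output-doc-topics", str(out / f"doc_topics_{num_topics}.txt"),
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once the Mallet path points at your local installation.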

