Theme: | Theme - Capacity Building (Theme - CapB), Activity - Curriculum Development (Activity - CD) |
Status: | Active |
Start Date: | 2021-03-13 |
End Date: | 2021-03-13 |
Lead: | Lefsrud, Lianne |
Project Overview
Objective
This tutorial introduces data streams through hands-on experience. We will deploy a modular, flexible real-time data processing pipeline that derives insights from tweets and generates visualizations for analysis.
The tutorial has five modules and a bonus analysis, as follows:
Module 1: Twitter Developer Account Creation Process & Tweepy Client
Module 2: Processing Tweets with Kafka
Module 3: Storing Processed Tweets in PostgreSQL
Module 4: Visualizing & Analysing Metrics with Superset
Module 5: Development Considerations for a System in Production
Bonus: Sentiment Analysis with NLTK Library
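The flow through the modules (ingest tweets, process them, store the results for later visualization) can be sketched in a few lines. This is only an illustrative sketch, not the tutorial's code: it uses the Python standard library so that it runs without any services, with `queue.Queue` standing in for a Kafka topic and an in-memory SQLite database standing in for PostgreSQL. The sample tweets, the function names, and the `processed_tweets` table are all hypothetical.

```python
import json
import queue
import sqlite3

# Hypothetical sample data; in the tutorial, tweets would arrive from the
# Twitter API through a Tweepy client.
SAMPLE_TWEETS = [
    {"id": 1, "user": "alice", "text": "Streaming data is fun #kafka"},
    {"id": 2, "user": "bob", "text": "Dashboards with #superset and #kafka"},
]


def produce(tweets, topic):
    """Stage 1: publish raw tweets as JSON messages (stand-in for a Kafka producer)."""
    for tweet in tweets:
        topic.put(json.dumps(tweet))


def process(message):
    """Stage 2: derive simple metrics from one raw message."""
    tweet = json.loads(message)
    words = tweet["text"].split()
    hashtags = [w.lower() for w in words if w.startswith("#")]
    return {
        "id": tweet["id"],
        "user": tweet["user"],
        "word_count": len(words),
        "hashtags": ",".join(hashtags),
    }


def store(conn, record):
    """Stage 3: persist a processed record (stand-in for a PostgreSQL insert)."""
    conn.execute(
        "INSERT INTO processed_tweets VALUES (:id, :user, :word_count, :hashtags)",
        record,
    )


def run_pipeline(tweets):
    """Wire the stages together and return the database connection."""
    topic = queue.Queue()  # stand-in for a Kafka topic
    conn = sqlite3.connect(":memory:")  # stand-in for PostgreSQL
    conn.execute(
        "CREATE TABLE processed_tweets ("
        "id INTEGER PRIMARY KEY, user TEXT, word_count INTEGER, hashtags TEXT)"
    )
    produce(tweets, topic)
    while not topic.empty():
        store(conn, process(topic.get()))
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = run_pipeline(SAMPLE_TWEETS)
    query = "SELECT user, word_count, hashtags FROM processed_tweets ORDER BY id"
    for row in conn.execute(query):
        print(row)
```

Because each stage is a separate function with a narrow interface, one stage can be swapped (for example, replacing the queue with a real message broker) without touching the others, which is the sense in which the pipeline is modular.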