Real Time Stock Market Data Engineering
Build the architecture for streaming real-time stock market data using Kafka and AWS.
This is an end-to-end data engineering project on real-time stock market data, built with Kafka and AWS. It aims to deliver real-time stock market data to end users. The project is divided into three parts:
- Data Collection
  - Use the Alpaca API to collect real-time stock market data. At present, only the free plan is used.
- Data Processing
  - Run a Kafka Producer and a Kafka Consumer on AWS EC2.
  - The Kafka Producer API allows applications to send streams of data to the Kafka cluster; the Kafka Consumer API allows applications to read streams of data from the cluster.
- Data Streaming
  - Use AWS S3, an AWS Glue Crawler, the AWS Glue Data Catalog, and Athena to store and query the data.
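
The three parts above can be sketched end to end without any cloud resources. The example below is a minimal illustration, not this project's actual code: the tick fields (`symbol`, `price`, `timestamp`), the `stock-ticks` prefix, and the key layout are all assumptions, and an in-memory `deque` stands in for the Kafka topic. In a real deployment, `serialize_tick` and `deserialize_tick` could be passed as the `value_serializer` and `value_deserializer` arguments of kafka-python's `KafkaProducer` and `KafkaConsumer`, and each stored entry would become an S3 object.

```python
import json
from collections import deque
from datetime import datetime, timezone

# Stand-in for a Kafka topic; a real pipeline would use a broker on EC2.
topic = deque()


def serialize_tick(tick: dict) -> bytes:
    """Encode one tick as UTF-8 JSON bytes (the shape Kafka message values take)."""
    return json.dumps(tick, sort_keys=True).encode("utf-8")


def deserialize_tick(payload: bytes) -> dict:
    """Decode the bytes produced by serialize_tick back into a dict."""
    return json.loads(payload.decode("utf-8"))


def s3_key_for(tick: dict, prefix: str = "stock-ticks") -> str:
    """Build a date-partitioned object key (hypothetical layout)."""
    ts = datetime.fromisoformat(tick["timestamp"]).astimezone(timezone.utc)
    return f"{prefix}/symbol={tick['symbol']}/date={ts:%Y-%m-%d}/{ts:%H%M%S%f}.json"


def produce(tick: dict) -> None:
    """Producer side: serialize the tick and append it to the topic."""
    topic.append(serialize_tick(tick))


def consume_all() -> dict:
    """Consumer side: drain the topic and map each tick to its storage key."""
    stored = {}
    while topic:
        tick = deserialize_tick(topic.popleft())
        stored[s3_key_for(tick)] = tick
    return stored


if __name__ == "__main__":
    # Assumed tick shape; the real Alpaca payload may differ.
    produce({"symbol": "AAPL", "price": 185.64,
             "timestamp": "2024-01-02T14:30:00+00:00"})
    for key, tick in consume_all().items():
        print(key, tick)
```

The `symbol=.../date=...` prefixes follow the Hive-style `key=value` convention, which a Glue crawler can typically register as partition columns so that Athena queries can filter on them.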
