Selected Open Projects
Chess Data Lake
A Data Lake project using AWS services to ingest, store, transform, and analyze chess game data
- Apache Spark
- AWS Glue
- AWS Athena
- AWS S3
Querido Diário/Querido diário data processing
Project to scrape websites publishing official gazzetes of Brazilian municipalities in order to make them more accessible and transparent
- Scrapy
- MinIO
- PostgreSQL
- Elastic Search
- Podman
Grape diseases
ML project that uses AWS Sagemaker to train a RandomForest classifier to identify and classify leaf images into three disease classes and one missing disease class
- Scikit-Learn
- AWS Sagemaker