Twitch link: https://www.twitch.tv/videos/740791498
Very practical, I’m feeling the same.
Author: Francesco Gadaleta
I do stuff with math, machine learning and programming.
- Founder & Software Engineer @ amethix.com
- Host @datascienceathome.com
- https://www.linkedin.com/in/fragadaleta
- https://medium.com/@frag
Data
- Upstream data update frequency
- Validation data set from upstream team
- Data latency
- 1 week old data, max allowed
- Real-time data ingestion?
- globalization - Regions?
Service performance
- Required TPS
- Request Latency
- Storage retention time
- DVC tool, data version control
- https://dvc.org/
Science
- Model accuracy %
- Arruracy improve vs profit improve
- ROI
- Model management?
- Cuberflow
- MLflow
- AirFlow
Business
- Success measure for project
- Important business metrics from up/down stream teams
- Used for talk with stakeholders
- Debate with data
Team
- Simple structure is gold
- Coding standard
- Especially scientist and engineer and economist
- Configuration standard template, across within team Remove fun part. Fun means risk.
- Data scientist (with no engineering workflow concept) will need 2 engineers support
- so 1 applied scientist is enough
- Put people in the same room