What can AI do?

No title

Recommendation systems
Self driving
Producing insights from data
...

AI tools?

Unsupervised learning
- Detects patterns in data, without labels.
Reinforcement learning
- Learning by trial and error.
- Used in robotics, gaming, etc.
Generative AI
- Generating new content, by looking at existing data (image, text, audio, etc.).
Supervised learning
- Captcha / Classification.

GPT Assistant training pipeline

Pretraining
- Uses large datasets.
- Uses Transformer Architecture.
- High GPU usage and months of training.
Supervised finetuning
Reward Modeling
- Gets ratings for answers from humans.
Reinforcement learning
- Uses the data from the previous step to improve the model.

ChatGPT used Reward Modeling and Reinforcement learning to improve their model.

AI journey

2010 to 2020 was the decade of supervised learning
- AdTech Targeting
- Search
- Personalization
- Needs large upfront investment in data and compute.
- Requires large datasets.
2020 to 2030 is the decade of Generative AI

Generative AI Applications can be made in 6months

Supervised learning

Get labeled data (1 month)
Train AI model (3 month)
Deploy and iterate (3 months)

Prompt-based AI

Get labeled set, if you have a very exotic use case (NQL, 1month)
Specify prompt (1 hour)
Deploy and iterate (days)

In this article