A comprehensive interactive exploration of Predictive AI — the ML pipeline, 8-layer stack, classification/regression/forecasting, gradient boosting, benchmarks, market data, and more.
~63 min read · Interactive Reference

The end-to-end machine learning pipeline for predictive / discriminative AI — from problem definition through monitoring and retraining.
1. **Problem Definition**: define target, success metrics, constraints
2. **Data Collection**: gather training data from DBs, APIs, logs
3. **Data Cleaning & EDA**: handle missing values, outliers, distributions
4. **Feature Engineering**: transform, encode, create informative features
5. **Model Selection**: choose algorithm family, baseline models
6. **Training**: fit model, tune hyperparameters, validate
7. **Evaluation**: cross-validation, hold-out sets, precision/recall, AUC
8. **Deployment**: REST/gRPC API, batch scoring, edge deployment
9. **Monitoring & Retraining**: drift detection, alerts, scheduled retraining
Predictive AI systems follow a structured pipeline from raw data to deployed prediction:
┌────────────────────────────────────────────────────────────┐
│                   PREDICTIVE AI PIPELINE                   │
│                                                            │
│  1. DATA             2. FEATURES         3. TRAINING       │
│  ─────────────       ──────────────      ──────────────    │
│  Collect &           Engineer &          Fit model to      │
│  label               transform           labelled data;    │
│  training data       input variables     minimise loss     │
│                                                            │
│  4. EVALUATION       5. DEPLOYMENT       6. MONITORING     │
│  ─────────────       ──────────────      ──────────────    │
│  Measure             Serve predictions   Monitor drift,    │
│  accuracy,           via API or          accuracy, and     │
│  precision, AUC      batch pipeline      data quality      │
│                                                            │
│  ──────── RETRAIN LOOP: NEW DATA → IMPROVED MODEL ──────── │
└────────────────────────────────────────────────────────────┘
| Step | What Happens |
|---|---|
| Data Collection | Historical labelled examples are gathered (inputs + known correct outputs) |
| Preprocessing | Data is cleaned, normalised, and missing values are handled |
| Feature Engineering | Raw data is transformed into numeric features the model can learn from |
| Model Training | A learning algorithm adjusts its parameters to minimise prediction error on training data |
| Validation | Model performance is evaluated on a held-out validation set to detect overfitting |
| Testing | Final performance measurement on a completely unseen test set |
| Deployment | Model is packaged and served via API, batch job, or embedded in an application |
| Monitoring | Prediction accuracy and data distributions are tracked over time in production |
| Retraining | Model is periodically updated with new labelled data as patterns evolve |
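The steps above can be sketched in a few lines of scikit-learn. The synthetic dataset, the 60/20/20 split, and the choice of logistic regression are illustrative assumptions, not prescriptions:

```python
# Minimal sketch of the pipeline: collect, split, train, validate, test.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Data collection: synthetic labelled examples stand in for real data
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

# Split into train / validation / test (60 / 20 / 20)
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.4, random_state=42)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=42)

# Model training: the algorithm adjusts parameters to minimise error
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Validation: check for overfitting on held-out data
val_acc = accuracy_score(y_val, model.predict(X_val))

# Testing: final measurement on completely unseen data
test_acc = accuracy_score(y_test, model.predict(X_test))
```

Deployment and monitoring would wrap this fitted `model` in a serving layer and a drift check, which the later sections cover.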
| Parameter | What It Controls |
|---|---|
| Learning Rate | How aggressively the model updates weights during training |
| Number of Estimators | Number of trees in ensemble methods (Random Forest, XGBoost) |
| Max Depth | Maximum depth of decision trees; controls model complexity |
| Regularisation (L1/L2) | Penalises model complexity to prevent overfitting |
| Threshold | Classification probability cutoff (e.g., flag fraud if P > 0.7) |
| Class Weights | Adjust training emphasis on minority classes in imbalanced datasets |
| Feature Importance | Ranking of input variables by their contribution to predictions (a model output used for interpretation, not a tuned parameter) |
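Several of these knobs appear together on a random forest. A minimal sketch, assuming scikit-learn; the hyperparameter values are illustrative, not recommended defaults:

```python
# Sketch of the tuning knobs from the table, shown on a random forest.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Imbalanced dataset: 90% negative, 10% positive (20 features by default)
X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)

model = RandomForestClassifier(
    n_estimators=200,          # number of trees in the ensemble
    max_depth=6,               # limits tree complexity
    class_weight="balanced",   # up-weight the minority (e.g. fraud) class
    random_state=0,
).fit(X, y)

# Threshold: flag the positive class only when P > 0.7
proba = model.predict_proba(X)[:, 1]
flagged = proba > 0.7

# Feature importance: contribution of each input variable
top_feature = int(np.argmax(model.feature_importances_))
```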
Random forests remain among the most-used ML algorithms in Kaggle competitions, despite deep learning advances.
XGBoost's gradient boosting approach won more Kaggle competitions than any other single algorithm between 2015 and 2020.
Logistic regression, invented in 1958, still powers billions of real-time ad-targeting decisions daily.
The predictive ML stack is organised into eight layers, each covering distinct components and technologies:
| Layer | What It Covers |
|---|---|
| 1. Data & Label Infrastructure | Data collection, labelling pipelines, annotation tooling, data warehouses |
| 2. Feature Engineering & Stores | Feature computation, transformation, storage, and serving infrastructure |
| 3. Model Training Infrastructure | GPU/CPU compute, distributed training, experiment tracking |
| 4. Model Development Frameworks | scikit-learn, XGBoost, TensorFlow, PyTorch, and the ML ecosystem |
| 5. AutoML & Model Selection | Automated architecture search, hyperparameter optimisation, no-code ML |
| 6. Model Registry & Versioning | Storing, versioning, and managing trained model artefacts |
| 7. Deployment & Serving | Real-time inference endpoints, batch prediction, embedded ML |
| 8. Monitoring, Observability & Retraining | Drift detection, data quality monitoring, automated retraining pipelines |
Fifteen distinct prediction paradigms — from binary classification to zero-shot learning.
- **Binary Classification**: spam/not-spam, fraud/legit, churn/retain. Core methods: logistic regression, SVM, gradient boosting.
- **Multi-Class Classification**: image recognition, named entity recognition. Core methods: softmax, random forest, deep neural networks.
- **Multi-Label Classification**: document tagging, scene annotation. Multiple labels per instance; binary relevance, classifier chains.
- **Regression**: house prices, demand forecasting, salary prediction. Linear, polynomial, neural regression models.
- **Time-Series Forecasting**: sales, weather, energy demand. ARIMA, Prophet, N-BEATS, temporal fusion transformers.
- **Anomaly Detection**: fraud, intrusion, defect detection. Isolation forest, autoencoders, one-class SVM.
- **Ranking**: search results, recommendations. LambdaMART, RankNet, listwise learning-to-rank.
- **Survival Analysis**: churn prediction, reliability engineering. Cox proportional hazards, Kaplan-Meier estimator.
- **Object Detection**: bounding box localisation. YOLO, Faster R-CNN, DETR, anchor-free detectors.
- **Image Classification**: scene & object categorisation. CNN, Vision Transformer (ViT), EfficientNet.
- **Sequence Labelling**: NER, POS tagging, slot filling. CRF, BiLSTM-CRF, token-level transformers.
- **Structured Prediction**: parsing, semantic segmentation. Energy-based models, graph neural networks.
- **Probability Calibration**: risk scoring with reliable probabilities. Platt scaling, isotonic regression, temperature scaling.
- **Ordinal Classification**: ratings, severity levels, ordered categories. Proportional odds model, threshold-based approaches.
- **Few-Shot / Zero-Shot Learning**: generalise with minimal labels. CLIP, prompt-tuning, prototypical networks, meta-learning.
Assigns each input to one of a fixed set of discrete categories.
| Classification Type | What It Does | Examples |
|---|---|---|
| Binary Classification | Predicts one of two classes (positive/negative) | Fraud / not fraud; spam / not spam; malignant / benign |
| Multi-Class Classification | Predicts one of three or more classes | Image category (cat/dog/bird); sentiment (positive/neutral/negative) |
| Multi-Label Classification | Assigns multiple labels simultaneously | Topic tagging (news article tagged finance + politics + AI) |
| Ordinal Classification | Predicts ordered categories | Star ratings (1–5); severity scores (mild/moderate/severe) |
| Hierarchical Classification | Predicts within a category hierarchy | Product taxonomy; medical ICD coding |
Real-World Examples:
| Domain | Classification Task | Model Type |
|---|---|---|
| Email | Spam detection | Naive Bayes / XGBoost |
| Healthcare | Tumour malignancy prediction | CNN / XGBoost |
| Finance | Loan default prediction | Logistic Regression / LightGBM |
| E-Commerce | Product category assignment | BERT / XGBoost |
| Security | Intrusion detection | Random Forest / DNN |
| HR | Resume screening | BERT / Logistic Regression |
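Binary and multi-class classification share the same estimator interface. A minimal sketch, assuming scikit-learn; the synthetic three-class dataset is a placeholder:

```python
# Sketch: multi-class classification with per-class probabilities (softmax).
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# One of three categories (e.g. positive / neutral / negative sentiment)
X, y = make_classification(n_samples=300, n_informative=4, n_classes=3, random_state=1)
clf = LogisticRegression(max_iter=1000).fit(X, y)

probs = clf.predict_proba(X[:1])   # one probability per class, summing to 1
pred = clf.predict(X[:1])          # argmax of the probabilities
```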
Predicts a continuous numeric value from input features.
| Regression Type | What It Does | Examples |
|---|---|---|
| Linear Regression | Models a linear relationship between inputs and output | Housing price estimation |
| Polynomial Regression | Fits curved relationships | Non-linear growth curves |
| Ridge / Lasso Regression | Regularised linear regression | High-dimensional feature sets |
| Gradient Boosting Regression | Ensemble regression for non-linear relationships | Demand forecasting, revenue prediction |
| Neural Regression | Deep learning for complex regression tasks | Energy consumption, stock price targets |
Real-World Examples:
| Domain | Regression Task | Typical Model |
|---|---|---|
| Real Estate | Property price prediction | XGBoost / LightGBM |
| Retail | Sales volume forecasting | LightGBM / LSTM |
| Finance | Portfolio return estimation | Ridge Regression / DNN |
| Energy | Electricity demand prediction | Prophet / LSTM |
| Manufacturing | Equipment remaining useful life | Random Forest / XGBoost |
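The difference between plain and regularised linear regression from the table can be seen directly in the fitted coefficients. A minimal sketch on synthetic data, assuming scikit-learn:

```python
# Sketch: linear vs ridge (L2-regularised) regression.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.metrics import mean_absolute_error

X, y = make_regression(n_samples=200, n_features=10, noise=10.0, random_state=0)

linear = LinearRegression().fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)   # the L2 penalty shrinks coefficients

# Ridge coefficients have a smaller norm than the unpenalised solution
shrunk = np.linalg.norm(ridge.coef_) <= np.linalg.norm(linear.coef_) + 1e-9
mae = mean_absolute_error(y, ridge.predict(X))
```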
Identifies observations that deviate significantly from expected patterns.
| Anomaly Type | What It Does | Examples |
|---|---|---|
| Point Anomaly | A single value is abnormal relative to the rest | Fraudulent credit card transaction |
| Contextual Anomaly | A value is abnormal in context but not globally | Normal temperature reading in wrong season |
| Collective Anomaly | A sequence of values together is abnormal | Network intrusion pattern |
Anomaly Detection Methods:
| Method | How It Works | Best For |
|---|---|---|
| Isolation Forest | Isolates anomalies via random decision trees; anomalies are easier to isolate | General-purpose tabular anomaly detection |
| One-Class SVM | Learns the boundary of normal data; flags anything outside | High-dimensional feature spaces |
| Autoencoder | Learns to reconstruct normal data; high reconstruction error = anomaly | Complex, unstructured data; network traffic |
| Statistical Methods (Z-score, IQR) | Flags values beyond statistical thresholds | Simple univariate time-series monitoring |
| LSTM Anomaly Detection | Learns normal temporal patterns; flags deviations | Time-series: log monitoring, IoT sensor data |
| DBSCAN | Density-based clustering; outliers are unclustered points | Spatial anomalies, customer behaviour |
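The isolation-forest row can be illustrated on planted point anomalies. A minimal sketch, assuming scikit-learn; the contamination rate is an assumed prior, not a learned quantity:

```python
# Sketch: point-anomaly detection with an isolation forest.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)
normal = rng.normal(0, 1, size=(300, 2))    # expected behaviour
outliers = rng.uniform(6, 8, size=(5, 2))   # far-away points, e.g. fraud
X = np.vstack([normal, outliers])

detector = IsolationForest(contamination=0.02, random_state=0).fit(X)
labels = detector.predict(X)                # -1 = anomaly, +1 = normal
n_flagged = int((labels == -1).sum())
```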
Orders or suggests items based on predicted relevance or preference.
| Technique | How It Works | Examples |
|---|---|---|
| Collaborative Filtering | Finds users with similar behaviour; recommends what they liked | Netflix, Spotify, Amazon |
| Content-Based Filtering | Recommends items similar to what the user previously liked | News recommendation, YouTube |
| Hybrid Recommendation | Combines collaborative and content-based signals | Most modern systems (Netflix, Spotify) |
| Matrix Factorisation | Decomposes user-item interaction matrix into latent factors | Classic recommendation baseline (SVD) |
| Learning to Rank (LTR) | Trains a model to order results by predicted relevance | Google Search, Bing, e-commerce search |
| Two-Tower Models | Separate encoder towers for user and item; dot-product similarity | YouTube, TikTok, Pinterest at scale |
| Deep Learning Recommendation Models (DLRM) | Deep neural networks with sparse and dense features | Meta's DLRM; industry-standard at scale |
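The matrix-factorisation row reduces to a truncated SVD of the user-item matrix. A minimal sketch in numpy; the tiny ratings matrix is invented, with 0 meaning "unrated":

```python
# Sketch: rank-2 factorisation of a 4-user x 4-item ratings matrix.
import numpy as np

R = np.array([[5, 4, 0, 1],
              [4, 5, 1, 0],
              [1, 0, 5, 4],
              [0, 1, 4, 5]], dtype=float)

# Decompose into latent user and item factors
U, s, Vt = np.linalg.svd(R, full_matrices=False)
k = 2
user_factors = U[:, :k] * s[:k]   # users in a 2-D latent space
item_factors = Vt[:k, :]          # items in the same latent space

# The low-rank reconstruction assigns scores to the unrated (0) cells
R_hat = user_factors @ item_factors
```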
Projects future values based on historical temporal data.
| Approach | How It Works | Best For |
|---|---|---|
| ARIMA / SARIMA | Statistical autoregressive integrated moving average models | Univariate, stationary time series |
| Prophet (Meta) | Additive model with trend, seasonality, and holiday effects | Business metrics; missing data tolerant |
| Exponential Smoothing | Weighted average of past observations; higher weight to recent data | Short-horizon demand forecasting |
| LSTM / GRU | Recurrent neural networks that learn long-range temporal dependencies | Complex, multivariate sequences |
| Temporal Fusion Transformer (TFT) | Transformer for multi-horizon, multivariate forecasting | Long-horizon forecasting with covariates |
| N-BEATS / N-HiTS | Pure neural architectures for time series; competitive with transformers | General time-series forecasting |
| Chronos (Amazon) | Foundation model for zero-shot time-series forecasting | Transfer learning across time-series tasks |
| TimesFM (Google) | Large pre-trained time-series forecasting model | Few-shot and zero-shot forecasting |
| Moirai (Salesforce) | Universal time-series forecasting foundation model | Multi-domain forecasting |
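The exponential-smoothing row is simple enough to implement directly: each new level is a weighted average of the latest observation and the previous level. A minimal pure-Python sketch; the demand series and the smoothing constant alpha are invented:

```python
# Sketch: simple exponential smoothing; recent observations weigh more.
def exponential_smoothing(series, alpha=0.3):
    """Return the smoothed level after each observation."""
    level = series[0]
    smoothed = [level]
    for value in series[1:]:
        level = alpha * value + (1 - alpha) * level
        smoothed.append(level)
    return smoothed

history = [112, 118, 132, 129, 121, 135, 148, 148]  # e.g. monthly demand
fitted = exponential_smoothing(history)
next_forecast = fitted[-1]   # flat one-step-ahead forecast
```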
Assigns a probability or score representing the likelihood of a specific risk event.
| Risk Scoring Type | What It Predicts | Real-World Examples |
|---|---|---|
| Credit Risk | Probability of loan default | FICO Score, VantageScore, bank internal models |
| Fraud Risk | Probability of fraudulent transaction | Visa Advanced AI, Stripe Radar, PayPal AI |
| Churn Risk | Probability a customer will cancel or leave | Salesforce Einstein Churn, Gainsight AI |
| Health Risk | Patient probability of adverse health event | Sepsis prediction, readmission risk (EHR models) |
| Insurance Risk | Probability of claim; premium pricing | Progressive Snapshot AI, Lemonade AI |
| Cyber Risk | Likelihood of security breach or vulnerability | CrowdStrike Risk Score, Tenable |
| Operational Risk | Probability of equipment failure or process breakdown | Predictive maintenance scores (GE, Siemens) |
Classifying, extracting, or structuring information from text — without generating new content.
| NLP Task | What It Does | Models |
|---|---|---|
| Sentiment Analysis | Classify text sentiment (positive/negative/neutral) | BERT, RoBERTa, DistilBERT |
| Named Entity Recognition (NER) | Identify and label entities (person, org, location) | BERT-NER, spaCy, Flair |
| Text Classification | Assign text to predefined categories | BERT, XGBoost + TF-IDF |
| Intent Classification | Identify user intent in conversational inputs | BERT, Dialogflow, Amazon Lex |
| Relation Extraction | Identify relationships between entities in text | BERT-based RE models |
| Text Summarisation (Extractive) | Select and rank existing sentences from document | BERT-based, TextRank |
| Question Answering (Extractive) | Extract the answer span from a context document | BERT, RoBERTa (SQuAD) |
| Toxicity Detection | Identify harmful, abusive, or policy-violating content | Perspective API, OpenAI Moderation API |
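The classical text-classification baseline from the table (TF-IDF features plus a linear model) fits in a few lines. A minimal sketch, assuming scikit-learn; the four-document corpus and its labels are invented:

```python
# Sketch: TF-IDF + logistic regression for spam/ham text classification.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = ["free prize, claim now", "win money instantly",
         "meeting moved to 3pm", "see attached project report"]
labels = ["spam", "spam", "ham", "ham"]

clf = make_pipeline(TfidfVectorizer(), LogisticRegression())
clf.fit(texts, labels)

prediction = clf.predict(["claim your free money now"])[0]
```

In practice a BERT-style encoder replaces the TF-IDF step, but the predict-a-label interface stays the same.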
The fundamental model families powering predictive AI — from interpretable baselines to automated ensembles.
- **Gradient-Boosted Trees**: XGBoost, LightGBM, CatBoost. Sequential weak learners correcting predecessors. Tabular SOTA; dominant in competitions and production.
- **Random Forest**: bagging of decision trees with feature randomisation. Robust, minimal tuning, strong baseline for classification and regression.
- **Deep Neural Networks**: MLP, CNN, Transformer architectures. Learn representations end-to-end; excel on vision, sequence, and large-scale tabular data.
- **Support Vector Machines**: kernel trick for non-linear boundaries. Max-margin classifier; effective on small-to-medium datasets with careful tuning.
- **Logistic / Linear Regression**: simple, interpretable, and fast inference. Excellent baseline; widely used in regulated industries for explainability.
- **k-Nearest Neighbours**: instance-based lazy learning; no explicit training phase. Predictions via distance metrics; works well with low-dimensional data.
- **Naive Bayes**: probabilistic classifier with independence assumption. Fast text classification; strong baseline for NLP tasks.
- **AutoML Ensembles**: Auto-sklearn, H2O AutoML, AutoGluon. Automated model selection, hyperparameter tuning, and stacking for optimal performance.
The dominant algorithm for structured / tabular data in production.
| Aspect | Detail |
|---|---|
| Core Mechanism | Builds an ensemble of decision trees sequentially; each tree corrects the errors of its predecessors |
| Why It Dominates | Handles mixed data types, missing values, and non-linear relationships without extensive preprocessing |
| Key Advantage | Consistently outperforms deep learning on tabular data; fast training and inference |
| Used For | Fraud detection, credit scoring, churn prediction, ranking, any structured data task |
| Top Implementations | XGBoost, LightGBM, CatBoost |
Key GBT Variants:
| Implementation | Introduced By | Key Advantage |
|---|---|---|
| XGBoost | Tianqi Chen, 2014 | Regularised boosting; tree pruning; GPU support; Kaggle champion |
| LightGBM | Microsoft, 2017 | Leaf-wise growth; histogram binning; fastest large-dataset training |
| CatBoost | Yandex, 2017 | Native categorical feature handling; ordered boosting; minimal preprocessing |
| HistGradientBoosting | scikit-learn | Native sklearn implementation; competitive at scale |
| Aspect | Detail |
|---|---|
| Core Mechanism | Builds many uncorrelated decision trees in parallel; aggregates predictions by majority vote or averaging |
| Why It Works | Reduces variance through averaging; robust to outliers and noise |
| Key Advantage | Highly reliable out-of-the-box; good baseline for any classification or regression task |
| Limitation | Slower inference than GBT; less accurate on complex patterns |
| Used For | Medical diagnosis, risk scoring, feature selection, general-purpose classification |
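The variance-reduction claim can be checked empirically: an ensemble of randomised trees generally cross-validates better than a single deep tree. A minimal sketch, assuming scikit-learn; dataset and tree counts are illustrative:

```python
# Sketch: bagging many trees vs a single decision tree, via cross-validation.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=600, n_features=20, n_informative=5,
                           random_state=0)

tree_acc = cross_val_score(DecisionTreeClassifier(random_state=0),
                           X, y, cv=5).mean()
forest_acc = cross_val_score(RandomForestClassifier(n_estimators=200,
                                                    random_state=0),
                             X, y, cv=5).mean()
```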
| Aspect | Detail |
|---|---|
| Core Mechanism | Learns a linear decision boundary; outputs probabilities via sigmoid/softmax function |
| Why It Persists | Fast, interpretable, probabilistically calibrated, and robust with regularisation |
| Key Advantage | Fully transparent; coefficients directly interpretable as feature contributions |
| Used For | Credit scoring (FICO), medical risk stratification, marketing propensity models |
| Limitation | Cannot capture non-linear relationships without feature engineering |
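The transparency property above means the fitted coefficients can be read directly as directions of effect. A minimal sketch on a synthetic credit-risk example; the features, coefficients, and labelling rule are all invented:

```python
# Sketch: logistic regression coefficients as interpretable feature effects.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
income = rng.normal(50, 10, 500)
debt = rng.normal(20, 5, 500)
# Synthetic rule: default risk rises with debt, falls with income
default = (0.3 * debt - 0.2 * income + rng.normal(0, 1, 500) > -4).astype(int)

X = np.column_stack([income, debt])
model = LogisticRegression(max_iter=1000).fit(X, default)

# Sign of each coefficient gives the direction of its effect on risk
income_coef, debt_coef = model.coef_[0]
```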
| Aspect | Detail |
|---|---|
| Core Mechanism | Finds the maximum-margin hyperplane separating classes in high-dimensional space |
| Kernel Trick | Projects data into higher dimensions where linear separation becomes possible |
| Key Advantage | Effective in high-dimensional spaces; memory-efficient; theoretically well-grounded |
| Used For | Text classification, bioinformatics, image classification (pre-deep learning) |
| Limitation | Does not scale well to large datasets; hyperparameter tuning is critical |
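The kernel trick can be demonstrated on a dataset that no straight line separates. A minimal sketch, assuming scikit-learn; the two-moons data and C/gamma values are illustrative:

```python
# Sketch: RBF-kernel SVM vs linear SVM on a non-linearly-separable dataset.
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.1, random_state=0)

# The RBF kernel implicitly projects into a space where the classes separate
rbf = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X, y)
linear = SVC(kernel="linear", C=1.0).fit(X, y)

rbf_acc = rbf.score(X, y)
linear_acc = linear.score(X, y)
```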
The dominant architecture for unstructured data — images, text, audio, and video.
| Aspect | Detail |
|---|---|
| Core Mechanism | Learns hierarchical representations through multiple layers of non-linear transformations |
| Why It Dominates Unstructured Data | Automatically learns features from raw data; scales with data volume and compute |
| Key Advantage | State-of-the-art on image classification, NLP classification, speech recognition |
| Used For | Medical imaging, fraud detection on sequences, NLP classification, recommendation |
Key DNN Architectures for Prediction:
| Architecture | Best For | Key Examples |
|---|---|---|
| CNN (Convolutional Neural Network) | Image classification, detection, segmentation | ResNet, EfficientNet, ViT |
| RNN / LSTM | Sequential data, time-series prediction | Demand forecasting, anomaly detection |
| Transformer (Encoder-only) | Text classification, NLP tasks | BERT, RoBERTa, DeBERTa |
| Tabular DNN | Large-scale structured data | TabNet, NODE, FT-Transformer |
| Graph Neural Network (GNN) | Relational and graph-structured data | GraphSAGE, GAT, fraud networks |
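The simplest member of this family, a small multilayer perceptron, already learns the non-linear representations the table describes. A minimal sketch, assuming scikit-learn; the layer sizes and dataset are illustrative:

```python
# Sketch: a small MLP learning a non-linear decision boundary.
from sklearn.datasets import make_circles
from sklearn.neural_network import MLPClassifier

# Concentric circles: impossible for a linear model, easy for an MLP
X, y = make_circles(n_samples=400, noise=0.1, factor=0.5, random_state=0)

mlp = MLPClassifier(hidden_layer_sizes=(16, 16),  # two hidden layers
                    max_iter=2000, random_state=0).fit(X, y)
acc = mlp.score(X, y)
```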
| Method | What It Does | Best For |
|---|---|---|
| Naive Bayes | Applies Bayes' theorem with feature independence assumption | Text classification, spam filtering, fast baseline |
| Bayesian Networks | Probabilistic graphical model of variable relationships | Medical diagnosis, causal reasoning |
| Gaussian Processes | Non-parametric probabilistic predictions with uncertainty estimates | Small datasets, scientific regression, active learning |
| Bayesian Optimisation | Find optimal hyperparameters or model architecture efficiently | AutoML, neural architecture search |
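The Naive Bayes row can be made concrete with toy word counts. A minimal sketch, assuming scikit-learn's multinomial variant; the vocabulary and counts are invented:

```python
# Sketch: Bayes' theorem with feature independence, on toy word counts.
import numpy as np
from sklearn.naive_bayes import MultinomialNB

# Rows = documents, columns = counts of ["free", "meeting", "prize"]
X = np.array([[3, 0, 2],   # spam
              [2, 0, 3],   # spam
              [0, 4, 0],   # ham
              [0, 2, 0]])  # ham
y = np.array([1, 1, 0, 0])  # 1 = spam, 0 = ham

nb = MultinomialNB().fit(X, y)
# A new document with mostly spam words gets a high spam probability
spam_prob = nb.predict_proba([[2, 0, 1]])[0, 1]
```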
| Aspect | Detail |
|---|---|
| Core Mechanism | Classifies a new point based on the majority label of its k nearest neighbours |
| Advantage | Non-parametric; no training phase; intuitive |
| Limitation | Slow inference at scale; sensitive to irrelevant features and data scale |
| Used For | Recommendation systems (collaborative filtering baseline), anomaly detection |
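The lazy-learning mechanism is visible in code: fitting just stores the data, and prediction is a vote among the nearest stored points. A minimal sketch, assuming scikit-learn; the 2-D points are toy data:

```python
# Sketch: k-NN prediction by majority vote of the k nearest examples.
from sklearn.neighbors import KNeighborsClassifier

X = [[1, 1], [1, 2], [2, 1],   # cluster A
     [8, 8], [8, 9], [9, 8]]   # cluster B
y = ["A", "A", "A", "B", "B", "B"]

knn = KNeighborsClassifier(n_neighbors=3).fit(X, y)  # just stores the data
label = knn.predict([[1.5, 1.5]])[0]   # nearest three points are all "A"
```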
| Algorithm | Best For | Tabular Data | Unstructured Data | Interpretability | Training Speed |
|---|---|---|---|---|---|
| XGBoost / LightGBM | Structured / tabular | Excellent | Poor | Medium | Very Fast |
| Random Forest | General purpose | Very Good | Poor | Medium | Fast |
| Logistic Regression | Linear, regulated domains | Good | Poor | Excellent | Very Fast |
| SVM | High-dimensional sparse | Good | Moderate | Low | Moderate |
| Deep Neural Networks | Unstructured data | Moderate | Excellent | Low | Slow |
| BERT / Transformers | NLP classification | Poor | Excellent (text) | Low | Slow |
| GNNs | Graph / relational data | Moderate | Excellent (graphs) | Low | Moderate |
The leading libraries, platforms, and services powering the predictive AI ecosystem.
| Tool | Provider | Focus |
|---|---|---|
| scikit-learn | Open-source | Swiss-army knife ML; preprocessing, models, evaluation |
| XGBoost | DMLC | Gradient boosting; Kaggle champion; tabular data |
| LightGBM | Microsoft | Fast gradient boosting; histogram-based; large datasets |
| CatBoost | Yandex | Gradient boosting with native categorical support |
| PyTorch | Meta | Deep learning framework; dynamic graphs; research |
| TensorFlow / Keras | Google | Production DL framework; TF Serving, TFLite |
| Hugging Face | Hugging Face | Transformers hub; fine-tuning; model sharing |
| Amazon SageMaker | AWS | End-to-end ML platform; training, deployment, monitoring |
| Vertex AI | Google | GCP ML platform; AutoML, pipelines, model registry |
| Azure ML | Microsoft | Enterprise ML platform; Designer, SDK, endpoints |
| MLflow | Databricks | Experiment tracking, model registry, deployment |
| Feast | Tecton / LF | Open-source feature store; online/offline serving |
| Weights & Biases | W&B | Experiment tracking, hyperparameter sweeps, artifacts |
| Evidently AI | Evidently | ML monitoring; data drift, model quality dashboards |
| Library | Purpose | Deployment | Highlights |
|---|---|---|---|
| scikit-learn | Comprehensive ML library | Open-Source (any OS; Python 3.9+; CPU-only) | Most widely used ML library; 500+ algorithms; consistent API |
| XGBoost | Gradient boosting | Open-Source (any OS; Python/C++/R; CPU or NVIDIA GPU) | Kaggle champion; GPU support; regularised boosting |
| LightGBM | Gradient boosting | Open-Source (any OS; Python/C++/R; CPU or GPU) | Microsoft; fastest large-scale GBM; leaf-wise growth |
| CatBoost | Gradient boosting | Open-Source (any OS; Python/C++/R; CPU or NVIDIA GPU) | Yandex; native categorical handling; minimal preprocessing |
| imbalanced-learn | Imbalanced datasets | Open-Source (any OS; Python 3.9+; CPU-only) | SMOTE, ADASYN, and ensemble methods for imbalanced learning |
| statsmodels | Statistical models | Open-Source (any OS; Python 3.9+; CPU-only) | ARIMA, OLS, GLM; p-values and confidence intervals |
| Optuna | Hyperparameter optimisation | Open-Source (any OS; Python 3.8+; CPU; optional DB backend) | Bayesian search; pruning; distributed; most popular HPO library |
| SHAP | Explainability | Open-Source (any OS; Python 3.8+; CPU) | Feature attribution; model-agnostic; integrates with all major models |
| Framework | Maintained By | Deployment | Highlights |
|---|---|---|---|
| PyTorch | Meta | Open-Source (any OS; Python; NVIDIA GPU recommended; CUDA 11.8+) | De facto standard for research and production deep learning |
| TensorFlow | Google | Open-Source (any OS; Python; CPU or NVIDIA GPU; CUDA 11.8+) | Production-grade; TF Serving; TFLite for edge deployment |
| Keras | Google | Open-Source (any OS; Python; runs on TensorFlow backend) | High-level API on TensorFlow; beginner-friendly |
| JAX | Google | Open-Source (any OS; Python; TPU / NVIDIA GPU; XLA-accelerated) | High-performance numerical computing; XLA compilation; growing adoption |
| PyTorch Lightning | Lightning AI | Open-Source (any OS; Python; inherits PyTorch GPU requirements) | Structured PyTorch; reduces boilerplate; reproducibility |
| Hugging Face Transformers | Hugging Face | Open-Source (any OS; Python; CPU or NVIDIA GPU; CUDA 11.8+) | Pre-trained NLP models; 500k+ model checkpoints; fine-tuning pipeline |
| FastAI | fast.ai | Open-Source (any OS; Python; inherits PyTorch GPU requirements) | High-level PyTorch API; excellent for practitioners |
| Library | Focus | Deployment | Highlights |
|---|---|---|---|
| Prophet | Business forecasting | Open-Source (any OS; Python/R; CPU) | Meta's additive model; handles seasonality and holidays; easy to use |
| statsforecast | Statistical forecasting | Open-Source (any OS; Python; CPU) | Fast, scalable implementation of classical statistical models |
| neuralforecast | DL forecasting | Open-Source (any OS; Python; CPU or NVIDIA GPU) | LSTM, NBEATS, TFT — production neural forecasting |
| Darts | Universal time-series | Open-Source (any OS; Python; CPU or GPU) | Unified interface for statistical, ML, and DL forecasting models |
| sktime | Unified time-series ML | Open-Source (any OS; Python; CPU) | Classification, regression, and forecasting under one API |
| tsfresh | Feature extraction | Open-Source (any OS; Python; CPU) | Automated time-series feature engineering; 789 features per series |
| GluonTS | Probabilistic forecasting | Open-Source (any OS; Python; CPU or GPU; Amazon-maintained) | Amazon; probabilistic deep learning time-series models |
| Platform | Provider | Deployment | Key Capabilities |
|---|---|---|---|
| AWS SageMaker | Amazon | Cloud (AWS — EC2, S3, ECS; GPU: P4d/P5 instances) | End-to-end ML platform; AutoML, training, deployment, monitoring; 100+ built-in algorithms |
| Google Vertex AI | Google | Cloud (GCP — Compute Engine, GCS; GPU: A100/H100 VMs) | Unified ML platform; AutoML, custom training, Model Garden, Feature Store |
| Azure Machine Learning | Microsoft | Cloud (Azure — VMs, Blob Storage; GPU: NC/ND series) | Enterprise MLOps; pipeline automation; responsible AI dashboard; Fabric integration |
| Databricks ML | Databricks | Cloud (AWS, Azure, GCP — Spark clusters; GPU optional) | MLflow-native; distributed training; Delta Lake integration; LakeHouse ML |
| IBM Watson Studio | IBM | Hybrid (IBM Cloud; On-Prem via Cloud Pak for Data on x86/POWER servers) | Enterprise ML with governance; OpenScale model monitoring |
| Oracle OCI ML | Oracle | Cloud (Oracle Cloud Infrastructure — OCI Compute; GPU: A100 bare metal) | Database-native ML; AutoML; integrated with Oracle Cloud analytics |
| Snowpark ML | Snowflake | Cloud (Snowflake on AWS, Azure, GCP — uses Snowflake warehouses) | Train and deploy ML models directly in Snowflake without moving data |
| Tool | Deployment | Highlights |
|---|---|---|
| MLflow | Open-Source / Cloud (self-host any infra; managed on Databricks — AWS, Azure, GCP) | De facto standard for experiment tracking; model registry; deployment |
| Weights & Biases (W&B) | Cloud (W&B SaaS on AWS / GCP; self-hosted option on any K8s) | Rich experiment tracking; collaborative; visualisation-first |
| Neptune.ai | Cloud (Neptune SaaS on AWS; self-hosted option) | Experiment tracking; model registry; research-friendly |
| Comet ML | Cloud (Comet SaaS on AWS; self-hosted option) | Experiment management; automated insights; enterprise governance |
| DVC (Data Version Control) | Open-Source (any OS; Git-based; stores data on S3, GCS, Azure Blob, or local) | Version control for data and models; Git-native |
| ClearML | Open-Source / Cloud (self-host Docker/K8s; ClearML Cloud on AWS) | Full MLOps suite; experiment tracking + orchestration + data management |
How predictive / discriminative AI creates value across major industry verticals.
| Use Case | Description | Key Examples |
|---|---|---|
| Credit Scoring | Predict probability of loan default; determine creditworthiness | FICO Score, VantageScore, bank internal models |
| Fraud Detection | Real-time classification of payment transactions as fraudulent | Visa Advanced AI, Stripe Radar, PayPal AI |
| Anti-Money Laundering (AML) | Detect suspicious transaction networks and behaviour patterns | NICE Actimize, Featurespace ARIC, Oracle Financial Crime |
| Algorithmic Trading | Predict short-term price movements; execute systematic strategies | Two Sigma, Renaissance Technologies, Citadel ML |
| Risk Management | Estimate market, credit, and operational risk at portfolio level | BlackRock Aladdin, Bloomberg BAM, internal quant models |
| Customer Churn Prediction | Identify customers likely to close accounts or switch providers | Salesforce Einstein, Pecan AI, DataRobot |
| Insurance Underwriting | Score individual risk for premium pricing | Lemonade AI, Progressive Snapshot, Swiss Re models |
| Loan Origination | Automate and optimise loan approval decisions | Blend, Zest AI, Upstart |
| Use Case | Description | Key Examples |
|---|---|---|
| Medical Imaging Diagnosis | Classify X-rays, CT scans, MRIs, pathology slides | Viz.ai, Aidoc, Google Health AI, PathAI |
| Sepsis Prediction | Predict sepsis onset hours before clinical deterioration | Epic Sepsis Model, Dascena, Johns Hopkins AI |
| Readmission Risk | Predict hospital readmission probability within 30 days | Epic, Optum AI, Health Catalyst |
| Disease Progression | Forecast disease trajectory for chronic conditions | Flatiron, Tempus, Komodo Health |
| Drug Target Identification | Predict protein-drug binding affinity | Atomwise, Insilico Medicine, Schrödinger |
| Clinical Trial Matching | Score patient eligibility against trial criteria | Mendel AI, Deep 6 AI, Transparently AI |
| EHR Risk Stratification | Score patients by risk level for proactive care management | Health Catalyst, Arcadia, Cotiviti AI |
| Pathology AI | Classify tissue samples for cancer detection | PathAI, Paige.AI, Lunit |
| Use Case | Description | Key Examples |
|---|---|---|
| Demand Forecasting | Predict product-level sales volumes across stores and channels | Amazon Forecast, Walmart AI, Blue Yonder |
| Inventory Optimisation | Set optimal stock levels to balance service and cost | o9 Solutions, Llamasoft (Coupa), Relex |
| Churn Prediction | Identify at-risk subscribers or repeat customers | Salesforce Einstein, Amplitude AI, Mixpanel |
| Price Optimisation | Set dynamic prices to maximise revenue | Pros Holdings, Zilliant, Revionics |
| Product Recommendation | Personalise product discovery for each shopper | Amazon, Netflix, Shopify AI |
| Propensity to Buy | Score likelihood of purchase for marketing targeting | Salesforce, HubSpot AI, Amplitude |
| Return Prediction | Forecast product returns for logistics planning | Internal models at Amazon, Zalando |
| Customer Lifetime Value (CLV) | Predict total future revenue from a customer | Pecan AI, DataRobot, Salesforce Einstein |
| Use Case | Description | Key Examples |
|---|---|---|
| Lead Scoring | Predict which leads are most likely to convert | Salesforce Einstein Lead Score, HubSpot AI, 6sense |
| Customer Segmentation | Cluster customers by behaviour and value | Adobe Experience Platform, Segment, Amplitude |
| Campaign Response Prediction | Predict which customers will respond to a campaign | Persado, Optimove, Braze AI |
| Ad Click-Through Rate (CTR) Prediction | Score probability of a user clicking an ad | Google Ads AI, Meta Ads AI, The Trade Desk |
| Attribution Modelling | Predict the contribution of each touchpoint to conversion | Rockerbox, Northbeam, Measured |
| Email Send-Time Optimisation | Predict the best time to send email to each user | Salesforce Marketing Cloud AI, Klaviyo AI |
| Uplift Modelling | Predict incremental impact of an intervention (not just response) | Pylift, CausalML, internal models |
| Use Case | Description | Key Examples |
|---|---|---|
| Predictive Maintenance | Predict equipment failure before it occurs | Siemens MindSphere, GE Predix, IBM Maximo |
| Quality Control / Defect Detection | Classify manufactured goods for defects | Cognex AI, Landing AI, NVIDIA Metropolis |
| Supply Chain Risk Scoring | Predict supplier disruption probability | Resilinc AI, Riskmethods, o9 Solutions |
| Production Yield Prediction | Forecast production output and waste | AspenTech, Rockwell AI, Sight Machine |
| Energy Consumption Forecasting | Predict plant-level energy use for optimisation | EnerNOC AI, AutoGrid, SparkCognition |
| Delivery Time Prediction | Estimate shipment delivery windows | Project44, FourKites, Amazon SWA |
| Use Case | Description | Key Examples |
|---|---|---|
| Candidate Screening | Score résumés against job requirements | HireVue AI, Pymetrics, Eightfold AI |
| Employee Attrition Prediction | Identify flight-risk employees before they resign | Workday AI, Visier, OneModel |
| Performance Prediction | Forecast employee performance ratings | Eightfold AI, Culture Amp |
| Workforce Demand Planning | Forecast staffing needs by role, location, and time | Workday Scheduling AI, Legion AI |
| Compensation Benchmarking | Predict market-competitive compensation | Radford AI, Levels.fyi ML models |
| Use Case | Description | Key Examples |
|---|---|---|
| Intrusion Detection | Classify network traffic as malicious or benign | CrowdStrike Falcon, Darktrace, Vectra AI |
| Malware Classification | Classify files and code as malicious | CrowdStrike, SentinelOne AI, Cylance |
| User & Entity Behaviour Analytics (UEBA) | Score anomalous user behaviour within an organisation | Exabeam, Securonix, Microsoft Sentinel |
| Vulnerability Risk Scoring | Prioritise CVEs by exploitability probability | Tenable Predictive Prioritization, Qualys |
| Phishing Detection | Classify emails and URLs as phishing attempts | Proofpoint AI, Mimecast, Microsoft Defender |
| Insider Threat Detection | Detect anomalous behaviour indicating insider risk | ObserveIT, Dtex Systems, Forcepoint AI |
| Use Case | Description | Key Examples |
|---|---|---|
| Route Optimisation | Predict and select optimal delivery routes | Google Maps Platform, Uber AI, Route4Me |
| ETA Prediction | Predict shipment and delivery arrival times | FourKites, Project44, Amazon AI |
| Demand Forecasting (Ride-Sharing) | Predict rider demand to position drivers | Uber Forecasting, Lyft ML |
| Freight Rate Prediction | Forecast spot and contract freight rates | Freightos, Flexport AI |
| Airport / Traffic Congestion Prediction | Forecast capacity and flow | Google Maps, INRIX AI, HERE AI |
Performance benchmarks for leading predictive model architectures on tabular tasks.
Choosing the right metric is as important as choosing the right model — different business objectives demand different measurement criteria.
| Metric | Formula / Description | Best For |
|---|---|---|
| Accuracy | Correct predictions / Total predictions | Balanced classes; misleading on imbalanced data |
| Precision | TP / (TP + FP) | Minimise false positives (e.g., spam filters) |
| Recall (Sensitivity) | TP / (TP + FN) | Minimise false negatives (e.g., cancer detection) |
| F1 Score | Harmonic mean of Precision and Recall | Imbalanced classes; trade-off metric |
| F-beta Score | Weighted F1 with beta controlling recall/precision trade-off | Domain-specific cost asymmetry |
| AUC-ROC | Area under the Receiver Operating Characteristic curve | Ranking quality; threshold-independent |
| AUC-PR | Area under the Precision-Recall curve | Highly imbalanced datasets; fraud, rare disease |
| Log Loss (Cross-Entropy) | Penalises confident wrong predictions heavily | Probabilistic calibration quality |
| MCC (Matthews Correlation Coefficient) | Balanced metric even for very imbalanced datasets | Binary classification; comprehensive single metric |
| Cohen's Kappa | Agreement beyond chance | Multi-class with imbalanced distribution |
| Confusion Matrix | TP / TN / FP / FN breakdown by class | Detailed per-class error analysis |
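The metrics in the table above all derive from the four confusion-matrix counts. As an illustrative sketch (plain Python, not tied to any particular library), the formulas can be implemented directly:

```python
import math

def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Compute common classification metrics from confusion-matrix counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0          # a.k.a. sensitivity
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    # MCC stays informative even under heavy class imbalance
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "mcc": mcc}

# Example: an imbalanced fraud-style task — accuracy looks fine, recall does not
m = classification_metrics(tp=40, tn=900, fp=10, fn=50)
print(f"accuracy={m['accuracy']:.3f} precision={m['precision']:.3f} "
      f"recall={m['recall']:.3f} f1={m['f1']:.3f} mcc={m['mcc']:.3f}")
```

Note how the example yields 94% accuracy while recall is under 45% — exactly the "misleading on imbalanced data" caveat in the table.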
| Metric | Description | Best For |
|---|---|---|
| MAE (Mean Absolute Error) | Average absolute difference between predicted and actual | Interpretable; robust to outliers |
| MSE (Mean Squared Error) | Average squared difference; penalises large errors heavily | When large errors are particularly costly |
| RMSE (Root MSE) | Square root of MSE; same units as target | Standard regression evaluation metric |
| MAPE (Mean Absolute % Error) | Percentage error relative to actual value | Forecasting; interpretable to business users |
| SMAPE (Symmetric MAPE) | Symmetric version of MAPE; handles zero actuals | Time-series forecasting |
| R² (Coefficient of Determination) | Proportion of variance explained by the model | Model fit quality relative to a baseline |
| WAPE (Weighted APE) | Sum of absolute errors / Sum of actuals | Supply chain and demand forecasting |
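The regression metrics above can likewise be computed from first principles. A minimal sketch (illustrative helper, not from any specific library):

```python
def regression_metrics(y_true, y_pred):
    """MAE, RMSE, MAPE, WAPE, and R² for a regression task (plain Python)."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    rmse = (sum(e * e for e in errors) / n) ** 0.5
    # MAPE skips zero actuals; SMAPE or WAPE are safer when zeros occur
    mape = 100 * sum(abs(e / t) for t, e in zip(y_true, errors) if t != 0) / n
    wape = 100 * sum(abs(e) for e in errors) / sum(abs(t) for t in y_true)
    mean_t = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot if ss_tot else 0.0
    return {"mae": mae, "rmse": rmse, "mape": mape, "wape": wape, "r2": r2}

actual = [100, 120, 80, 140]
forecast = [110, 115, 90, 130]
print(regression_metrics(actual, forecast))
```

RMSE always exceeds (or equals) MAE because squaring weights large errors more heavily — the gap between the two is itself a quick diagnostic for outlier-driven error.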
| Metric | Description | Used In |
|---|---|---|
| NDCG (Normalised Discounted Cumulative Gain) | Measures ranking quality; rewards correct items higher in the list | Search ranking, recommendation systems |
| MAP (Mean Average Precision) | Average precision across all queries | Information retrieval |
| MRR (Mean Reciprocal Rank) | Reciprocal of the rank of the first relevant result | Question answering, search |
| Hit Rate / Recall@k | Fraction of users for whom a relevant item appears in top-k | Recommendation systems |
| Precision@k | Fraction of top-k recommendations that are relevant | Recommendation systems |
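Two of the ranking metrics above, NDCG@k and Precision@k, are short enough to sketch directly (illustrative implementations using graded relevance; the log base-2 discount is the standard formulation):

```python
import math

def dcg_at_k(relevances, k):
    """Discounted cumulative gain: higher-ranked relevant items count more."""
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k):
    """NDCG@k = DCG of this ranking / DCG of the ideal (sorted) ranking."""
    idcg = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / idcg if idcg else 0.0

def precision_at_k(relevances, k):
    """Fraction of the top-k results that are relevant (binary relevance)."""
    return sum(1 for r in relevances[:k] if r > 0) / k

# Relevance grades of results in ranked order (3 = highly relevant, 0 = not)
ranking = [3, 2, 0, 1, 0]
print(f"NDCG@5 = {ndcg_at_k(ranking, 5):.3f}, P@3 = {precision_at_k(ranking, 3):.3f}")
```

A perfectly ordered ranking scores NDCG = 1.0; swapping a relevant item below an irrelevant one lowers the score by an amount that shrinks with rank depth.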
| Metric | Description | Best For |
|---|---|---|
| MASE (Mean Absolute Scaled Error) | MAE scaled by in-sample naive forecast error | Scale-independent; cross-series comparison |
| sMAPE | Symmetric MAPE; bounded between 0% and 200% | Standard competition metric |
| CRPS (Continuous Ranked Probability Score) | Evaluates probabilistic forecast quality | Uncertainty quantification |
| Pinball Loss | Measures quantile forecast accuracy | Probabilistic forecasting |
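MASE and pinball loss reward different things — scale-free point accuracy versus calibrated quantiles. A hedged, self-contained sketch of both (plain Python, simplified to the non-seasonal case for MASE):

```python
def pinball_loss(y_true, y_pred, q):
    """Average pinball (quantile) loss for quantile q in (0, 1):
    under-prediction costs q per unit, over-prediction costs (1 - q)."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        total += q * (t - p) if t >= p else (1 - q) * (p - t)
    return total / len(y_true)

def mase(y_train, y_true, y_pred, season=1):
    """MAE scaled by the in-sample naive (seasonal) forecast error.
    MASE < 1 means the model beats the naive forecast."""
    naive = sum(abs(y_train[i] - y_train[i - season])
                for i in range(season, len(y_train))) / (len(y_train) - season)
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
    return mae / naive

history = [100, 110, 105, 115, 120]
actual = [125, 130]
forecast = [122, 133]
print(f"MASE = {mase(history, actual, forecast):.3f}")
print(f"pinball(q=0.9) = {pinball_loss(actual, forecast, 0.9):.3f}")
```

Because the q=0.9 loss penalises under-prediction 9× as heavily as over-prediction, minimising it pushes the forecast toward the 90th percentile of the outcome distribution.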
| Strategy | How It Works | Best For |
|---|---|---|
| Hold-Out Split | 70/15/15 train/validation/test split | Large datasets; fast evaluation |
| k-Fold Cross-Validation | Rotate through k folds; average performance | Small to medium datasets; robust estimate |
| Stratified k-Fold | Maintain class proportions across folds | Imbalanced classification |
| Time-Series CV (Walk-Forward) | Train on past; validate on next window; roll forward | All time-series and forecasting models |
| Group k-Fold | Ensure same entity (user, patient) is only in one fold | Recommendation; patient-level models |
| Nested Cross-Validation | Outer loop for model evaluation; inner loop for HPO | Unbiased performance estimate with HPO |
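The walk-forward strategy is the one most often implemented by hand, since it must never let the model see the future. An illustrative index generator (scikit-learn's `TimeSeriesSplit` provides equivalent behaviour):

```python
def walk_forward_splits(n_samples, n_splits, test_size):
    """Yield (train_idx, test_idx) pairs: always train on the past,
    validate on the next window, then roll the window forward."""
    for i in range(n_splits):
        test_end = n_samples - (n_splits - 1 - i) * test_size
        test_start = test_end - test_size
        yield list(range(0, test_start)), list(range(test_start, test_end))

# 12 monthly observations, 3 folds of 2 months each
for train, test in walk_forward_splits(12, 3, 2):
    print(f"train=0..{train[-1]}  test={test}")
```

Every training index precedes every test index in each fold — the property that plain k-fold violates on time-series data, causing leakage.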
| Concept | Description |
|---|---|
| Calibration | Alignment between a model's predicted probabilities and actual event frequencies |
| Reliability Diagram | Plot predicted probability vs. observed frequency; well-calibrated model falls on the diagonal |
| Brier Score | Mean squared error between predicted probability and actual binary outcome |
| Platt Scaling | Post-hoc logistic regression to calibrate SVM or other model probabilities |
| Isotonic Regression | Non-parametric calibration; better than Platt for larger datasets |
| Temperature Scaling | Simple, effective calibration for deep neural networks |
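The Brier score and a reliability diagram can both be computed from raw predictions. A minimal sketch (the binning and empty-bin smoothing here are illustrative choices, not a standard):

```python
def brier_score(y_true, y_prob):
    """Mean squared error between predicted probability and binary outcome."""
    return sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)

def reliability_bins(y_true, y_prob, n_bins=5):
    """Bucket predictions by confidence; a well-calibrated model has
    mean predicted probability ≈ observed frequency in each bin."""
    bins = [[] for _ in range(n_bins)]
    for t, p in zip(y_true, y_prob):
        bins[min(int(p * n_bins), n_bins - 1)].append((t, p))
    out = []
    for b in bins:
        if b:
            mean_p = sum(p for _, p in b) / len(b)
            freq = sum(t for t, _ in b) / len(b)
            out.append((round(mean_p, 3), round(freq, 3), len(b)))
    return out

labels = [0, 0, 1, 0, 1, 1, 1, 0, 1, 1]
probs = [0.1, 0.2, 0.3, 0.35, 0.55, 0.6, 0.75, 0.4, 0.85, 0.9]
print(f"Brier = {brier_score(labels, probs):.3f}")
print(reliability_bins(labels, probs))
```

Each tuple in the output is (mean predicted probability, observed frequency, count); on a reliability diagram, well-calibrated bins sit on the diagonal.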
| Benchmark / Dataset | Domain | What It Tests |
|---|---|---|
| Kaggle Competitions | All domains | Real-world predictive modelling challenges; community leaderboards |
| UCI ML Repository | All domains | 600+ classic ML datasets; academic standard |
| OpenML | All domains | 4,000+ datasets; reproducible ML benchmarking |
| MNIST / CIFAR-10/100 | Image classification | Standard CV classification baselines |
| ImageNet (ILSVRC) | Image classification | 1M+ images; CNN benchmark; top-1/top-5 accuracy |
| SQuAD 1.1 / 2.0 | NLP / Reading comprehension | Extractive QA; BERT family benchmark |
| GLUE / SuperGLUE | NLP classification | Multi-task NLP benchmark suite |
| M4 / M5 Competition | Time-series forecasting | Retail demand; 100k series; academic gold standard |
| Electricity / ETT | Time-series forecasting | Energy demand; standard DL forecasting benchmark |
| Adult Income / German Credit | Fairness | Demographic bias assessment in classification |
| MIMIC-III / IV | Healthcare ML | ICU patient data; mortality, sepsis, readmission prediction |
Market sizing and growth projections for the Predictive Analytics ecosystem.
| Metric | Value | Source / Notes |
|---|---|---|
| Global Predictive Analytics Market (2024) | ~$14.9 billion | Grand View Research; includes predictive AI across all verticals |
| Projected Market Size (2030) | ~$67.9 billion | CAGR ~28.5%; driven by enterprise ML adoption and AutoML democratisation |
| % of Enterprises Using ML in Production (2024) | ~54% | McKinsey Global AI Survey 2024 |
| Top Verticals by Spend | BFSI, Healthcare, Retail, Manufacturing, Telecom | Consistent across major analyst reports |
| AutoML Market Size (2024) | ~$1.7 billion | Growing to ~$8.0B by 2030; CAGR ~30%+ |
| MLOps Market Size (2024) | ~$2.1 billion | Projected to reach ~$13.5B by 2030 |
| Enterprise Segment | Adoption Pattern | Key Tools / Platforms |
|---|---|---|
| Large Enterprise (>10,000 employees) | Custom models + cloud ML platforms; dedicated data science teams; MLOps at scale | SageMaker, Vertex AI, Azure ML, Databricks |
| Mid-Market (500–10,000 employees) | AutoML + SaaS ML platforms; growing data science function | DataRobot, H2O, Dataiku, Pecan AI |
| Small Business (<500 employees) | No-code ML; embedded AI in SaaS applications | Obviously AI, Pecan, built-in AI in CRM/ERP |
| Startups | Open-source ML + cloud; rapid experimentation; lean ML stack | scikit-learn, XGBoost, PyTorch + AWS/GCP |
| Driver | Description |
|---|---|
| Cost Reduction Pressure | Predictive models reduce operational costs through automation, efficiency, and waste reduction |
| Cloud ML Democratisation | Managed ML services (SageMaker, Vertex AI) lower the barrier to production deployment |
| AutoML Maturation | Non-specialist practitioners can now build production-grade models without deep ML expertise |
| Regulatory Demand | Financial services, insurance, and healthcare regulators increasingly expect AI-driven risk management |
| Data Availability | Explosion of structured data from CRM, ERP, IoT, and transactional systems provides rich training material |
| ROI Demonstrability | Predictive AI delivers measurable, auditable business value — churn reduction, fraud savings, yield improvement |
| Foundation Model Transfer | Pre-trained models reduce the labelled data and compute required to build production NLP and CV systems |
| Use Case | Typical ROI / Impact | Source |
|---|---|---|
| Fraud Detection | 20–40% reduction in fraud losses; 50%+ reduction in manual review cost | Visa, Mastercard internal reports |
| Predictive Maintenance | 10–25% reduction in maintenance costs; 35–45% reduction in downtime | McKinsey Manufacturing AI Report |
| Demand Forecasting | 15–30% reduction in inventory carrying costs; 5–10% improvement in service levels | Gartner Supply Chain AI Survey |
| Churn Prediction | 5–15% improvement in customer retention rate | Salesforce Customer Success AI benchmarks |
| Credit Scoring (ML vs FICO) | 15–25% better discrimination (AUC); 10–15% reduction in default rates at same approval rate | Upstart, Zest AI published results |
| Medical Imaging AI | 94–99% sensitivity on specific tasks vs. 85–90% for radiologists alone (augmented) | FDA-cleared device studies |
| Segment | Leaders | Challengers |
|---|---|---|
| Cloud ML Platforms | AWS SageMaker, Google Vertex AI, Azure ML | Databricks, Snowflake ML, Oracle OCI |
| Enterprise AutoML | DataRobot, H2O Driverless AI, Dataiku | RapidMiner, SAS Viya, Alteryx AI |
| Open-Source ML | scikit-learn, XGBoost, LightGBM, PyTorch | CatBoost, JAX, FastAI |
| MLOps & Monitoring | MLflow, Weights & Biases, Arize AI | Fiddler, Evidently, NannyML, Arthur AI |
| Feature Stores | Feast, Tecton, Databricks Feature Store | Hopsworks, Vertex Feature Store |
| Time-Series Forecasting | Prophet, statsforecast, Darts | Chronos, TimesFM, Moirai |
| Explainability | SHAP, LIME, InterpretML | Fiddler, Arthur AI, Credo AI |
| Fairness & Governance | Fairlearn, AIF360, Responsible AI Dashboard | Credo AI, Arthur AI, Holistic AI |
Critical challenges and failure modes when deploying predictive / discriminative AI systems.
Historical data encodes discrimination; models can produce disparate impact on protected groups without careful auditing.
Target leakage or train-test contamination inflates evaluation metrics; models fail catastrophically in production.
Data distribution shifts over time; model accuracy degrades silently without proper monitoring and retraining pipelines.
Model memorises training data noise rather than learning true patterns; poor generalisation to unseen data.
Black-box models cannot explain individual predictions; creates regulatory risk in finance, healthcare, and hiring.
Small, carefully crafted input perturbations can flip predictions; represents a security vulnerability in deployed models.
| Limitation | Description |
|---|---|
| Distributional Shift | A model trained on historical data may fail when the real-world data distribution changes (e.g., COVID-19 disrupted all demand forecasting models) |
| Overfitting | Model learns noise in training data; performs poorly on unseen data |
| Underfitting | Model is too simple to capture real patterns; high bias, low variance |
| Data Leakage | Future information accidentally included in training features; inflates performance metrics |
| Correlation vs. Causation | Predictive models learn correlations; they cannot identify causal mechanisms or reason about interventions |
| Benchmark Overfitting | Models optimised for benchmark metrics that do not reflect real business outcomes |
| Label Quality | Garbage labels produce garbage models; incorrect or inconsistent labelling degrades all downstream performance |
| Feature Drift | A feature available at training time disappears or changes definition in production |
| Extrapolation Failure | Models perform poorly on inputs outside their training distribution (out-of-distribution generalisation) |
| Cold Start Problem | Recommendation and personalisation models have no data for new users or items |
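Overfitting is easy to demonstrate concretely. The sketch below (a self-contained toy, not from the source) fits a 1-nearest-neighbour "model" to pure noise — it memorises the training set perfectly yet generalises no better than a coin flip:

```python
import random

random.seed(0)

# Features are pure noise and labels are random: there is NO signal to learn.
X = [[random.random() for _ in range(5)] for _ in range(200)]
y = [random.randint(0, 1) for _ in range(200)]
X_train, y_train = X[:100], y[:100]
X_test, y_test = X[100:], y[100:]

def predict_1nn(x):
    """A 1-nearest-neighbour 'model' — it memorises the training set."""
    dists = [(sum((a - b) ** 2 for a, b in zip(x, xt)), yt)
             for xt, yt in zip(X_train, y_train)]
    return min(dists)[1]

train_acc = sum(predict_1nn(x) == t for x, t in zip(X_train, y_train)) / 100
test_acc = sum(predict_1nn(x) == t for x, t in zip(X_test, y_test)) / 100
print(f"train accuracy = {train_acc:.2f}, test accuracy = {test_acc:.2f}")
# Perfect train accuracy, roughly coin-flip test accuracy: textbook overfitting.
```

This is also why training-set accuracy is never reported as a performance estimate — only the held-out score reflects generalisation.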
| Risk | Description | Real-World Examples |
|---|---|---|
| Historical Bias | Model learns and perpetuates discriminatory patterns present in training data | COMPAS recidivism; Amazon hiring algorithm |
| Sampling Bias | Training data does not represent all demographic groups equally | Facial recognition failing on darker skin tones (MIT Media Lab study) |
| Label Bias | Human labellers encode their own biases into training labels | Sentiment models reflecting annotator demographics |
| Proxy Discrimination | Model uses race-neutral features (zip code, name) that correlate with protected attributes | Redlining via ML; credit scoring via geography |
| Feedback Loops | Biased predictions influence future actions that generate new biased training data | Predictive policing; content recommendation radicalisation |
| Algorithmic Monoculture | When many organisations use the same model, systematic errors affect entire populations | All banks using the same credit model denying the same groups |
| Disparate Impact | Model produces statistically different outcomes for protected demographic groups | Employment screening; loan approval; insurance pricing |
| Metric | Definition | Notes |
|---|---|---|
| Demographic Parity | Equal positive prediction rates across demographic groups | Cannot be simultaneously satisfied with other metrics |
| Equalised Odds | Equal TPR and FPR across groups | Useful when both errors carry consequences |
| Equal Opportunity | Equal true positive rate across groups | Focus on the beneficial outcome |
| Individual Fairness | Similar individuals receive similar predictions | Hard to operationalise; requires similarity metric |
| Calibration by Group | Probabilities are accurate within each demographic group | Required for fair risk scoring |
| Counterfactual Fairness | Prediction would be the same if only protected attribute changed | Theoretical; causal framework |
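The group-level metrics above reduce to comparing simple rates across subgroups. An illustrative sketch (hypothetical data; libraries such as Fairlearn and AIF360 provide production-grade versions):

```python
def group_rates(y_true, y_pred, groups):
    """Positive-prediction rate and true-positive rate per demographic group.
    Equal positive rates -> demographic parity; equal TPRs -> equal opportunity."""
    stats = {}
    for g in set(groups):
        idx = [i for i, gi in enumerate(groups) if gi == g]
        preds = [y_pred[i] for i in idx]
        pos_rate = sum(preds) / len(preds)
        actual_pos = [i for i in idx if y_true[i] == 1]
        tpr = (sum(y_pred[i] for i in actual_pos) / len(actual_pos)
               if actual_pos else 0.0)
        stats[g] = {"positive_rate": pos_rate, "tpr": tpr}
    return stats

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
stats = group_rates(y_true, y_pred, groups)
gap = abs(stats["A"]["positive_rate"] - stats["B"]["positive_rate"])
print(stats, f"parity gap = {gap:.2f}")
```

Note the example deliberately satisfies demographic parity (both groups receive positives at rate 0.5) while violating equal opportunity (TPR of 2/3 for A vs. 1.0 for B) — a concrete instance of the table's warning that the metrics cannot all be satisfied simultaneously.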
| Challenge | Description |
|---|---|
| Black-Box Models | Gradient boosting and deep learning models do not provide inherently interpretable decisions |
| Right to Explanation | GDPR Article 22 requires meaningful explanation for automated decisions affecting individuals |
| Explanation Fidelity | Post-hoc explanations (SHAP, LIME) approximate model behaviour but are not exact |
| Adversarial Manipulation | Explanations can be gamed to appear fair while underlying model remains biased |
| Regulatory Documentation | Model Cards, Datasheets, and risk documentation required by EU AI Act for high-risk models |
| Principle | Description |
|---|---|
| Human-in-the-Loop for High Stakes | Require human review for consequential automated decisions (loans, hiring, medical) |
| Regular Retraining | Prevent model staleness; retrain when drift is detected |
| Fairness Auditing | Regularly evaluate model performance across demographic subgroups |
| Model Cards | Publish model documentation covering intended use, limitations, and performance breakdowns |
| Explainability by Design | Choose interpretable models when possible; reserve black boxes for cases where performance justifies it |
| Conservative Thresholds | Set classification thresholds to minimise the more costly error type |
| Data Provenance | Track the lineage of datasets and labels; document collection methods and known biases |
| Secure Feature Access | Restrict access to sensitive attributes; apply privacy-preserving transformations where needed |
| Robustness Testing | Evaluate sensitivity to outliers, missingness, adversarial perturbations, and out-of-distribution inputs |
| Post-Deployment Monitoring | Monitor drift and error rates continuously; trigger review when metrics degrade |
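The "conservative thresholds" practice can be made concrete: sweep candidate thresholds and pick the one minimising total misclassification cost under an explicit cost asymmetry. A hedged sketch with hypothetical costs and data:

```python
def best_threshold(y_true, y_prob, cost_fp=1.0, cost_fn=10.0):
    """Pick the classification threshold that minimises total error cost
    when error types have asymmetric costs (e.g. fraud: a missed fraud
    case costs far more than a false alarm)."""
    best_t, best_cost = 0.5, float("inf")
    for i in range(1, 100):
        t = i / 100
        cost = 0.0
        for truth, p in zip(y_true, y_prob):
            pred = 1 if p >= t else 0
            if pred == 1 and truth == 0:
                cost += cost_fp
            elif pred == 0 and truth == 1:
                cost += cost_fn
        if cost < best_cost:
            best_t, best_cost = t, cost
    return best_t, best_cost

labels = [0, 0, 0, 0, 1, 0, 1, 1]
probs = [0.05, 0.1, 0.2, 0.4, 0.45, 0.5, 0.7, 0.9]
t, c = best_threshold(labels, probs)
print(f"threshold = {t:.2f}, total cost = {c}")
```

With false negatives costed at 10× false positives, the optimum lands well below the default 0.5 — the model accepts extra false alarms to avoid missing positives.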
Explore how this system type connects to others in the AI landscape:
Analytical AI · Bayesian / Probabilistic AI · Explainable AI (XAI) · Recommendation / Retrieval AI · Generative AI
Essential predictive / discriminative AI terminology — searchable.
| Term | Definition |
|---|---|
| Algorithm | A set of rules or statistical procedure a model follows to learn patterns from data |
| AUC-ROC | Area Under the Receiver Operating Characteristic Curve; measures a classifier's ability to discriminate between classes regardless of threshold |
| Bagging | Bootstrap aggregating; training multiple models on random data subsets and averaging their predictions to reduce variance |
| Batch Prediction | Running a trained model over a large dataset at a scheduled time to produce predictions offline |
| Bias (Statistical) | Systematic error in a model's predictions due to incorrect assumptions in the learning algorithm |
| Bias (Ethical) | Unfair, discriminatory, or unrepresentative patterns in model predictions affecting demographic groups |
| Boosting | Sequential ensemble method where each model corrects the errors of its predecessor; e.g., XGBoost, LightGBM |
| Calibration | The degree to which a model's predicted probabilities match actual observed frequencies |
| CatBoost | Gradient boosting library by Yandex with native categorical feature handling |
| Class Imbalance | A dataset where one class has far fewer examples than another; common in fraud, rare disease, and anomaly detection |
| Classification | A supervised learning task where the model predicts which discrete category an input belongs to |
| Concept Drift | When the statistical relationship between features and the target variable changes over time in production |
| Confusion Matrix | A table showing the counts of true positives, true negatives, false positives, and false negatives for a classifier |
| Cross-Validation | A technique for evaluating model performance by rotating through multiple train/test splits of the data |
| Data Drift | When the statistical distribution of input features in production diverges from the training data distribution |
| Data Leakage | When information from outside the training period or the target itself accidentally enters the feature set, artificially inflating performance |
| Decision Tree | A model that makes predictions by splitting the data into branches based on feature thresholds |
| Discriminative Model | A model that directly learns the boundary between classes — P(Y|X) — rather than modelling the full data distribution |
| Embeddings | Dense, low-dimensional vector representations of discrete entities (users, products, words) that capture semantic similarity |
| Ensemble | A combination of multiple models whose predictions are aggregated to improve accuracy and robustness |
| F1 Score | The harmonic mean of precision and recall; the standard metric for imbalanced classification tasks |
| Feature | An individual measurable property or input variable used by a model to make predictions |
| Feature Engineering | The process of creating, transforming, and selecting input variables to improve model performance |
| Feature Importance | A measure of how much each feature contributes to a model's predictions |
| Feature Store | A centralised infrastructure layer for computing, storing, and serving ML features consistently across training and inference |
| Gradient Descent | An optimisation algorithm that iteratively adjusts model parameters in the direction of the steepest decrease in the loss function |
| Hyperparameter | A model configuration setting (e.g., learning rate, tree depth) set before training and not learned from data |
| Hyperparameter Optimisation (HPO) | The process of systematically searching for the best hyperparameter values to maximise model performance |
| Inference | The process of applying a trained model to new, unseen data to generate predictions |
| Isolation Forest | An unsupervised anomaly detection algorithm that isolates anomalies by randomly partitioning the feature space |
| K-Fold Cross-Validation | Splitting data into k equal folds; training on k-1 folds and validating on the remaining one, rotated k times |
| Label | The known correct output for a training example; also called the target variable or ground truth |
| LightGBM | Microsoft's gradient boosting framework; optimised for speed and large datasets via leaf-wise tree growth |
| Logistic Regression | A linear classification model that outputs class probabilities via a sigmoid or softmax function |
| Loss Function | A mathematical function that measures the difference between the model's predictions and the true labels; minimised during training |
| MAE (Mean Absolute Error) | The average absolute difference between predicted and actual values in a regression task |
| MLOps | Machine Learning Operations; the discipline of deploying, monitoring, and maintaining ML models in production |
| Model Card | A standardised document summarising a model's intended use, performance characteristics, limitations, and ethical considerations |
| Model Drift | The degradation of a deployed model's predictive performance over time as real-world patterns evolve |
| Model Registry | A centralised repository for storing, versioning, and managing trained ML model artefacts |
| Normalisation | Scaling numeric features to a standard range (e.g., [0,1]) so that features with different scales do not dominate the model |
| Overfitting | When a model learns the noise and idiosyncrasies of training data so well that it fails to generalise to new data |
| Precision | The fraction of positive predictions that are actually correct: TP / (TP + FP) |
| Predictive Maintenance | Using ML models to predict when equipment will fail so maintenance can be proactively scheduled |
| Propensity Model | A model that estimates the probability that an individual will take a specific action (purchase, churn, respond) |
| Random Forest | An ensemble of decision trees trained on random subsets of data and features; predictions are aggregated by voting |
| Real-Time Inference | Serving model predictions via an API in response to live requests, typically with sub-100ms latency |
| Recall | The fraction of actual positives correctly identified by the model: TP / (TP + FN); also called sensitivity |
| Regression | A supervised learning task where the model predicts a continuous numeric value |
| Regularisation | Techniques (L1/L2 penalty, dropout) that constrain model complexity to reduce overfitting |
| RMSE (Root Mean Squared Error) | The square root of the average squared prediction error; the standard regression metric |
| Risk Score | A numeric probability or index output by a model representing the likelihood of a specific risk event |
| SHAP (SHapley Additive exPlanations) | A game-theory-based method for explaining how each feature contributed to a specific model prediction |
| SMOTE | Synthetic Minority Over-sampling Technique; generates synthetic minority class examples to address class imbalance |
| Supervised Learning | A learning paradigm where models are trained on (input, label) pairs with the goal of predicting the label for new inputs |
| SVM (Support Vector Machine) | A classification algorithm that finds the maximum-margin hyperplane separating two classes in feature space |
| Target Encoding | Replacing a categorical variable with the mean of the target variable for that category; useful for high-cardinality features |
| Transfer Learning | Fine-tuning a pre-trained model on a new, smaller, domain-specific dataset |
| Underfitting | When a model is too simple to capture the real patterns in data; exhibits high bias and low variance |
| Uplift Modelling | A technique that models the incremental effect of a treatment (e.g., marketing contact) on an individual's behaviour |
| Validation Set | A held-out subset of training data used to evaluate model performance and tune hyperparameters during development |
| Variance (Statistical) | The sensitivity of a model's predictions to fluctuations in the training data; high variance leads to overfitting |
| Walk-Forward Validation | A time-series cross-validation strategy where the model is trained on past data and validated on the next future window |
| XGBoost | Extreme Gradient Boosting; a regularised gradient boosting framework; dominant on tabular ML tasks and Kaggle competitions |
Animation infographics for Predictive / Discriminative AI: a system overview and the full technology stack (Hardware → Compute → Data → Frameworks → Orchestration → Serving → Application).
Detailed reference: regulation and governance of predictive AI.
Predictive AI has been regulated longer than generative and agentic AI because it is most commonly deployed for high-stakes decisioning — credit, hiring, insurance, healthcare, and law enforcement. The 2025–2026 regulatory wave formalises documentation, transparency, and risk-management requirements that many regulated industries already partially implement.
Many predictive models fall under the high-risk classification when deployed in employment, education, credit, insurance, public benefits, healthcare, biometric identification, or law enforcement contexts.
| Requirement Area | Practical Implication for Predictive AI Systems |
|---|---|
| Risk Management System | Formal risk process covering model development, deployment, and incident handling |
| Data Governance | Document training data sources, representativeness, bias mitigation measures, and preprocessing steps |
| Technical Documentation | Model design, intended use, performance characteristics, known limitations, and monitoring plan |
| Logging & Traceability | Maintain logs of inputs, outputs, and key decisions for auditability and incident investigation |
| Transparency | Users must be informed when subject to AI-driven decisions in high-risk contexts |
| Human Oversight | Ability for a human to intervene, override, and review automated decisions |
| Accuracy, Robustness & Cybersecurity | Stress-test and monitor for failures and attacks; ensure resilience across edge cases |
| Post-Market Monitoring | Ongoing monitoring and mandatory reporting of serious incidents post-deployment |
| Domain | Regulatory Drivers | Typical Controls Required |
|---|---|---|
| Credit & Lending | ECOA, FCRA, CFPB guidance (US); FCA (UK); EU consumer credit directive | Explainability, adverse action notices, disparate impact testing, model validation |
| Insurance | State regulators (US); EIOPA (EU); FCA (UK) | Pricing transparency, discrimination constraints, audit trails, actuarial sign-off |
| Healthcare | FDA / EMA / MHRA; HIPAA (US); MDR (EU) | Clinical validation, safety monitoring, privacy safeguards, regulatory clearance for SaMD |
| Employment | EEOC guidance (US); NYC Local Law 144; EU AI Act Title III | Bias audits, candidate notification, documentation of model use |
| Data Privacy | GDPR (EU); CCPA / CPRA (US); LGPD (Brazil); PDPA (various) | Data minimisation, lawful basis for processing, right to explanation, data subject rights |
| Financial Markets | SR 11-7 (US Fed); EBA guidelines (EU); MAS (Singapore) | Model risk management, validation, governance, documentation of assumptions |
| Public Sector | EU AI Act prohibited & high-risk lists; national AI strategies | Mandatory human oversight; prohibition on certain social scoring applications |
The US Federal Reserve's SR 11-7 guidance is the most widely adopted framework for model governance in financial services — and its principles are increasingly applied across industries.
| SR 11-7 Component | What It Requires |
|---|---|
| Model Development | Sound methodology; well-reasoned assumptions; documented design choices |
| Model Validation | Independent validation by a team separate from model developers |
| Conceptual Soundness | Review of the theory and assumptions underlying the model |
| Ongoing Monitoring | Track model performance and stability over time; flag material degradation |
| Outcomes Analysis | Compare model predictions to actual outcomes; assess accuracy and discrimination |
| Benchmarking | Compare model against alternative approaches or challenger models |
| Model Inventory | Maintain a register of all models in use, their status, risk tier, and validation history |
| Toolkit | Provider | Capabilities |
|---|---|---|
| Fairlearn | Microsoft (open-source) | Fairness assessment and mitigation for classification and regression models |
| AI Fairness 360 (AIF360) | IBM (open-source) | 70+ fairness metrics; 10+ bias mitigation algorithms |
| What-If Tool | Google (open-source) | Interactive exploration of model behaviour and fairness across subgroups |
| SHAP | Open-source | Model-agnostic feature attribution; supports all major ML frameworks |
| LIME | Open-source | Local interpretable model-agnostic explanations; approximates model locally |
| InterpretML | Microsoft (open-source) | Explainable Boosting Machines (EBM); glass-box model family |
| Responsible AI Dashboard | Microsoft (Azure ML) | Integrated fairness, explainability, error analysis, and causal analysis |
| Fiddler AI | SaaS | Enterprise model monitoring, explainability, and fairness auditing |
| Arthur AI | SaaS | Bias detection, performance monitoring, explainability for regulated industries |
| Credo AI | SaaS | AI governance platform; policy-as-code; compliance evidence generation |
| Standard | Description | Who Uses It |
|---|---|---|
| Model Cards | Standardised model documentation covering intended use, performance, limitations, and ethical considerations | Google, Hugging Face, industry-wide |
| Datasheets for Datasets | Documentation standard for training datasets covering motivation, composition, and known biases | Academic and enterprise ML teams |
| FactSheets | IBM's structured transparency document for AI services | IBM customers and enterprise deployments |
| AI BoM (Bill of Materials) | Inventory of all components, data sources, and dependencies in an AI system | Enterprise AI governance; supply chain compliance |
| EU AI Act Technical File | Mandatory documentation package for high-risk AI systems under the EU AI Act | EU market deployers of high-risk AI |
Detailed reference: MLOps for predictive models.
MLOps (Machine Learning Operations) is the discipline of deploying, monitoring, and maintaining predictive models in production reliably and at scale.
| Pattern | Description | Best For |
|---|---|---|
| Real-Time Inference (REST API) | Model served as an API endpoint; responds to requests in milliseconds | Fraud detection, credit scoring, recommendation |
| Batch Prediction | Model runs over a dataset at scheduled intervals; results stored | Daily churn scores, overnight demand forecasts |
| Streaming Inference | Model applied to events in real time as they arrive (Kafka, Kinesis) | Transaction monitoring, log anomaly detection |
| Edge Deployment | Model deployed on device; no server round-trip required | Mobile apps, IoT sensors, autonomous vehicles |
| Embedded ML | Model compiled into a software product or firmware | Spam filters in email clients, ABS in vehicles |
| Shadow Mode | New model runs in parallel with production; results compared without serving to users | Safe A/B testing before full cutover |
| A/B Testing | Route a portion of traffic to new model; compare business metrics | Gradual model rollout with statistical confidence |
| Canary Deployment | Gradually increase traffic to new model while monitoring for degradation | Risk-managed production transitions |
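The shadow-mode pattern above can be sketched in a few lines. `prod_model` and `candidate_model` here are hypothetical stand-ins, not a real serving API; the point is that the candidate scores every request while only the production answer is returned:

```python
import random

# Hypothetical stand-ins for the production and candidate models.
def prod_model(x):      return 1 if x > 0.5 else 0
def candidate_model(x): return 1 if x > 0.4 else 0

shadow_log = []

def serve(x):
    """Shadow mode: score with both models, return only prod's answer.
    The logged pairs are compared offline before any cutover."""
    prod_pred = prod_model(x)
    shadow_log.append((x, prod_pred, candidate_model(x)))
    return prod_pred

random.seed(0)
requests = [random.random() for _ in range(1000)]
responses = [serve(x) for x in requests]

agreement = sum(p == c for _, p, c in shadow_log) / len(shadow_log)
print(f"prod/candidate agreement: {agreement:.1%}")  # roughly 90% in this toy setup
```

Users never see the candidate's output, so a badly behaved new model costs nothing but log storage — which is exactly why shadow mode precedes A/B testing in the table above.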
| Tool | Type | Highlights |
|---|---|---|
| TensorFlow Serving | Open-source | Production-grade serving for TF models; gRPC + REST |
| TorchServe | Open-source | PyTorch's official model server |
| Triton Inference Server | Open-source (NVIDIA) | Multi-framework; GPU-optimised; high-throughput batching |
| BentoML | Open-source / SaaS | Framework-agnostic model serving; easy containerisation |
| Seldon Core | Open-source | Kubernetes-native model serving; explainability integration |
| KServe | Open-source | Kubernetes-based serverless inference; multi-model serving |
| Ray Serve | Open-source | Distributed inference; Python-native; composable pipelines |
| FastAPI | Open-source | Lightweight REST API framework; common for custom serving |
| Monitoring Type | What It Detects | Tools |
|---|---|---|
| Data Drift | Input feature distributions shift from training data | Evidently, NannyML, WhyLogs, Arize |
| Concept Drift | Relationship between inputs and output changes over time | Evidently, NannyML, Fiddler |
| Label / Prediction Drift | Output distributions shift unexpectedly | Arize Phoenix, Fiddler, Arthur AI |
| Data Quality | Missing values, outliers, schema violations in production data | Great Expectations, Monte Carlo, Soda |
| Model Performance Decay | Accuracy, precision, or AUC drops below threshold | MLflow, Arize, Fiddler, SageMaker Monitor |
| Infrastructure Monitoring | Latency, throughput, error rates of serving infrastructure | Prometheus, Grafana, Datadog |
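Data drift (first row above) is often quantified with the Population Stability Index (PSI). A minimal pure-Python sketch, using the common rule-of-thumb thresholds (below 0.1 stable, 0.1–0.25 moderate drift, above 0.25 major drift):

```python
import math

def psi(expected, actual, bins=10):
    """Population Stability Index between a reference sample (training
    data) and a live sample, using equal-width bins over the reference."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0

    def bucket(xs):
        counts = [0] * bins
        for x in xs:
            i = min(int((x - lo) / width), bins - 1)
            counts[max(i, 0)] += 1
        # small epsilon avoids log(0) for empty buckets
        return [max(c / len(xs), 1e-6) for c in counts]

    e, a = bucket(expected), bucket(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))

train_scores = [0.1 * i for i in range(100)]        # reference distribution
live_same    = [0.1 * i for i in range(100)]        # no drift
live_shift   = [0.1 * i + 5.0 for i in range(100)]  # shifted distribution

print(psi(train_scores, live_same))          # 0.0 — identical distributions
print(psi(train_scores, live_shift) > 0.25)  # True — major drift, trigger alert
```

Libraries such as Evidently and NannyML compute PSI (among other statistics) out of the box; this sketch only shows the mechanics.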
| Platform | Type | Highlights |
|---|---|---|
| MLflow | Open-source | Experiment tracking + model registry + deployment; industry standard |
| Kubeflow | Open-source | Kubernetes-native ML pipelines; production-grade orchestration |
| ZenML | Open-source | Clean MLOps abstraction; framework-agnostic pipelines |
| Metaflow (Netflix) | Open-source | Python-first ML workflow management; data science-friendly |
| Prefect / Airflow | Open-source | General workflow orchestration used for ML pipelines |
| Weights & Biases | SaaS | Experiment tracking + artifact management + model registry |
| Arize AI | SaaS | Model observability; drift detection; explainability |
| Fiddler AI | SaaS | Enterprise ML monitoring; explainability; bias detection |
| Evidently AI | Open-source / SaaS | Data and model monitoring reports; drift analysis |
| NannyML | Open-source / SaaS | Performance monitoring without ground truth labels |
| Arthur AI | SaaS | ML monitoring; bias detection; explainability for regulated industries |
┌─────────────────────────────────────────────────────────────────────┐
│ MLOPS LIFECYCLE │
│ │
│ DATA EXPERIMENTATION DEPLOYMENT │
│ ───────────── ───────────────── ────────────── │
│ Ingest → Train → Evaluate → Package → Serve │
│ Validate → Track → Compare → via API / Batch │
│ Feature Store Register Model / Edge │
│ │
│ MONITORING GOVERNANCE RETRAINING │
│ ───────────── ───────────────── ────────────── │
│ Drift → Audit logs → Trigger → │
│ Performance → Compliance → Retrain → │
│ Alerts Lineage Validate → Redeploy │
└─────────────────────────────────────────────────────────────────────┘
AutoML automates the most labour-intensive steps of the ML pipeline — algorithm selection, feature engineering, and hyperparameter tuning — making predictive AI accessible to non-specialist practitioners.
| Capability | What It Automates |
|---|---|
| Algorithm Selection | Tests multiple algorithms and selects the best performer |
| Feature Engineering | Automatically creates, selects, and transforms features |
| Hyperparameter Optimisation | Searches for optimal model parameters |
| Ensemble Construction | Combines the best models into an ensemble |
| Model Evaluation | Runs cross-validation and computes performance metrics |
| Neural Architecture Search (NAS) | Discovers optimal DNN architectures automatically |
| Data Preprocessing | Handles missing values, encoding, and normalisation automatically |
| Platform | Type | Highlights |
|---|---|---|
| H2O AutoML | Open-source / SaaS | Widely used open-source AutoML; commercial sibling Driverless AI; strong enterprise adoption |
| Auto-sklearn | Open-source | Bayesian optimisation over scikit-learn pipelines; academic standard |
| TPOT | Open-source | Genetic programming for pipeline optimisation; exports clean Python code |
| AutoGluon | Open-source (Amazon) | State-of-the-art AutoML; strong stacking; competitive with manual models |
| Google Vertex AutoML | SaaS | No-code tabular, image, text, and video ML on Google Cloud |
| AWS SageMaker Autopilot | SaaS | AutoML with full pipeline visibility; explainability built-in |
| Azure Automated ML | SaaS | Enterprise AutoML with responsible AI integration |
| DataRobot | SaaS | Market-leading enterprise AutoML; MLOps; time-to-value focus |
| Dataiku | SaaS | Collaborative data science and ML platform; low-code/pro-code |
| RapidMiner | SaaS | Visual ML pipeline builder; no-code + Python integration |
| Obviously AI | SaaS | One-click predictive models for business users; natural language interface |
| Pecan AI | SaaS | Automated predictive analytics for business metrics |
A rapidly emerging category: large pre-trained models that forecast without task-specific training.
| Model | Organisation | Context Length | Highlights |
|---|---|---|---|
| Chronos | Amazon | 512 time steps | Tokenises time-series; transformer-based; strong zero-shot |
| TimesFM | Google | 512 time steps | Decoder-only transformer; strong zero-shot forecasting |
| Moirai | Salesforce | 1–5,000 steps | Any-variate; any-frequency; universal forecasting |
| Lag-Llama | Community | 1,024 steps | Open-source; univariate probabilistic forecasting |
| MOMENT | CMU | 512 time steps | Multi-task time-series foundation model |
Feature engineering is the process of transforming raw data into the numeric representations that ML models learn from. It remains one of the most impactful steps in the predictive AI pipeline.
| Feature Type | Description | Handling Technique |
|---|---|---|
| Numeric | Continuous or discrete numbers | Normalisation, scaling, log transform |
| Categorical (Low Cardinality) | Variables with few categories (e.g., country) | One-hot encoding, label encoding |
| Categorical (High Cardinality) | Variables with many categories (e.g., product ID) | Target encoding, embeddings, hashing |
| Ordinal | Ordered categories (e.g., education level) | Integer encoding preserving order |
| Datetime | Timestamps and date values | Extract day, month, year, hour, day-of-week, cyclical encoding |
| Text | Free-form strings | TF-IDF, word2vec, BERT embeddings |
| Geospatial | Latitude, longitude, region | Geohashing, distance features, clustering |
| Interaction Features | Products or ratios of two features | Manual engineering or automated discovery |
| Lag Features | Prior time period values as features | For time-series; critical for forecasting |
| Aggregation Features | Grouped statistics (user's last 30-day average spend) | Window functions; feature stores |
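The cyclical encoding mentioned in the Datetime row maps a periodic value onto the unit circle, so the model sees hour 23 and hour 0 as neighbours rather than 23 units apart. A minimal sketch:

```python
import math

def cyclical_encode(value, period):
    """Map a cyclic value (hour, day-of-week, month) to (sin, cos)
    coordinates on the unit circle."""
    angle = 2 * math.pi * value / period
    return math.sin(angle), math.cos(angle)

h23 = cyclical_encode(23, 24)
h0  = cyclical_encode(0, 24)
h12 = cyclical_encode(12, 24)

# As raw integers, hours 23 and 0 are maximally far apart; encoded,
# they are adjacent, while noon sits on the opposite side of the circle.
print(round(math.dist(h23, h0), 3))   # small (~0.26) — adjacent hours
print(round(math.dist(h0, h12), 3))   # 2.0 — opposite ends of the day
```

The same transform applies to day-of-week (period 7) and month (period 12).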
| Technique | Description | When to Use |
|---|---|---|
| Normalisation (Min-Max) | Scale features to [0, 1] range | Neural networks; distance-based models |
| Standardisation (Z-score) | Transform to zero mean, unit variance | Logistic Regression, SVM, KNN |
| Log Transformation | Reduce skewness of heavily right-skewed distributions | Revenue, transaction amounts, counts |
| Binning / Discretisation | Convert continuous variable to ordinal buckets | Age groups, income bands, risk tiers |
| Polynomial Features | Create squared and interaction terms | Capture non-linearity in linear models |
| Target Encoding | Replace category with mean of target variable | High-cardinality categoricals in GBT |
| PCA / Dimensionality Reduction | Compress many correlated features into fewer components | High-dimensional data; reduce noise |
| Embeddings | Dense vector representations of categorical entities | Users, products, documents in deep models |
| Window Statistics | Rolling mean, max, std over time window | Time-series; fraud; user behaviour |
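Target encoding (in the table above) replaces a category with a smoothed mean of the target. A sketch of the standard smoothing formula — `smoothing` shrinks rare categories toward the global prior to curb overfitting; in practice the means must be computed on training folds only, or the encoding leaks the target:

```python
from collections import defaultdict

def target_encode(categories, targets, smoothing=10.0):
    """Smoothed target encoding: enc(c) = (sum_c + m * prior) / (n_c + m)."""
    prior = sum(targets) / len(targets)            # global target mean
    sums, counts = defaultdict(float), defaultdict(int)
    for c, y in zip(categories, targets):
        sums[c] += y
        counts[c] += 1
    return {c: (sums[c] + smoothing * prior) / (counts[c] + smoothing)
            for c in counts}

cats    = ["a"] * 100 + ["b"] * 2                  # "b" is a rare category
targets = [1] * 90 + [0] * 10 + [1, 1]             # a: 90% positive, b: 100%

enc = target_encode(cats, targets)
# frequent "a" keeps roughly its own mean; rare "b" is pulled from its
# raw mean of 1.0 back toward the global prior (~0.90)
print(round(enc["a"], 3), round(enc["b"], 3))
```

Without smoothing, a category seen twice with two positive labels would be encoded as a perfect 1.0 — a recipe for overfitting on high-cardinality features.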
| Challenge | Approach |
|---|---|
| Missing Values | Impute with mean/median/mode; model-based imputation; flag missingness as a feature |
| Class Imbalance | Oversample minority (SMOTE), undersample majority, adjust class weights, use AUC not accuracy |
| Outliers | Detect via IQR/Z-score; clip, transform, or flag as a separate category |
| Data Leakage | Ensure no future information is used as a feature; strict train/test temporal splits |
| Concept Drift | Monitor feature distributions in production; trigger retraining when distributions shift |
| Label Noise | Clean labels via majority vote, label smoothing, or confident learning |
| Duplicates | Deduplicate before train/test split to avoid data contamination |
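The data-leakage row above deserves emphasis for time-ordered data: a random split lets the model train on the future. A leakage-safe split holds out the most recent slice instead (sketch; the timestamp field name `ts` is an assumption for illustration):

```python
def temporal_split(rows, test_frac=0.2):
    """Hold out the most recent fraction of time-ordered rows.
    A random split would train on the future and overstate test
    performance — the leakage failure mode described above."""
    rows = sorted(rows, key=lambda r: r["ts"])
    cut = int(len(rows) * (1 - test_frac))
    return rows[:cut], rows[cut:]

rows = [{"ts": t, "feature": t % 7} for t in range(100)]
train, test = temporal_split(rows)

print(len(train), len(test))  # 80 20
# every training row strictly precedes every test row
print(max(r["ts"] for r in train) < min(r["ts"] for r in test))  # True
```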
Feature stores are the infrastructure layer for managing, storing, and serving features consistently across training and inference.
| Tool | Type | Highlights |
|---|---|---|
| Feast | Open-source | Most widely used open-source feature store; online + offline serving |
| Tecton | SaaS | Enterprise feature platform; real-time streaming features |
| Databricks Feature Store | SaaS | Native integration with MLflow and Delta Lake |
| Vertex AI Feature Store | SaaS | Google's managed feature store; scalable low-latency serving |
| AWS SageMaker Feature Store | SaaS | Online and offline store; integrated with SageMaker |
| Hopsworks | Open-source / SaaS | Full-stack feature store with model registry |
The foundational paradigm for all predictive AI.
| Aspect | Detail |
|---|---|
| Core Mechanism | Learn from (input, label) pairs; minimise prediction error on known outcomes |
| Requirement | Labelled training data — each example must have a known correct output |
| Training Process | Iterate over data; compute loss between predictions and labels; update weights via gradient descent or tree-splitting |
| Used For | All classification, regression, risk scoring, and ranking tasks |
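The training process in the table can be made concrete with the smallest possible example: logistic regression on a 1-D toy dataset, fitted by full-batch gradient descent on the log loss. This is an illustrative sketch, not production code:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# (input, label) pairs — the label is 1 when x >= 0.5
data = [(x / 10, 1 if x >= 5 else 0) for x in range(10)]

w, b, lr = 0.0, 0.0, 0.5
for _ in range(2000):                     # full-batch gradient descent
    gw = gb = 0.0
    for x, y in data:
        err = sigmoid(w * x + b) - y      # dLogLoss/dz for this example
        gw += err * x
        gb += err
    w -= lr * gw / len(data)              # update weights to reduce loss
    b -= lr * gb / len(data)

preds = [1 if sigmoid(w * x + b) > 0.5 else 0 for x, _ in data]
acc = sum(p == y for p, (_, y) in zip(preds, data)) / len(data)
print(f"train accuracy: {acc:.0%}")
```

Every supervised learner in this document — from logistic regression to gradient-boosted trees to deep networks — follows this same loop in spirit: predict, measure the loss against known labels, adjust parameters, repeat.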
| Aspect | Detail |
|---|---|
| Core Mechanism | Use a small amount of labelled data plus a large amount of unlabelled data |
| Why It Matters | Labelling is expensive; unlabelled data is abundant |
| Techniques | Self-training, label propagation, pseudo-labelling, consistency regularisation |
| Used For | Medical imaging (few labelled scans), NLP text classification, fraud detection |
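Pseudo-labelling (from the techniques row above) can be sketched with a deliberately tiny 1-D "model" — the midpoint between class means as a decision threshold. The learner and the confidence rule are illustrative stand-ins; real systems use a proper classifier and its predicted probabilities:

```python
labelled   = [(0.1, 0), (0.9, 1)]           # expensive hand labels
unlabelled = [i / 20 for i in range(21)]    # abundant unlabelled inputs

def train(pairs):
    """Toy 1-D learner: threshold at the midpoint of the class means."""
    mean = lambda xs: sum(xs) / len(xs)
    m0 = mean([x for x, y in pairs if y == 0])
    m1 = mean([x for x, y in pairs if y == 1])
    return (m0 + m1) / 2

thr = train(labelled)
# Self-training step: pseudo-label only the points the current model is
# confident about (far from the decision boundary), then retrain on both.
confident = [(x, int(x >= thr)) for x in unlabelled if abs(x - thr) > 0.2]
thr2 = train(labelled + confident)
print(len(confident), round(thr2, 3))
```

The retrained model now rests on far more examples than the two hand labels, which is the entire economic argument for semi-supervised learning.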
| Aspect | Detail |
|---|---|
| Core Mechanism | Fine-tune a large pre-trained model on a small domain-specific labelled dataset |
| Why It Works | Pre-trained models encode rich general representations; domain data adapts the final layers |
| Key Benefit | Achieves strong performance with orders of magnitude less labelled data |
| Examples | BERT fine-tuned for contract classification; ImageNet-pre-trained CNN for medical imaging |
| Method | How It Works | Best For |
|---|---|---|
| Grid Search | Exhaustively try all combinations in a specified grid | Small parameter spaces; sanity checks |
| Random Search | Randomly sample parameter combinations | More efficient than grid search; general use |
| Bayesian Optimisation (Optuna, Hyperopt) | Build a surrogate model of performance; intelligently explore space | Complex, expensive search spaces |
| Successive Halving / Hyperband | Early-stop underperforming configurations; allocate budget to promising ones | Large-scale ML at resource-constrained budgets |
| Neural Architecture Search (NAS) | Automatically discover optimal model architectures | Deep learning; automated model design |
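Random search (second row above) is simple enough to sketch directly. Here `validation_score` is a hypothetical stand-in — a real run would train and evaluate a model for each sampled configuration:

```python
import random

def validation_score(lr, depth):
    """Stand-in objective; pretend higher is better. A real search would
    fit a model with these hyperparameters and return a CV metric."""
    return -(lr - 0.1) ** 2 - 0.01 * (depth - 6) ** 2

random.seed(42)
trials = [(random.uniform(0.001, 0.5), random.randint(2, 12))
          for _ in range(50)]                    # 50 random configurations
best = max(trials, key=lambda t: validation_score(*t))
print(f"best lr={best[0]:.3f}, depth={best[1]}")
```

Bayesian optimisers such as Optuna replace the blind sampling with a surrogate model of `validation_score`, spending each expensive evaluation where improvement is most likely.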
| Method | How It Works | When to Use |
|---|---|---|
| Bagging | Train multiple models on random data subsets; aggregate predictions (e.g., Random Forest) | Reduce variance; avoid overfitting |
| Boosting | Train models sequentially; each corrects predecessor errors (e.g., XGBoost) | Reduce bias; high accuracy on tabular data |
| Stacking | Use predictions of base models as features for a meta-model | Kaggle competitions; maximum accuracy |
| Voting | Combine predictions from diverse models by majority vote or averaging | Robust, easy to implement ensemble |
| Blending | Similar to stacking but uses a holdout set rather than cross-validation | Production ensembles |
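Hard voting (the Voting row above) is the simplest ensemble to implement — each base model casts one vote per example and the majority wins:

```python
from collections import Counter

def majority_vote(predictions):
    """Hard-voting ensemble: predictions is a list of per-model
    prediction lists; returns the majority label per example."""
    return [Counter(votes).most_common(1)[0][0]
            for votes in zip(*predictions)]

# Three hypothetical base classifiers scoring the same five examples.
model_a = [1, 0, 1, 1, 0]
model_b = [1, 1, 1, 0, 0]
model_c = [0, 0, 1, 1, 0]

print(majority_vote([model_a, model_b, model_c]))  # [1, 0, 1, 1, 0]
```

The ensemble corrects each model's isolated mistakes (note how the final answer disagrees with each base model on at least one example), which is why diversity among base models matters more than any single model's accuracy.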
| Technique | Description | When to Use |
|---|---|---|
| SMOTE | Synthetic Minority Over-sampling Technique — generates synthetic minority examples | Moderate imbalance; tabular data |
| Class Weights | Weight minority-class errors more heavily in the training loss | All gradient-based models; simple and effective |
| Undersampling | Reduce majority class to balance dataset | Very high imbalance; large majority class |
| Threshold Adjustment | Lower classification threshold to increase minority recall | When false negatives are more costly |
| Ensemble for Imbalance (BalancedBagging) | Bagging with balanced sampling per tree | Random Forest on imbalanced data |
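The class-weight heuristic scikit-learn exposes as `class_weight='balanced'` — w_c = n_samples / (n_classes × n_c) — is easy to compute directly:

```python
from collections import Counter

def balanced_class_weights(labels):
    """'balanced' heuristic: w_c = n / (k * n_c), so errors on the
    minority class cost proportionally more in the training loss."""
    n, counts = len(labels), Counter(labels)
    k = len(counts)
    return {c: n / (k * nc) for c, nc in counts.items()}

labels = [0] * 950 + [1] * 50          # 5% positive class
weights = balanced_class_weights(labels)
print(weights)                          # {0: ~0.53, 1: 10.0}
```

A minority-class mistake now costs roughly 19× a majority-class one, pushing the decision boundary toward higher minority recall without resampling the data.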
Predictive / Discriminative AI is the branch of artificial intelligence focused on systems that learn to map inputs to outputs — classifying what something is, scoring the probability of an outcome, ranking items by relevance, or forecasting future values — based entirely on patterns extracted from historical labelled data.
This is the most widely deployed form of AI in production today. It powers spam filters, credit scores, fraud detection, medical diagnosis, demand forecasting, recommender systems, and the vast majority of "AI in the enterprise" that predated the generative AI era.
| Dimension | Detail |
|---|---|
| Core Capability | Discriminates — learns the boundary between classes or the mapping from inputs to outputs |
| How It Learns | Supervised or semi-supervised learning on labelled datasets; minimises a loss function |
| What It Produces | Labels, probabilities, scores, rankings, or numeric forecasts — not new content |
| Key Differentiator | Predicts from existing patterns; it does not create, reason autonomously, or generate |
| AI Type | What It Does | Example |
|---|---|---|
| Predictive / Discriminative AI | Classifies, scores, and forecasts from patterns in historical data | Fraud score, churn probability, demand forecast |
| Agentic AI | Pursues goals autonomously using tools, memory, and planning | Research agent, coding agent |
| Analytical AI | Extracts insights and explanations from existing data | Dashboard, root-cause analysis |
| Autonomous AI (Non-Agentic) | Operates independently within fixed boundaries without human input | Autopilot, auto-scaling, algorithmic trading |
| Bayesian / Probabilistic AI | Reasons under uncertainty using probability distributions | Clinical trial analysis, A/B testing, risk modelling |
| Cognitive / Neuro-Symbolic AI | Combines neural learning with symbolic reasoning | LLM + knowledge graph, physics-informed neural net |
| Conversational AI | Manages multi-turn dialogue between humans and machines | Customer service chatbot, voice assistant |
| Evolutionary / Genetic AI | Optimises solutions through population-based search inspired by natural selection | Neural architecture search, logistics scheduling |
| Explainable AI (XAI) | Makes AI decisions understandable to humans | SHAP explanations, LIME, Grad-CAM |
| Generative AI | Creates new original content from learned distributions | Write an essay, generate an image |
| Multimodal Perception AI | Fuses vision, language, audio, and other modalities | GPT-4o processing image + text, AV sensor fusion |
| Optimisation / Operations Research AI | Finds optimal solutions to constrained mathematical problems | Vehicle routing, supply chain planning, scheduling |
| Physical / Embodied AI | Acts in the physical world through sensors and actuators | Autonomous vehicle, robot arm, drone |
| Privacy-Preserving AI | Trains and runs AI without exposing raw data | Federated hospital models, differential privacy |
| Reactive AI | Responds to current input with no learning or memory | Chess engine, rule-based spam filter |
| Recommendation / Retrieval AI | Surfaces relevant items from large catalogues based on user signals | Netflix suggestions, Google Search, Spotify playlists |
| Reinforcement Learning AI | Learns optimal behaviour from reward signals via trial and error | AlphaGo, robotic locomotion, RLHF |
| Scientific / Simulation AI | Solves scientific problems and models physical systems | AlphaFold, climate simulation, molecular dynamics |
| Symbolic / Rule-Based AI | Reasons over explicit rules and knowledge to derive conclusions | Medical expert system, legal reasoning engine |
Key Distinction from Generative AI: Generative AI produces new content that did not exist. Predictive AI evaluates what already exists — assigning it to a category, estimating its probability, or forecasting what will happen next.
Key Distinction from Analytical AI: Analytical AI answers "what does this data mean?" and surfaces insights. Predictive AI answers "what will happen?" or "which category does this belong to?" — outputting a specific, actionable prediction.