A comprehensive interactive exploration of Physical AI — the sense-act loop, 8-layer stack, robot morphologies, sim-to-real transfer, benchmarks, market data, and more.
The continuous feedback cycle at the heart of every embodied AI system — sense the environment, build a world model, plan actions, execute, observe outcomes, and repeat.
Each stage in the sense-act loop feeds into the next, forming a continuous cycle that allows embodied AI to interact with and learn from its physical environment.
┌─────────────────────────────────────────────────────────────────────────┐
│                          PHYSICAL AI PIPELINE                           │
│                                                                         │
│  1. SENSE          2. PERCEIVE         3. PLAN            4. ACT        │
│  ──────────────    ──────────────      ──────────────     ────────      │
│  Raw sensor data:  Understand the      Generate motion    Execute       │
│  cameras, LiDAR,   scene: objects,     plans, paths,      physical      │
│  IMU, force/       poses, semantics,   trajectories,      actions       │
│  torque, GPS,      maps, free space    grasps, and        via           │
│  radar, tactile                        collision-free     motors,       │
│                                        actions            joints,       │
│                                                           wheels        │
│                                                                         │
│  ──── REAL-TIME LOOP — SAFETY CONSTRAINTS — IRREVERSIBLE ACTIONS ────   │
└─────────────────────────────────────────────────────────────────────────┘
| Step | What Happens |
|---|---|
| Sensing | Raw data collected from cameras, LiDAR, radar, IMU, GPS, force/torque sensors, tactile sensors, microphones |
| Perception | Sensor fusion, object detection, semantic segmentation, depth estimation, 3D scene reconstruction, SLAM |
| World Modelling | Build and maintain an internal representation of the environment — occupancy grids, scene graphs, physics models |
| Task Planning | High-level reasoning about what to do — sequence of sub-goals, task decomposition |
| Motion Planning | Generate collision-free trajectories that achieve the task plan — path planning, grasp planning |
| Control | Low-level motor commands to follow the planned trajectory — PID, torque control, compliance control |
| Execution | Physical actuators move the robot, vehicle, or drone through the real world |
| Feedback | Sensory feedback closes the loop — correcting for errors, disturbances, and unexpected obstacles |
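In code, the loop above reduces to a simple skeleton. The sketch below is illustrative only; `Perceiver`, `Planner`, `Controller`, and the sensor/actuator objects are hypothetical placeholders, not a real robot API:

```python
import time

class SenseActLoop:
    """Minimal sense-act loop skeleton; collaborators are hypothetical."""

    def __init__(self, sensors, perceiver, planner, controller, actuators, hz=50):
        self.sensors, self.perceiver = sensors, perceiver
        self.planner, self.controller = planner, controller
        self.actuators = actuators
        self.period = 1.0 / hz  # loop period, e.g. 20 ms at 50 Hz

    def run(self):
        while True:
            t0 = time.monotonic()
            raw = {s.name: s.read() for s in self.sensors}   # 1. SENSE
            world = self.perceiver.update(raw)               # 2. PERCEIVE + world model
            plan = self.planner.replan(world)                # 3. PLAN
            cmd = self.controller.step(plan, world)          # 4. ACT: low-level command
            self.actuators.apply(cmd)
            # Feedback closes the loop: the next iteration senses the outcome.
            time.sleep(max(0.0, self.period - (time.monotonic() - t0)))
```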
| Parameter | What It Controls |
|---|---|
| Sensor Update Rate | Frequency of sensory data acquisition — typically 10–100 Hz for cameras, 100–1000 Hz for IMU/force |
| Control Loop Frequency | How often motor commands are updated — 100–1000 Hz for real-time control |
| Planning Horizon | How far ahead the system plans — seconds (local avoidance) to minutes (route planning) |
| Safety Margin | Minimum clearance from obstacles, humans, and hazards |
| Payload Capacity | Maximum weight the system can carry or manipulate |
| Degrees of Freedom (DoF) | Number of independent joints or axes of motion — 6 DoF for a standard robot arm, 30+ for a humanoid |
| Localisation Accuracy | Precision of the system's knowledge of its own position — centimetre to sub-millimetre |
| Latency Budget | Maximum allowable time from sensing to actuation — milliseconds for safety-critical actions |
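To make the control-loop-frequency and latency figures concrete, here is a minimal fixed-rate PID loop in Python. It is a sketch: `read_position` and `send_torque` are hypothetical hardware hooks, and a production controller would add anti-windup, filtering, and safety limits:

```python
import time

class PID:
    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error, dt):
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

def control_loop(read_position, send_torque, target, hz=500, duration=1.0):
    """Run a 500 Hz loop: sense error, compute PID output, actuate, sleep
    out the remainder of the 2 ms period to hold the rate."""
    pid = PID(kp=20.0, ki=0.5, kd=1.0)
    dt = 1.0 / hz
    for _ in range(int(duration * hz)):
        t0 = time.monotonic()
        error = target - read_position()
        send_torque(pid.step(error, dt))
        time.sleep(max(0.0, dt - (time.monotonic() - t0)))
```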
Boston Dynamics' Atlas robot can perform parkour and backflips and navigate complex terrain autonomously.
The autonomous vehicle industry has logged over 100 million miles of real-world testing data.
Surgical robots like da Vinci have assisted in over 12 million procedures worldwide.
The full technology stack for Physical / Embodied AI, from hardware platform up to fleet orchestration.
| Layer | What It Covers |
|---|---|
| 1. Hardware / Body | Actuators (motors, servos, hydraulics), sensors (cameras, LiDAR, IMU, tactile), compute (edge GPU, FPGA), power (batteries, fuel cells) |
| 2. Low-Level Control | Motor controllers, PID loops, torque control, joint-level safety limits |
| 3. Perception | Object detection, semantic segmentation, depth estimation, SLAM, 3D reconstruction, sensor fusion |
| 4. World Model | Occupancy grids, scene graphs, physics simulators, digital twins |
| 5. Planning & Decision | Task planning, motion planning, grasp planning, behaviour trees, state machines |
| 6. Learning & Adaptation | RL policies, imitation learning, foundation model inference, online adaptation |
| 7. Communication | Robot-to-robot coordination, cloud connectivity, fleet management, V2X for vehicles |
| 8. Safety & Compliance | Emergency stops, collision avoidance, functional safety (ISO 13849), regulatory compliance |
The primary morphological categories of Physical AI systems, each designed for distinct environments and tasks.
| Aspect | Detail |
|---|---|
| Definition | Fixed or mobile robots operating in manufacturing environments for assembly, welding, painting, and material handling |
| Form Factors | 6-axis articulated arms, SCARA robots, delta robots, gantry systems |
| Key Vendors | Fanuc, ABB, KUKA, Yaskawa, Universal Robots (cobots) |
| Global Installed Base (2024) | ~4.2 million industrial robots (IFR — International Federation of Robotics) |
| Trends | Cobots (collaborative robots), AI-powered quality inspection, flexible manufacturing |
| Aspect | Detail |
|---|---|
| Definition | Vehicles that navigate and drive without human intervention, from driver assistance to full autonomy |
| SAE Levels | L0 (no automation) → L1 (driver assistance) → L2 (partial) → L3 (conditional) → L4 (high) → L5 (full) |
| Key Players | Waymo (L4 robotaxi), Tesla (L2+ FSD), Cruise, Zoox, Aurora, Mercedes (L3 highway), Mobileye |
| Sensor Suite | Cameras, LiDAR, radar, ultrasonic, GPS/GNSS, HD maps |
| Key Challenge | Long-tail edge cases — rare, unexpected scenarios that the system must handle safely |
| Aspect | Detail |
|---|---|
| Definition | Robots with human-like form factors — bipedal locomotion, arms, hands, head — designed to operate in human environments |
| Key Players | Tesla Optimus, Figure (Figure 02), Boston Dynamics Atlas, Agility Robotics Digit, 1X (NEO), Unitree |
| Why Humanoid | Human environments (stairs, doors, tools, workspaces) are designed for the human body; a humanoid form factor can operate without infrastructure changes |
| Challenges | Bipedal balance, dexterous manipulation, power/energy efficiency, cost |
| Status (2026) | Early deployment in warehouses and manufacturing; rapid progress in locomotion and manipulation |
| Aspect | Detail |
|---|---|
| Definition | Unmanned aerial vehicles with AI-powered autonomous flight, navigation, and task execution |
| Form Factors | Multirotor (quadcopter), fixed-wing, VTOL (vertical take-off and landing), hybrid |
| Use Cases | Aerial inspection, delivery, agriculture (crop spraying), mapping/survey, search & rescue, defence |
| Key Players | DJI, Skydio (autonomous drones), Wing (Alphabet delivery), Zipline (medical delivery) |
| Autonomy | Ranges from remote-piloted to fully autonomous obstacle avoidance and mission planning |
| Aspect | Detail |
|---|---|
| Definition | Wheeled or tracked robots that autonomously navigate indoor environments (warehouses, hospitals, offices) |
| Use Cases | Warehouse picking and transport, hospital logistics, last-mile delivery, cleaning |
| Key Players | Amazon Robotics (Kiva), Locus Robotics, 6 River Systems, Fetch Robotics, ANYbotics (quadruped) |
| Navigation | SLAM-based; dynamic obstacle avoidance; fleet coordination |
| Aspect | Detail |
|---|---|
| Definition | Robotic systems that assist or perform surgical procedures with superhuman precision |
| Key Systems | Intuitive Surgical da Vinci, Medtronic Hugo, Johnson & Johnson Ottava, CMR Versius |
| Autonomy Level | Currently teleoperated (surgeon controls); moving toward semi-autonomous sub-task execution |
| Market | ~$7.2 billion (2024); growing rapidly with expanding surgical indications |
The fundamental control and planning paradigms that drive embodied AI systems — from fast reflexes to foundation-model planners.
| Aspect | Detail |
|---|---|
| Core Architecture | Sequential pipeline: sense → build world model → plan → act; each stage processes independently |
| Perception | Computer vision, point cloud processing, feature extraction, object recognition |
| Planning | A*, RRT (Rapidly-exploring Random Trees), PRM (Probabilistic Roadmap), optimisation-based planners |
| Control | PID controllers, computed torque control, impedance control |
| Strengths | Well-understood, modular, testable; dominant in industrial robotics |
| Weaknesses | Brittle in unstructured environments; slow to adapt; perception errors cascade |
| Aspect | Detail |
|---|---|
| Core Idea | Proposed by Rodney Brooks (1986); instead of a central model, intelligence emerges from layers of simple behaviour modules |
| How It Works | Parallel behaviour layers (e.g., avoid obstacle, follow wall, seek goal) compete for control; higher layers subsume lower ones |
| Strengths | Robust to sensor noise; fast response; works in unstructured environments |
| Weaknesses | Difficult to scale to complex, multi-step tasks; limited reasoning capability |
| Used In | Early mobile robots (iRobot Roomba ancestors), insect-inspired robotics |
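A toy sketch of subsumption-style arbitration: behaviours are ordered by priority, and the first layer that produces a command suppresses everything below it. The behaviour names and sensor keys are invented for illustration:

```python
def avoid_obstacle(sensors):
    # Highest priority: back away if anything is too close.
    if sensors["front_range_m"] < 0.3:
        return {"linear": -0.1, "angular": 0.5}
    return None  # no opinion; defer to lower layers

def follow_wall(sensors):
    if sensors["right_range_m"] < 1.0:
        return {"linear": 0.2, "angular": 0.0}
    return None

def seek_goal(sensors):
    return {"linear": 0.3, "angular": sensors["goal_bearing_rad"] * 0.8}

LAYERS = [avoid_obstacle, follow_wall, seek_goal]  # priority order

def arbitrate(sensors):
    """The first layer that returns a command subsumes all layers below it."""
    for behaviour in LAYERS:
        cmd = behaviour(sensors)
        if cmd is not None:
            return cmd
```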
| Aspect | Detail |
|---|---|
| Core Idea | Learn a direct mapping from sensory input to motor output using neural networks; bypass explicit perception, planning, and control modules |
| Imitation Learning | Learn by observing human demonstrations (behavioural cloning, DAgger) |
| Reinforcement Learning | Learn by trial and error in simulation, then transfer to real world |
| Strengths | Can handle unstructured environments; learns complex sensorimotor skills; adapts to new situations |
| Weaknesses | Requires massive data/simulation; difficult to verify safety; opaque decision-making |
| Used In | Autonomous driving (Tesla FSD), dexterous manipulation, locomotion |
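A minimal behavioural-cloning sketch in PyTorch: supervised regression from observations to demonstrated actions. The observation/action dimensions and the random tensors standing in for demonstration data are placeholders:

```python
import torch
import torch.nn as nn

# Placeholder demonstration data: 10k (observation, expert action) pairs.
obs = torch.randn(10_000, 32)      # e.g. proprioception + object poses
actions = torch.randn(10_000, 7)   # e.g. 7-DoF arm joint targets

policy = nn.Sequential(
    nn.Linear(32, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, 7),
)
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)

for epoch in range(100):
    pred = policy(obs)
    loss = nn.functional.mse_loss(pred, actions)  # regress onto the expert
    opt.zero_grad()
    loss.backward()
    opt.step()
```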
| Aspect | Detail |
|---|---|
| Core Idea | Large pre-trained models that provide general-purpose perception, language understanding, and robotic control; fine-tuned for specific tasks |
| Vision-Language-Action (VLA) Models | Models that take visual observations and language instructions as input and output robot actions |
| Key Examples | Google RT-2, Octo, OpenVLA, NVIDIA GR00T |
| Strengths | Generalise across tasks and environments; leverage internet-scale pre-training; support natural language commands |
| Weaknesses | Still early; real-time inference is challenging; safety guarantees are difficult |
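Operationally, a VLA model sits inside a perception-action loop. The sketch below uses a hypothetical `VLAPolicy` interface (not a real library API; actual models such as RT-2 or OpenVLA differ in loading and action decoding):

```python
# Hypothetical VLA policy interface; real VLAs differ in detail.
class VLAPolicy:
    def predict_action(self, image, instruction: str):
        """Return an action vector, e.g. [dx, dy, dz, droll, dpitch, dyaw, gripper]."""
        raise NotImplementedError

def vla_control_loop(policy: VLAPolicy, camera, robot, instruction: str, steps=200):
    """camera and robot are hypothetical hardware wrappers."""
    for _ in range(steps):
        frame = camera.capture()                        # visual observation
        action = policy.predict_action(frame, instruction)
        robot.apply_delta(action)                       # small end-effector displacement
```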
| Aspect | Detail |
|---|---|
| Core Problem | Build a map of an unknown environment while simultaneously tracking the robot's position within it |
| Visual SLAM | Uses camera images (ORB-SLAM, LSD-SLAM, RTAB-MAP) |
| LiDAR SLAM | Uses laser scanner point clouds (Cartographer, LOAM, LeGO-LOAM) |
| Sensor Fusion SLAM | Combines visual, LiDAR, and IMU data for robust localisation |
| Used In | Autonomous vehicles, warehouse robots, AR/VR headsets, drones |
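The mapping half of SLAM can be illustrated with a minimal occupancy-grid update that marks the endpoint of each laser return as occupied. A real SLAM system estimates the pose jointly; here the pose is assumed known:

```python
import math
import numpy as np

GRID = np.zeros((200, 200), dtype=np.int8)  # 0 = unknown/free, 1 = occupied
RESOLUTION = 0.05                            # metres per cell

def integrate_scan(x, y, theta, ranges, angle_min, angle_step):
    """Mark the endpoint of each laser return in the grid.
    (x, y, theta) is the robot pose, assumed known for this sketch."""
    for i, r in enumerate(ranges):
        if math.isinf(r):
            continue  # no return along this beam
        beam = theta + angle_min + i * angle_step
        gx = int((x + r * math.cos(beam)) / RESOLUTION)
        gy = int((y + r * math.sin(beam)) / RESOLUTION)
        if 0 <= gx < GRID.shape[1] and 0 <= gy < GRID.shape[0]:
            GRID[gy, gx] = 1
```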
| Algorithm | Description | Used In |
|---|---|---|
| A* | Optimal graph search; complete and optimal with admissible heuristic | Grid-based path planning |
| RRT / RRT* | Sampling-based; grows a random tree through configuration space; RRT* converges to optimal | Robot arm motion planning |
| PRM (Probabilistic Roadmap) | Pre-computes a roadmap of the free space; queries find paths through the roadmap | Multi-query environments |
| CHOMP / TrajOpt | Optimisation-based trajectory planning; smooths and optimises initial trajectories | Collaborative robots, manipulation |
| MPC (Model Predictive Control) | Optimises action sequence over a receding horizon using a dynamics model | Autonomous driving, drone control |
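As a concrete reference point, here is a compact A* implementation on a 4-connected occupancy grid. With the (admissible) Manhattan heuristic the returned path is optimal:

```python
import heapq
import itertools

def astar(grid, start, goal):
    """A* on a 4-connected grid (list of lists; 0 = free, 1 = obstacle)."""
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan heuristic
    tie = itertools.count()                      # tie-breaker for heap ordering
    frontier = [(h(start), 0, next(tie), start, None)]
    parent, best_g = {}, {start: 0}
    while frontier:
        _, g, _, node, par = heapq.heappop(frontier)
        if node in parent:
            continue                             # already expanded
        parent[node] = par
        if node == goal:
            path = []
            while node is not None:              # walk parents back to start
                path.append(node)
                node = parent[node]
            return path[::-1]
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (node[0] + dr, node[1] + dc)
            if (0 <= nxt[0] < len(grid) and 0 <= nxt[1] < len(grid[0])
                    and grid[nxt[0]][nxt[1]] == 0
                    and g + 1 < best_g.get(nxt, float("inf"))):
                best_g[nxt] = g + 1
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, next(tie), nxt, node))
    return None                                  # goal unreachable
```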
The essential middleware, simulators, and frameworks powering Physical AI development and deployment.
| Tool | Provider | Focus |
|---|---|---|
| ROS 2 | Open Robotics | De facto robot middleware; pub/sub, transforms, navigation stack |
| NVIDIA Isaac Sim | NVIDIA | GPU-accelerated robot simulation; synthetic data generation |
| MuJoCo | Google DeepMind | Fast physics engine; contact-rich manipulation research |
| Gazebo | Open Robotics | Classic 3D robot simulator; ROS-integrated |
| PyBullet | Erwin Coumans | Lightweight Python physics sim; RL research |
| CARLA | CVC Barcelona / Intel | Open-source autonomous driving simulator |
| AirSim | Microsoft | Drone / car simulator built on Unreal Engine |
| Autoware | Autoware Foundation | Full open-source AV stack (perception → planning → control) |
| Apollo | Baidu | Chinese AV autonomy platform; HD mapping, planning |
| MoveIt 2 | PickNik | ROS 2 motion planning framework; manipulation pipelines |
| Open3D | Intel ISL | 3D point cloud processing library; registration, visualisation |
| Drake | MIT / TRI | Model-based robot design and control; optimisation-based planning |
| Platform | Deployment | Description |
|---|---|---|
| ROS 2 (Robot Operating System 2) | Open-Source (Linux Ubuntu 22.04+; x86 or ARM; CPU-based; Docker supported) | Industry standard middleware for robotics; pub/sub messaging, hardware abstraction, libraries |
| NVIDIA Isaac ROS | Open-Source (Linux; NVIDIA Jetson Orin or x86 + NVIDIA GPU; CUDA 11.4+) | GPU-accelerated computer vision and AI for ROS 2; DNN acceleration on Jetson |
| micro-ROS | Open-Source (embedded MCUs — STM32, ESP32, NXP; RTOS: FreeRTOS/Zephyr) | ROS 2 for microcontrollers; bringing the ROS ecosystem to embedded systems |
| MoveIt 2 | Open-Source (Linux Ubuntu 22.04+; x86 or ARM; CPU-based; ROS 2 required) | Motion planning framework for ROS 2; arms, manipulators, and mobile bases |
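A minimal ROS 2 node in Python (`rclpy`) publishing velocity commands illustrates the pub/sub model; the `/cmd_vel` topic and `Twist` message are the common convention for mobile bases:

```python
import rclpy
from rclpy.node import Node
from geometry_msgs.msg import Twist

class CmdVelPublisher(Node):
    def __init__(self):
        super().__init__('cmd_vel_publisher')
        self.pub = self.create_publisher(Twist, '/cmd_vel', 10)
        self.timer = self.create_timer(0.1, self.tick)  # publish at 10 Hz

    def tick(self):
        msg = Twist()
        msg.linear.x = 0.2   # m/s forward
        msg.angular.z = 0.1  # rad/s yaw
        self.pub.publish(msg)

def main():
    rclpy.init()
    node = CmdVelPublisher()
    try:
        rclpy.spin(node)
    finally:
        node.destroy_node()
        rclpy.shutdown()

if __name__ == '__main__':
    main()
```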
| Platform | Provider | Deployment | Highlights |
|---|---|---|---|
| NVIDIA Isaac Sim | NVIDIA | On-Prem (Linux; NVIDIA RTX GPU required) / Cloud (AWS EC2 G5/P4d; GCP A2; NVIDIA Omniverse Cloud) | GPU-accelerated physics, photorealistic rendering, domain randomisation, digital twin |
| MuJoCo | DeepMind (open-source) | Open-Source (Linux/macOS/Windows; C; CPU-only) | Fast, accurate physics for RL research; contact-rich manipulation and locomotion |
| Gazebo | Open Robotics (open-source) | Open-Source (Linux Ubuntu; x86; CPU + optional GPU for rendering) | Standard ROS simulator; multi-robot, sensor simulation |
| Unity Robotics | Unity | On-Prem (Windows/Linux/macOS; NVIDIA GPU recommended for rendering) | 3D game engine adapted for robotics simulation; visual realism |
| Unreal + AirSim | Microsoft / Epic | On-Prem (Windows/Linux; NVIDIA GPU required for photorealistic rendering) | Photorealistic drone and car simulation |
| PyBullet | Open-source | Open-Source (any OS; Python 3.8+; CPU-only) | Python physics simulator; quick prototyping; RL integration |
| CARLA | Open-source | Open-Source (Linux/Windows; NVIDIA GPU — GTX 1080+ recommended; Unreal Engine) | Autonomous driving simulator; weather, traffic, sensors |
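A minimal MuJoCo example using the official Python bindings: build a model from an inline XML string and step the physics. The one-box scene is a placeholder:

```python
import mujoco

# A single free-falling box: enough to exercise the model/data/step API.
XML = """
<mujoco>
  <worldbody>
    <body name="box" pos="0 0 1">
      <freejoint/>
      <geom type="box" size="0.1 0.1 0.1" mass="1"/>
    </body>
  </worldbody>
</mujoco>
"""

model = mujoco.MjModel.from_xml_string(XML)
data = mujoco.MjData(model)

for _ in range(500):                 # 500 steps at the default 2 ms timestep
    mujoco.mj_step(model, data)

print("box height after 1 s:", data.qpos[2])  # z component of the free joint
```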
| Platform | Provider | Deployment | Highlights |
|---|---|---|---|
| NVIDIA DRIVE | NVIDIA | Edge (NVIDIA DRIVE Orin / Thor SoC on vehicle; development on Linux x86 + NVIDIA GPU) | End-to-end AV platform; perception, planning, mapping on NVIDIA hardware |
| Apollo | Baidu (open-source) | Open-Source (Linux; x86 + NVIDIA GPU; Docker; vehicle-mounted compute unit) | Full-stack AV platform; perception, planning, control |
| Autoware | Open-source | Open-Source (Linux Ubuntu; x86 + NVIDIA GPU; ROS 2; vehicle-mounted compute) | ROS-based AV software stack; used in research and commercial deployments |
| Waymo Driver | Waymo (Alphabet) | Edge (custom compute module in vehicle; GCP for cloud training and HD map processing) | Proprietary L4 software stack; ~$10B+ invested |
| Tesla FSD | Tesla | Edge (Tesla HW4 computer — dual AI chips; in-vehicle only) | Vision-only approach; end-to-end neural network; deployed at scale |
| Mobileye EyeQ | Mobileye (Intel) | Edge (Mobileye EyeQ Ultra SoC; in-vehicle) | ADAS and AV chipset and software; camera-first approach |
| Model / Platform | Provider | Deployment | Highlights |
|---|---|---|---|
| RT-2 / RT-X | Google DeepMind | Cloud (GCP — TPU for training; Jetson/GPU for on-robot inference) | Vision-Language-Action model; generalises across tasks and embodiments |
| Octo | UC Berkeley et al. | Open-Source (Linux; Python 3.9+; NVIDIA GPU — A100 recommended for training; Jetson for inference) | Open-source robot foundation model; diverse data, multi-task |
| GR00T | NVIDIA | Cloud (NVIDIA DGX Cloud for training) / Edge (Jetson Thor for on-robot inference) | Foundation model for humanoid robots; multimodal input, action output |
| OpenVLA | Stanford et al. | Open-Source (Linux; Python 3.9+; NVIDIA GPU — A100 for training; smaller GPU for inference) | Open-source Vision-Language-Action model; fine-tunable for specific robots |
| pi0 | Physical Intelligence | Cloud (private GPU infrastructure for training; robot-mounted compute for inference) | General-purpose robot foundation model; manipulation and locomotion |
| Platform | Deployment | Highlights |
|---|---|---|
| NVIDIA Jetson (Orin / Thor) | Edge (ARM SoC with integrated NVIDIA GPU; 15–275 W; Linux) | Edge AI compute for robots and autonomous machines; GPU inference |
| Qualcomm Robotics RB5/RB7 | Edge (ARM SoC with Qualcomm AI Engine; low-power; Linux) | Low-power AI compute for mobile robots and drones |
| Intel RealSense | Edge (USB depth camera; works with any x86/ARM host running Linux/Windows) | Depth cameras for robot perception (being sunset; legacy installed base) |
| Universal Robots (UR) | On-Prem (UR controller box; 6-axis cobot arm; 100–240 V AC; ROS 2 driver available) | Collaborative robot arms; easy programming; dominant in SME manufacturing |
| Boston Dynamics Spot / Atlas | On-Prem (self-contained compute on robot; Wi-Fi / Ethernet control; Spot SDK on Linux/macOS/Windows) | Quadruped (Spot) and humanoid (Atlas) research platforms |
| Franka Emika Panda | On-Prem (Franka control box; 7-axis arm; real-time Linux host PC required; ROS 2 driver) | Research-grade collaborative arm; open interface; popular in labs |
Real-world deployment domains for Physical / Embodied AI, with concrete examples and impact metrics.
| Use Case | Description | Key Examples |
|---|---|---|
| Assembly | Robotic arms assemble components — automotive, electronics, consumer goods | Fanuc, ABB, KUKA assembly lines |
| Welding | Automated arc welding, spot welding, laser welding | Automotive body-in-white lines |
| Quality Inspection | AI-powered visual inspection for defects at production speed | Cognex, Keyence, Landing AI |
| Material Handling | Autonomous transport of parts and materials within facilities | AGVs, AMRs (Amazon Robotics, Locus) |
| Collaborative Assembly | Cobots working alongside humans for flexible, mixed automation | Universal Robots, Fanuc CRX |
| Bin Picking | AI-guided manipulation of randomly oriented parts from bins | Covariant, Plus One Robotics |
| Use Case | Description | Key Examples |
|---|---|---|
| Goods-to-Person | AMRs bring shelves/bins to human pickers | Amazon Robotics (Kiva), Geek+ |
| Autonomous Picking | Robot arms pick individual items from bins or shelves | Covariant, Berkshire Grey, RightHand Robotics |
| Sorting | Automated sorting of packages by destination | Autonomous parcel sorting (FedEx, UPS) |
| Last-Mile Delivery | Autonomous delivery robots and drones | Starship Technologies, Nuro, Wing |
| Autonomous Trucking | Self-driving long-haul trucks | Aurora, Kodiak Robotics, Waymo Via (TuSimple rebranded to CreateAI in 2024, pivoted away from trucking) |
| Use Case | Description | Key Examples |
|---|---|---|
| Surgical Robotics | Robot-assisted minimally invasive surgery | da Vinci (Intuitive), Hugo (Medtronic) |
| Hospital Logistics | AMRs delivering supplies, medications, and meals | Aethon TUG, Diligent Robotics Moxi |
| Rehabilitation | Robotic exoskeletons and therapy devices | Ekso, ReWalk, Fourier Intelligence |
| Disinfection | UV-C autonomous disinfection robots | Xenex, UVD Robots |
| Use Case | Description | Key Examples |
|---|---|---|
| Autonomous Tractors | Self-driving tractors for ploughing, seeding, spraying | John Deere autonomous, CNH, AGCO |
| Crop Monitoring | Drones for aerial crop health assessment | DJI Agriculture, PrecisionHawk |
| Harvesting Robots | Autonomous picking of fruits, vegetables, and specialty crops | Tortuga AgTech, Agrobot (Abundant Robotics shut down 2021) |
| Weeding Robots | AI-guided precision weeding to reduce herbicide use | Blue River (See & Spray), FarmWise |
| Use Case | Description | Key Examples |
|---|---|---|
| Autonomous Haul Trucks | Self-driving haul trucks in mining operations | Caterpillar, Komatsu, Hitachi autonomous trucks |
| Autonomous Drilling | Automated drill rigs in mining and oil & gas | Epiroc, Sandvik autonomous drills |
| Construction Robotics | Robotic bricklaying, 3D printing, rebar tying | Hadrian X, ICON 3D printing, Dusty Robotics |
| Site Inspection | Drone and robot-based construction site monitoring | Skydio, DJI, Spot (Boston Dynamics) |
| Use Case | Description | Key Examples |
|---|---|---|
| Unmanned Ground Vehicles | Autonomous or remote-controlled ground vehicles for reconnaissance and EOD | Various defence contractors |
| Unmanned Aerial Vehicles | Autonomous drones for surveillance, reconnaissance, and delivery | Military UAV programmes worldwide |
| Unmanned Maritime | Autonomous surface and underwater vehicles | Navy UUV and USV programmes |
| Perimeter Security | Autonomous patrol robots for facility security | Knightscope, Cobalt Robotics |
Key performance benchmarks for robot manipulation and autonomous vehicle safety evaluation.
| Metric | What It Measures |
|---|---|
| Task Success Rate | % of attempts that achieve the desired outcome (e.g., successful grasp, delivery) |
| Cycle Time | Time to complete one full task cycle (e.g., pick-place, navigation to target) |
| Positional Accuracy | How precisely the robot reaches the intended position — mm or sub-mm |
| Repeatability | Consistency of positioning across repeated motions — key for manufacturing |
| Payload Capacity | Maximum weight the robot can manipulate reliably |
| Mean Time Between Failures (MTBF) | Average operational time before a system failure |
| Uptime / Availability | % of scheduled time the system is operational |
| Metric | What It Measures |
|---|---|
| Miles Between Disengagement | How far the vehicle drives autonomously before a human must take over |
| Collision Rate | Collisions per million miles driven |
| Minimum Risk Condition (MRC) Events | How often the system must execute an emergency stop or pullover |
| Perception Recall / Precision | Accuracy of object detection for vehicles, pedestrians, cyclists |
| Prediction Accuracy | How well the system predicts the future trajectories of other road users |
| Ride Comfort Score | Passenger comfort metrics: lateral acceleration, jerk, braking smoothness |
| Metric | What It Measures |
|---|---|
| Transfer Success Rate | % of simulation-trained policies that succeed in the real world without additional training |
| Reality Gap | Performance difference between simulation and real-world deployment |
| Fine-Tuning Data Efficiency | Amount of real-world data needed to close the reality gap |
| Domain Randomisation Coverage | How well the randomised simulation covers the distribution of real-world conditions |
Investment and revenue projections across Physical / Embodied AI market segments, with a 2024–2030 growth trajectory.
| Metric | Value | Source / Notes |
|---|---|---|
| Global Industrial Robot Installations (2024) | ~590,000 units/year | IFR World Robotics 2024 |
| Global Industrial Robot Installed Base (2024) | ~4.2 million units | IFR |
| Autonomous Vehicle Market (2024) | ~$54 billion | Including ADAS and L2+ systems; robotaxi revenue still small |
| Service Robot Market (2024) | ~$22 billion | Logistics, medical, professional, consumer |
| Drone Market (2024) | ~$38 billion | Commercial + defence; DJI dominant in commercial |
| Surgical Robotics Market (2024) | ~$7.2 billion | Intuitive Surgical dominant; Medtronic, J&J entering |
| Humanoid Robot Investment (2024) | ~$5.3 billion cumulative | Tesla, Figure, 1X, Unitree, Agility; explosive growth |
| Collaborative Robot Market (2024) | ~$2.4 billion | Universal Robots, Fanuc CRX, ABB GoFa |
| Trend | Description |
|---|---|
| Robot Foundation Models | VLA models (RT-2, Octo, GR00T) enabling generalisation across tasks and embodiments |
| Humanoid Race | Intense competition among Tesla, Figure, 1X, and others to deploy humanoid robots in real workplaces |
| Sim-to-Real Maturing | NVIDIA Isaac and MuJoCo enabling increasingly reliable sim-to-real transfer |
| Autonomous Trucking | Multiple companies nearing commercial L4 autonomous trucking on US highways |
| China Robot Boom | China installed more robots than any other country; aggressive humanoid development |
| Surgical Expansion | Robot-assisted surgery expanding beyond urology to general surgery, orthopaedics, and neurosurgery |
| Agriculture Automation | Labour shortages driving rapid adoption of autonomous tractors and harvesting robots |
Critical hazards and open challenges in deploying Physical AI systems at scale.
Robots can injure or kill — collision, pinch-point, runaway, and crushing scenarios. Functional safety standards (ISO 13849, IEC 61508) must be rigorously applied.
Models trained in simulation routinely underperform in the messy, unstructured real world. Domain randomisation and system identification help but don't fully close the gap.
Rain, fog, dust, vibration, and extreme temperatures degrade perception quality. Redundancy and graceful degradation strategies are essential for safety-critical deployments.
Inconsistent global standards for autonomous vehicles, drones, and surgical robots create compliance complexity and slow cross-border deployment.
Unclear fault attribution when autonomous systems cause harm. Product liability, operator negligence, and software defects create complex legal landscapes.
Automation replacing manual labour at scale — warehouse, manufacturing, driving, and agriculture. Requires proactive workforce retraining and social safety nets.
| Limitation | Description |
|---|---|
| Real-World Complexity | The physical world is infinitely variable — lighting, weather, terrain, human behaviour; impossible to fully anticipate |
| Safety Criticality | Physical actions can injure humans, damage property, or cause death; stakes are fundamentally different from software AI |
| Irreversibility | Physical actions cannot be "undone" — a collision, fall, or drop has permanent consequences |
| Sim-to-Real Gap | Even the best simulators cannot perfectly replicate real-world physics, sensing, and conditions |
| Power & Energy | Battery life limits operational endurance; heavy computation conflicts with energy efficiency |
| Dexterity Gap | Current manipulation capability is far below human dexterity for fine motor tasks |
| Cost | Advanced robots remain expensive; ROI justification is challenging for many use cases |
| Edge Cases | Long-tail scenarios (unusual objects, unexpected human behaviour, rare conditions) are extremely difficult to handle |
| Risk | Description | Mitigation |
|---|---|---|
| Collision / Impact | Robot collides with humans, objects, or infrastructure | Collision detection, force limiting, safety-rated speed zones |
| Sensor Failure | Sensor degradation or failure leads to incorrect perception | Sensor redundancy, cross-modal validation, degraded-mode operation |
| Software Failure | Bugs, edge cases, or model errors cause dangerous behaviour | Functional safety standards, extensive testing, runtime monitoring |
| Cybersecurity | Hacked robots or vehicles could be weaponised or caused to malfunction | Secure communications, OTA update authentication, intrusion detection |
| Environmental | Adverse weather, lighting, or terrain causes system failure | Robust perception, weather detection, operational domain restrictions |
| Human Interaction | Unpredictable human behaviour near robots causes dangerous situations | Safety-rated collaborative operation, proximity detection |
| Criterion | Why Physical AI Excels |
|---|---|
| Physical Task | When the task inherently requires acting on the physical world |
| Dangerous Environments | When human presence is dangerous (nuclear, undersea, space, hazardous materials) |
| Repetitive Physical Labour | When the task is physically demanding, repetitive, and consistent |
| Scale & Speed | When throughput requirements exceed human physical capability |
| Precision | When sub-millimetre accuracy is required (surgery, microassembly) |
| 24/7 Operation | When continuous operation without fatigue is needed |
Key terms in Physical / Embodied AI.
| Term | Definition |
|---|---|
| Actuator | A physical device (motor, servo, hydraulic cylinder) that converts control signals into physical motion |
| AMR (Autonomous Mobile Robot) | A wheeled or tracked robot that navigates autonomously using onboard sensors and mapping |
| AV (Autonomous Vehicle) | A vehicle capable of driving without human intervention at some level of automation |
| Behavioural Cloning | Imitation learning by training a policy to replicate expert demonstrations via supervised learning |
| Cobot (Collaborative Robot) | A robot designed to work safely alongside humans without safety cages |
| DAgger | Dataset Aggregation — iterative imitation learning that corrects distributional shift by querying the expert |
| Degrees of Freedom (DoF) | Number of independent axes of motion a robot can control |
| Digital Twin | A virtual replica of a physical system, updated with real-time data, used for simulation and monitoring |
| Domain Randomisation | Randomising visual and physical parameters in simulation to produce policies robust to real-world variation |
| End Effector | The tool or gripper attached to the end of a robot arm; the part that interacts with the workpiece |
| GNSS | Global Navigation Satellite System — includes GPS (US), Galileo (EU), GLONASS (Russia), BeiDou (China) |
| HD Map | High-definition map with centimetre-accurate road geometry, lane markings, and traffic features |
| Humanoid Robot | A robot with a human-like body form — bipedal legs, arms, hands, and typically a head |
| IFR | International Federation of Robotics — the industry body tracking global robot deployment statistics |
| Imitation Learning | Learning a policy from demonstrations of desired behaviour (behavioural cloning, DAgger, IRL) |
| Inverse Kinematics | Computing the joint angles needed to place the end effector at a desired position and orientation |
| LiDAR | Light Detection and Ranging — a sensor that measures distances by emitting laser pulses and timing their return |
| Localisation | Determining the robot's or vehicle's precise position and orientation within a map or coordinate frame |
| MPC (Model Predictive Control) | Control strategy that optimises actions over a future horizon using a predictive model of system dynamics |
| Odometry | Estimating position change over time from wheel rotation, visual features, or inertial measurements |
| PID Controller | Proportional-Integral-Derivative controller — the foundational algorithm for low-level actuator control |
| Point Cloud | A set of 3D points (x, y, z) representing the surface geometry of a scene, typically from LiDAR |
| ROS (Robot Operating System) | Open-source middleware framework providing tools, libraries, and conventions for robot software development |
| RRT (Rapidly-exploring Random Tree) | A sampling-based motion planning algorithm that grows a search tree through free space |
| SAE Level | Society of Automotive Engineers automation level (L0–L5) classifying the degree of vehicle autonomy |
| Sensor Fusion | Combining data from multiple sensor modalities to produce more accurate and robust perception |
| Sim-to-Real | Transferring a policy trained in simulation to a real-world physical system |
| SLAM (Simultaneous Localisation and Mapping) | Building a map of an unknown environment while simultaneously tracking the robot's position within it |
| SOTIF (ISO 21448) | Safety of the Intended Functionality — a standard addressing performance limitations of automated driving |
| Teleoperation | Remote human control of a robot, typically with visual feedback |
| VLA (Vision-Language-Action) Model | A model that takes visual and language inputs and outputs robot actions; the robotic counterpart of VLMs |
| Workspace | The physical volume reachable by a robot's end effector |
[Animation infographics: Physical / Embodied AI overview (2026); full technology stack: Hardware → Compute → Data → Frameworks → Orchestration → Serving → Application (2026).]
Detailed reference: regulation, covering safety standards and open legal challenges.
| Standard | Domain | What It Requires |
|---|---|---|
| ISO 10218 | Industrial robots | Safety requirements for industrial robot systems |
| ISO 15066 | Collaborative robots | Safety requirements for collaborative robot operation (force limits, speed limits) |
| ISO 13849 | Machinery safety | Safety-related parts of control systems; Performance Levels (PL a-e) |
| IEC 62443 | Industrial cybersecurity | Security for industrial automation and control systems |
| SAE J3016 | Autonomous vehicles | Defines the 6 levels of driving automation (L0–L5) |
| ISO 21448 (SOTIF) | Autonomous vehicles | Safety of the intended functionality — addresses performance limitations and misuse |
| EASA (EU) / FAA (US) | Drones | Drone registration, airspace rules, remote ID, operational categories |
| FDA | Surgical robots | Classification as medical devices; pre-market clearance or approval |
| EU AI Act | All high-risk AI | Conformity assessment for high-risk AI applications including autonomous vehicles and robots |
| EU Machinery Regulation (2023/1230) | Machinery / robots | Updated regulation covering AI-enabled machinery; replaces Machinery Directive |
| Challenge | Description |
|---|---|
| Liability | Who is responsible when an autonomous system causes harm — manufacturer, operator, software provider, or the AI itself? |
| Certification | How to certify systems that learn and adapt; deterministic testing is insufficient for learned policies |
| Operational Design Domain | Precisely defining the conditions under which the system is safe to operate |
| Continuous Learning | If robots update policies in the field, every update needs safety re-validation |
| Data Privacy | Robots with cameras in public and private spaces raise significant surveillance and privacy concerns |
| Workforce Displacement | Automation of physical labour has significant social and economic implications |
Detailed reference: technical deep dives covering sensors, perception, learning paradigms, sim-to-real transfer, and autonomous driving.
| Sensor | Data Type | Strengths | Limitations |
|---|---|---|---|
| Camera (RGB) | 2D images | Rich semantic information; cheap; high resolution | No direct depth; affected by lighting |
| Stereo Camera | 2D images + depth | Depth from disparity; moderate cost | Short-range depth; calibration-sensitive |
| LiDAR | 3D point clouds | Precise 3D geometry; works in dark | Expensive; sparse data; affected by rain/fog |
| Radar | Range + velocity | Works in all weather; measures velocity directly | Low resolution; no semantic information |
| Ultrasonic | Range (short) | Cheap; good for close-range detection | Very limited range; no detail |
| IMU (Inertial) | Acceleration + rotation | Fast (1 kHz+); no external dependencies | Drifts over time; must be fused with other sensors |
| GPS/GNSS | Global position | Global reference; ubiquitous | ~1 m accuracy outdoors; poor indoors; latency |
| Force/Torque | Contact forces | Enables compliant manipulation; detects contact | Only measures at sensor location |
| Tactile | Surface contact patterns | Enables dexterous manipulation; object property sensing | Low spatial coverage; emerging technology |
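The simplest fusion scheme, the textbook complementary filter, illustrates why IMU data must be fused: the gyro is fast but drifts, while the accelerometer is absolute but noisy. A sketch for pitch estimation:

```python
import math

def complementary_filter(pitch, gyro_rate, accel_pitch, dt, alpha=0.98):
    """Blend the integrated gyro (fast, drifting) with the accelerometer
    tilt (slow, absolute). pitch: previous estimate in rad; gyro_rate: rad/s."""
    return alpha * (pitch + gyro_rate * dt) + (1 - alpha) * accel_pitch

def accel_to_pitch(ax, az):
    """Tilt angle from the gravity components measured by the accelerometer."""
    return math.atan2(ax, az)

# Example: one 100 Hz update (dt = 0.01 s) with placeholder readings.
pitch = complementary_filter(0.05, gyro_rate=0.2, accel_pitch=0.06, dt=0.01)
```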
| Task | Description | Key Techniques |
|---|---|---|
| Object Detection | Identify and localise objects in 2D images | YOLO, DETR, Faster R-CNN |
| Semantic Segmentation | Classify every pixel/point into a category | Mask R-CNN, SegFormer, PointNet++ |
| Depth Estimation | Estimate distance from camera to each pixel | Stereo disparity, monocular depth (MiDaS, DPT) |
| 3D Reconstruction | Build 3D models of scenes from sensor data | NeRF, 3D Gaussian Splatting, Structure from Motion |
| Pose Estimation | Determine the 6-DoF position/orientation of objects or humans | PoseNet, FoundationPose, MediaPipe |
| SLAM | Build a map and localise simultaneously | ORB-SLAM3, Cartographer, RTAB-MAP |
| Semantic Scene Understanding | Understand spatial relationships, affordances, and meaning | Scene graphs, VLMs (vision-language models) |
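For stereo depth, the governing relation is Z = f · B / d (focal length times baseline over disparity). A small numpy sketch:

```python
import numpy as np

def disparity_to_depth(disparity, focal_px, baseline_m):
    """Pinhole stereo: depth Z = f * B / d; larger disparity means closer.
    disparity: per-pixel disparity map in pixels (0 where unmatched)."""
    depth = np.full_like(disparity, np.inf, dtype=np.float64)
    valid = disparity > 0
    depth[valid] = focal_px * baseline_m / disparity[valid]
    return depth

# e.g. f = 700 px, baseline = 12 cm, 32 px disparity -> ~2.6 m
print(disparity_to_depth(np.array([[32.0]]), 700.0, 0.12))
```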
| Paradigm | Description | Examples |
|---|---|---|
| Imitation Learning (IL) | Learn from human demonstrations (teleoperation, motion capture, video) | Behavioural cloning, DAgger, ACT |
| Reinforcement Learning (RL) | Learn from reward signals in simulation, then transfer to real world | PPO in MuJoCo, sim-to-real for locomotion |
| Self-Supervised Learning | Learn representations from unlabeled sensor data (e.g., predicting next frame) | Robotic pre-training, world models |
| Language-Conditioned Learning | Natural language instructions guide robot behaviour | RT-2, SayCan, Inner Monologue |
| Hybrid IL + RL | Bootstrap with demonstrations; refine with RL | Residual RL, demo-augmented RL |
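A minimal RL training sketch using the gymnasium and stable-baselines3 libraries (assumed installed); a classic control task stands in here for a robot simulator:

```python
import gymnasium as gym
from stable_baselines3 import PPO

# Train a PPO policy in simulation, then roll it out.
env = gym.make("Pendulum-v1")
model = PPO("MlpPolicy", env, verbose=0)
model.learn(total_timesteps=50_000)

obs, _ = env.reset()
for _ in range(200):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    if terminated or truncated:
        obs, _ = env.reset()
```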
┌──────────────────────────────────────────────────────────────────────────┐
│                           SIM-TO-REAL PIPELINE                           │
│                                                                          │
│  1. SIMULATE         2. RANDOMISE        3. TRANSFER                     │
│  ──────────────      ──────────────      ──────────────                  │
│  Train policy in     Apply domain        Deploy trained                  │
│  high-fidelity       randomisation:      policy on real                  │
│  physics             vary physics,       robot; fine-tune                │
│  simulator           visuals, and        with real-world                 │
│                      dynamics            data                            │
│                                                                          │
│  NVIDIA Isaac Sim    Randomise mass,     Zero-shot or                    │
│  MuJoCo              friction, lighting, few-shot transfer               │
│  PyBullet            texture, noise                                      │
│                                                                          │
│  ──── GOAL: POLICY LEARNED IN SIMULATION WORKS IN REAL WORLD ──────      │
└──────────────────────────────────────────────────────────────────────────┘
| Technique | Description |
|---|---|
| Domain Randomisation | Randomise visual and physical parameters in simulation to produce robust, transferable policies |
| System Identification | Measure real-world physical parameters and replicate them accurately in simulation |
| Progressive Nets | Transfer features learned in simulation; fine-tune additional columns on real data |
| Sim-to-Real + Real-to-Sim | Iteratively refine the simulator using real-world data; then retrain |
| Teacher-Student Distillation | Train a privileged "teacher" in simulation with full state; distil into a "student" using only real sensor inputs |
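A sketch of domain randomisation with the MuJoCo Python bindings: perturb masses, friction, and gravity before each training episode. The ranges are arbitrary examples, not tuned values:

```python
import mujoco
import numpy as np

def randomise(model):
    """Perturb physical parameters before each training episode (the core
    of domain randomisation). Ranges here are arbitrary examples."""
    model.body_mass[:] *= np.random.uniform(0.8, 1.2, size=model.nbody)
    model.geom_friction[:, 0] = np.random.uniform(0.5, 1.5, size=model.ngeom)
    model.opt.gravity[2] = -9.81 * np.random.uniform(0.95, 1.05)

# Per-episode usage inside an RL training loop (sketch):
# randomise(model)
# data = mujoco.MjData(model)   # fresh state under the new parameters
# ...collect a rollout, update the policy...
```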
| Module | Function |
|---|---|
| Sensor Suite | Cameras (8–16), LiDAR (1–5), radar (4–6), ultrasonic, GPS/IMU |
| Perception | 3D object detection, lane detection, traffic sign/light recognition, free space estimation |
| Prediction | Predict trajectories of other road users (vehicles, pedestrians, cyclists) |
| Planning | Route planning, behaviour planning (lane change, merge), trajectory optimisation |
| Control | Steering, throttle, brake commands — following the planned trajectory |
| HD Maps | Centimetre-accurate maps with lane markings, traffic signs, and road topology |
| Localisation | Fuse GPS, IMU, LiDAR, and camera with HD maps for centimetre-level self-localisation |
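The control row above can be illustrated with pure pursuit, a classic geometric path-tracking law (a generic textbook method, not tied to any specific AV stack): steer toward a look-ahead point on the planned path.

```python
import math

def pure_pursuit_steering(x, y, yaw, path, lookahead_m, wheelbase_m):
    """Return a steering angle (rad) that arcs the vehicle toward the first
    path point at least `lookahead_m` away. path: list of (x, y) waypoints."""
    for px, py in path:
        if math.hypot(px - x, py - y) >= lookahead_m:
            # Bearing to the target point, expressed in the vehicle frame.
            alpha = math.atan2(py - y, px - x) - yaw
            # Bicycle-model pure pursuit law.
            return math.atan2(2.0 * wheelbase_m * math.sin(alpha), lookahead_m)
    return 0.0  # end of path: hold the wheel straight
```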
| Level | Name | Description | Example |
|---|---|---|---|
| L0 | No Automation | Human performs all driving tasks | Manual car |
| L1 | Driver Assistance | System controls steering OR acceleration | Adaptive cruise control |
| L2 | Partial Automation | System controls steering AND acceleration; human monitors | Tesla Autopilot, GM Super Cruise |
| L3 | Conditional Automation | System handles all driving in defined conditions; human must be ready to take over | Mercedes Drive Pilot (highway) |
| L4 | High Automation | System handles all driving in defined conditions; no human intervention needed | Waymo robotaxi (geofenced) |
| L5 | Full Automation | System handles all driving in all conditions | Not yet achieved |
Detailed reference: overview of Physical / Embodied AI and how it relates to other AI system types.
Physical / Embodied AI places artificial intelligence inside a physical body that must perceive, navigate, and manipulate the real world. Unlike purely software AI systems that process digital data, embodied AI must deal with the fundamental challenges of the physical world: continuous sensory streams, real-time constraints, noise, uncertainty, safety hazards, and the irreversibility of physical actions.
Embodied AI bridges the gap between digital intelligence and real-world impact. It encompasses robotics (industrial, service, humanoid), autonomous vehicles (cars, trucks, drones, ships), wearable devices, and smart infrastructure. The defining characteristic is that these systems have a physical body with sensors and actuators, and their intelligence is grounded in real-world spatial and temporal experience.
The field has accelerated dramatically with advances in foundation models for robotics, sim-to-real transfer, and the convergence of vision-language models with physical control. Systems like Google RT-2, NVIDIA's GR00T, Tesla Optimus, and Figure's humanoids represent a new generation of physically embodied intelligence.
| Dimension | Detail |
|---|---|
| Core Capability | Embodies — acts in the physical world through a body with sensors and actuators, perceiving and manipulating real environments |
| How It Works | Sensor fusion, perception, world modelling, motion planning, control, sim-to-real transfer, robot foundation models |
| What It Produces | Physical actions — movement, manipulation, navigation, assembly, delivery, inspection |
| Key Differentiator | Grounded in the real world — must handle continuous physics, real-time constraints, safety risks, and irreversible actions |
| AI Type | What It Does | Example |
|---|---|---|
| Physical / Embodied AI | Acts in the physical world through a body with sensors and actuators | Robot arm, autonomous vehicle, drone |
| Agentic AI | Pursues goals autonomously in digital environments | Research agent, coding agent |
| Analytical AI | Extracts insights and explanations from data | Dashboard, root-cause analysis, anomaly detection |
| Autonomous AI (Non-Agentic) | Operates independently within fixed boundaries without human input | Autopilot, auto-scaling, algorithmic trading |
| Bayesian / Probabilistic AI | Reasons under uncertainty using probability distributions | Clinical trial analysis, A/B testing, risk modelling |
| Cognitive / Neuro-Symbolic AI | Combines neural learning with symbolic reasoning | LLM + knowledge graph, physics-informed neural net |
| Conversational AI | Manages multi-turn dialogue between humans and machines | Customer service chatbot, voice assistant |
| Evolutionary / Genetic AI | Optimises solutions through population-based search inspired by natural selection | Neural architecture search, logistics scheduling |
| Explainable AI (XAI) | Makes AI decisions understandable to humans | SHAP explanations, LIME, Grad-CAM |
| Generative AI | Creates new digital content from learned distributions | Text generation, image synthesis |
| Multimodal Perception AI | Fuses vision, language, audio, and other modalities | GPT-4o processing image + text, AV sensor fusion |
| Optimisation / Operations Research AI | Finds optimal solutions to constrained mathematical problems | Vehicle routing, supply chain planning, scheduling |
| Predictive / Discriminative AI | Classifies or forecasts from historical data | Fraud detection, demand forecasting |
| Privacy-Preserving AI | Trains and runs AI without exposing raw data | Federated hospital models, differential privacy |
| Reactive AI | Maps input to output with no learning or planning | Thermostat, ABS braking system |
| Recommendation / Retrieval AI | Surfaces relevant items from large catalogues based on user signals | Netflix suggestions, Google Search, Spotify playlists |
| Reinforcement Learning AI | Learns from reward signals via trial and error | AlphaGo, RL-based robot policy |
| Scientific / Simulation AI | Solves scientific problems and models physical systems | AlphaFold, climate simulation, molecular dynamics |
| Symbolic / Rule-Based AI | Reasons over explicit rules and knowledge to derive conclusions | Medical expert system, legal reasoning engine |
Key Distinction from Agentic AI: Agentic AI operates in digital environments using software tools (APIs, web browsers, code interpreters). Physical AI operates in the real physical world — its actions move matter, consume energy, and carry safety risks.
Key Distinction from Reactive AI: Reactive AI responds to stimuli with no planning or learning. Physical AI performs complex planning, learns from experience, and maintains world models — but shares the need for real-time, deterministic safety subsystems.
Key Distinction from Reinforcement Learning AI: RL is a learning paradigm — a technique used within Physical AI systems. Physical AI is a deployment domain — a system that exists in the real world and may use RL, supervised learning, foundation models, or classical control.