Intro V8

- 3 mins read

Series: Understanding V8 internals

Preliminaries

Chrome uses a multi-process architecture:

  • Browser Process (Parent): The privileged coordinator; manages the UI, navigation, and all child processes.
  • Renderer Process (Child): Handles tabs, extensions, iframes from different origins. Runs web content in a sandbox for security.
  • Utility Process: Manages network and file system access.
  • GPU Process: Manages graphics rendering.

Renderer Process Details:

  • Everything from the internet runs here (because it’s untrusted).
  • Sandboxed: Limited system access for safety.
  • Communicates with Browser Process via Mojo IPC (Inter-Process Communication).
  • IPC allows two processes to share information safely and efficiently.

V8 Engine:

  • Chrome’s JavaScript and WebAssembly engine.
  • Uses JIT (Just-In-Time) Compilation to convert code to machine code for faster execution.
  • Optimizes frequently used code (“hot code”) for near-native speed (see the sketch below).
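
For a concrete feel of tier-up, here is a minimal d8 sketch; the flags and the %-prefixed natives-syntax intrinsics are real V8 features, while the function and file name are purely illustrative:

```js
// run as: d8 --allow-natives-syntax --trace-opt --trace-deopt tierup.js
function add(a, b) {
  return a + b;
}

%PrepareFunctionForOptimization(add); // allow feedback collection
add(1, 2);                            // gather type feedback (numbers)
add(3, 4);

%OptimizeFunctionOnNextCall(add);     // request tier-up on the next call
add(5, 6);                            // executes optimized machine code

add("x", "y");                        // new argument types break the numeric
                                      // speculation and trigger a deopt
```

With --trace-opt and --trace-deopt, d8 logs when add gets optimized and when the last call deoptimizes it back to bytecode.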

Security Upgrade - Ubercage (V8 Heap SBX):

  • New sandboxing layer for V8 that confines memory corruption to the V8 heap region, so even a compromised renderer cannot use corrupted V8 objects to read or write arbitrary process memory.

```mermaid
graph TD
  A[Browser Process] --> B[Renderer Process]
  A --> C[Utility Process]
  A --> D[GPU Process]
  B --> E[V8 Engine]
  B --> F[Web Content]
  B -- Mojo IPC --> A
  E --> G[JIT Compilation]
  G --> H[Optimized Hot Code]
  E --> I[WebAssembly Execution]
  E --> J[Ubercage]
  C --> K[Network Management]
  C --> L[File System Access]
  D --> M[Graphics Rendering]
```

JS Engine Pipeline (V8)

When JavaScript code runs in V8, it goes through several steps:

  • Lexer

    • Breaks the JS code into tokens (keywords, constants, operators, variables) and stores them in a token cache (see the toy tokenizer sketch after this list).
    • Sets c0_, the scanner’s current-character cursor, to the first character of the JS code to be compiled.
  • Parser

    • Uses tokens from the token cache to build an AST (Abstract Syntax Tree).
    • Inserts each new node into the AST.
    • Checks if the code is valid.
    • On a token-cache miss, it calls the lexer to generate the token, stores it in the cache, and then continues.
  • Ignition (Interpreter)

    • Converts AST into bytecode.
    • Runs the bytecode using a register-based machine model (see the bytecode dump example after this list).
  • Sparkplug (Baseline Compiler)

    • Converts bytecode into machine code.
    • No optimizations at this stage.
    • Works mostly as a dispatcher: it walks the bytecode and emits a short machine-code sequence (largely calls to builtins) for each bytecode.
  • Maglev (Mid-tier JIT Compiler)

    • Some optimizations.
    • Generates faster machine code than Sparkplug.
    • Uses a CFG-based (control-flow graph) IR.
  • Turbofan (High-end JIT Compiler)

    • Advanced optimizations.
    • Produces highly optimized machine code for performance.
    • Uses a Sea of Nodes IR.
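
V8’s scanner is written in C++, but a toy JavaScript sketch can illustrate what “breaking code into tokens” means; this is only an illustration, not V8’s implementation:

```js
// Toy tokenizer for illustration only; V8's real scanner (src/parsing/scanner.cc)
// is far more sophisticated and works on the raw source stream.
function tokenize(src) {
  const rules = [
    ["keyword",    /^(function|return|let|const)\b/],
    ["number",     /^\d+/],
    ["identifier", /^[A-Za-z_$][\w$]*/],
    ["operator",   /^[+\-*/=(){};,]/],
    ["space",      /^\s+/],
  ];
  const tokens = [];
  let rest = src;
  while (rest.length > 0) {
    let matched = false;
    for (const [kind, re] of rules) {
      const m = rest.match(re);
      if (m) {
        if (kind !== "space") tokens.push({ kind, text: m[0] }); // drop whitespace
        rest = rest.slice(m[0].length);
        matched = true;
        break;
      }
    }
    if (!matched) throw new Error("unexpected character: " + rest[0]);
  }
  return tokens;
}

console.log(tokenize("function add(a, b) { return a + b; }"));
```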

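To make the Ignition step concrete, d8 can print the bytecode it generates; --print-bytecode and --print-bytecode-filter are real flags, and the function below is just an example:

```js
// run as: d8 --print-bytecode --print-bytecode-filter=square square.js
function square(x) {
  return x * x;
}
square(7); // the first call triggers compilation, so the bytecode gets printed
```

The dump shows register-machine bytecodes (e.g. Ldar, Mul, Return) operating on an accumulator and a small set of registers, matching the model described above.
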
Supporting Components

  • Profiler

    • Monitors code at runtime.
    • Detects “hot code” (frequently executed code).
    • Sends it to the appropriate compiler for optimization.
  • Deoptimizer

    • If assumptions made by the JIT compiler turn out to be wrong at runtime, it discards the optimized code and falls back to the interpreter (Ignition bytecode).
  • Garbage Collector (GC)

    • Frees unused memory.

Uses Generational GC:

  • NewSpace: For new objects.
  • OldSpace: For objects that survive multiple GC cycles.

Two GC types:

  • Scavenger: Cleans NewSpace.
  • Mark-compact: Cleans OldSpace.

GC Interaction

Use --expose-gc with d8 to access the GC manually. Trigger:

  • gc({type:'minor'}) for the Scavenger (minor GC).
  • gc() for a full Mark-compact (major GC).
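
A minimal d8 session tying this together; --expose-gc and --trace-gc are real flags, while the allocation pattern is arbitrary:

```js
// run as: d8 --expose-gc --trace-gc alloc.js
// --trace-gc logs each collection, so you can see which collector ran.
const survivors = [];
for (let i = 0; i < 100000; i++) {
  const tmp = { n: i };                      // short-lived: allocated in NewSpace
  if (i % 1000 === 0) survivors.push(tmp);   // a few objects stay reachable
}

gc({ type: "minor" }); // Scavenger: minor GC of NewSpace
gc();                  // Mark-compact: major GC, also collects OldSpace
```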

Here’s a visual of the whole flow:

```mermaid
graph TD
  A[JS Code] --> B[Lexer]
  B --> C[Parser]
  C --> D[AST]
  D --> E[Ignition]
  E --> F[Bytecode]
  F --> G[Sparkplug]
  G --> H[Non-Optimized Machine Code]
  H --> I[Maglev]
  I --> J[Optimized Machine Code]
  J --> K[Turbofan]
  K --> L[Highly Optimized Machine Code]
  subgraph Support Components
    M[Profiler] --> I
    M --> K
    N[Deoptimizer] --> E
  end
  subgraph Garbage Collection
    O[GC]
    O --> P[NewSpace]
    O --> Q[OldSpace]
  end
```

Useful References