add internal architecture docs

2025-02-06 11:02:01 +00:00 · 2025-02-04 09:17:43 +01:00 · 2025-02-04 09:17:43 +01:00 · 7af9c93f5b
commit 7af9c93f5b
parent f5b7a12d84
1 changed files with 446 additions and 0 deletions
--- a/docs/extra/internal.md
+++ b/docs/extra/internal.md
@@ -0,0 +1,446 @@
+---
+title: 🏗️ Internal Architecture
+description: >-
+  Learn about the internal architecture of Fiber, including the overall structure, request handling flow, routing, and path parsing.
+sidebar_position: 3
+---
+## Overall Architecture
+
+At the heart of Fiber is the **App** struct. It is responsible for configuring the server, managing a pool of Contexts (either our default implementation, **DefaultCtx**, or a user‑supplied **CustomCtx**), and holding the router stack with all registered routes and groups. In addition, the App contains mount fields to support sub‑applications and hooks that allow developers to run custom code at key stages (e.g. when registering routes or starting the server).
+
+```mermaid
+flowchart TD
+    A[App]
+    B["Configuration (Config)"]
+    C[Context Pool]
+    D["DefaultCtx \/ CustomCtx"]
+    E[Router Stack]
+    F["Groups & Routes"]
+    G["MountFields (Sub‑Apps)"]
+    H[Hooks]
+
+    A --> B
+    A --> C
+    C --> D
+    A --> E
+    E --> F
+    A --> G
+    A --> H
+```
+
+### Explanation
+
+- App: The central object that bootstraps and runs the Fiber server.
+- Configuration (Config): Contains settings for body limits, timeouts, TLS options, routing behavior (e.g. case‑sensitivity, strict routing), and more.
+- Context Pool: A synchronized pool from which Contexts are acquired per request. This design minimizes allocations by recycling DefaultCtx (or CustomCtx) instances.
+- Router Stack: Organizes all registered routes. It is later processed into a tree structure for fast route‑matching.
+- MountFields: Support for mounting sub‑applications so that large APIs can be segmented into independent routers.
+- Hooks: Allow for custom behavior at critical points (e.g., on route registration, route naming, on listen, on shutdown, etc.).
+
+## Request Processing Flow
+
+Fiber’s request processing is designed for performance and minimal overhead. When an HTTP request is received by the underlying fasthttp server, the flow is as follows:
+
+1. Request Arrival: The fasthttp server receives the HTTP request.
+2. Context Acquisition: The App calls AcquireCtx() to fetch a Context from the pool.
+3. Context Reset: The acquired Context is reset (via DefaultCtx.Reset()) with the new request’s data.
+4. Request Handling: The request handler (default or custom) is invoked.
+5. Route Matching: The framework uses the next() (or nextCustom()) function to traverse the pre‑built route tree and find a matching route based on the URL and HTTP method.
+6. Middleware Chain Execution: The matched route’s handler chain is executed in sequence.
+7. Error Handling (if required): Any errors encountered trigger the registered error handler.
+8. Response Generation: The response is sent back to the client.
+9. Context Release: Finally, the Context is cleaned up and returned to the pool.
+
+```mermaid
+flowchart LR
+    R["HTTP Request (fasthttp)"]
+    A["App.RequestHandler<br/>(default or custom)"]
+    C["Acquire Context<br/>(from Pool)"]
+    X["Reset Context<br/>(DefaultCtx.Reset())"]
+    N["Route Matching<br/>(next() \/ nextCustom())"]
+    M["Handler Chain Execution"]
+    EH["Error Handling<br/>(if needed)"]
+    S["HTTP Response"]
+    RC["Release Context<br/>(to Pool)"]
+
+    R --> A
+    A --> C
+    C --> X
+    X --> N
+    N --> M
+    M --> EH
+    EH --> S
+    S --> RC
+```
+
+### Additional Note
+
+Fiber minimizes memory allocations by reusing Context objects and uses an optimized route‑matching algorithm to rapidly determine the correct handler chain.
+
+## Routing & Path Parsing
+
+Fiber allows you to register routes using helper methods (e.g. Get(), Post()) or by creating groups and sub‑routers. Internally, the route pattern is parsed by the parseRoute() function. This function decomposes the route string into segments:
+
+- Constant Segments: Fixed parts of the path (e.g. /api).
+- Parameter Segments: Dynamic parts that begin with a colon. For example, a route may be defined as `/api/:userId<int>`. Here, the segment `:userId<int>` is a parameter segment with a type constraint (an integer).
+- Constraints: Constraints (such as int, bool, datetime, or even regular expressions) are extracted from the parameter part and stored in the route’s metadata for validation at runtime.
+
+```mermaid
+flowchart TD
+    P["Route Pattern String<br/>(e.g., '/api/\\:userId\\&lt;int&gt;')"]
+    PA["parseRoute()"]
+    RP[routeParser]
+    RS["routeSegment(s)"]
+    C["Constraints<br/>(e.g., int, datetime, regex)"]
+    PARAM[Extracted Parameter Names]
+
+    P --> PA
+    PA --> RP
+    RP --> RS
+    RS --> C
+    RP --> PARAM
+```
+
+### Explanation
+
+- parseRoute(): Takes a route string and returns a routeParser struct that includes a list of routeSegment objects.
+- routeSegment: Represents a portion of the route. If it is a parameter segment, it may include constraints that determine the allowed format (for example, ensuring that a parameter is an integer).
+- Extracted Parameter Names: These are later used to populate the request’s Context with the actual values parsed from the URL.
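+
+The decomposition into constant and parameter segments can be sketched with the standard library alone. This is a simplified illustration, not Fiber's parseRoute(): the `segment` type and `parsePattern` function are hypothetical names, and wildcards, optional parameters, and escaping are ignored.
+
+```go
+package main
+
+import (
+	"fmt"
+	"strings"
+)
+
+// segment is a hypothetical, simplified stand-in for a route segment.
+type segment struct {
+	Const      string // literal text for constant segments
+	Param      string // parameter name for parameter segments
+	Constraint string // raw constraint text, e.g. "int"; empty if none
+}
+
+// parsePattern splits a pattern on '/' and classifies each part as a
+// constant segment or a parameter segment with an optional <constraint>.
+func parsePattern(pattern string) []segment {
+	var segs []segment
+	for _, part := range strings.Split(strings.Trim(pattern, "/"), "/") {
+		if !strings.HasPrefix(part, ":") {
+			segs = append(segs, segment{Const: part})
+			continue
+		}
+		name, constraint := part[1:], ""
+		if open := strings.IndexByte(name, '<'); open >= 0 && strings.HasSuffix(name, ">") {
+			constraint = name[open+1 : len(name)-1]
+			name = name[:open]
+		}
+		segs = append(segs, segment{Param: name, Constraint: constraint})
+	}
+	return segs
+}
+
+func main() {
+	for _, s := range parsePattern("/api/:userId<int>") {
+		fmt.Printf("%+v\n", s)
+	}
+}
+```
+
+Keeping the constraint as raw text at parse time leaves validation to the matching step, which mirrors the split between parsing and runtime validation described above.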
+
+## Route Matching and Parameter Extraction
+
+When a request is processed, Fiber uses its pre‑computed route tree (the treeStack) to efficiently match the incoming URL against registered routes.
+
+1. Normalization: The URL is normalized into a “detection path”: it is lowercased unless case‑sensitive routing is enabled, and trailing slashes are trimmed unless strict routing is enabled.
+2. Tree Traversal: The route tree, grouped by common prefixes, is traversed based on the HTTP method.
+3. Matching: Constant segments are compared exactly, while parameter segments extract dynamic values.
+4. Constraint Validation: Extracted parameter values are validated against any defined constraints.
+
+```mermaid
+flowchart TD
+    A["Incoming Request URL<br/>(e.g., '/api/john')"]
+    B["Normalize URL<br/>(lowercase, trim trailing slashes)"]
+    C["Detection Path"]
+    D["Traverse Route Tree<br/>(treeStack based on method)"]
+    E["Match Constant Segments"]
+    F["Identify Parameter Segments<br/>(e.g., ':userId')"]
+    G["Extract Parameter Values"]
+    H["Validate Constraints<br/>(e.g., 'int', 'datetime', 'regex')"]
+    I["Route Found"]
+
+    A --> B
+    B --> C
+    C --> D
+    D --> E
+    E --> F
+    F --> G
+    G --> H
+    H --> I
+```
+
+### Insight
+
+This efficient matching mechanism leverages pre‑grouped routes to minimize comparisons, while dynamic segments allow for flexible URL structures and runtime validation.
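+
+As a rough illustration of the four steps, the sketch below normalizes a path, matches the constant prefix, extracts one parameter, and validates it as an integer. It assumes default, case‑insensitive and non‑strict routing, hard‑codes the example route `/api/:userId<int>`, and uses the hypothetical helpers `detectionPath` and `matchUser`; Fiber's real matcher works on the pre‑built tree instead.
+
+```go
+package main
+
+import (
+	"fmt"
+	"strconv"
+	"strings"
+)
+
+// detectionPath lowercases the path and trims one trailing slash.
+func detectionPath(path string) string {
+	path = strings.ToLower(path)
+	if len(path) > 1 && strings.HasSuffix(path, "/") {
+		path = path[:len(path)-1]
+	}
+	return path
+}
+
+// matchUser matches the hypothetical route /api/:userId<int>.
+// It returns the extracted ID and whether the route matched.
+func matchUser(path string) (userID int, ok bool) {
+	// Constant segment: must match exactly.
+	rest, found := strings.CutPrefix(detectionPath(path), "/api/")
+	if !found || rest == "" || strings.Contains(rest, "/") {
+		return 0, false
+	}
+	// Parameter segment: extract the value, then validate the int constraint.
+	id, err := strconv.Atoi(rest)
+	if err != nil {
+		return 0, false
+	}
+	return id, true
+}
+
+func main() {
+	fmt.Println(matchUser("/API/42/"))
+	fmt.Println(matchUser("/api/john"))
+}
+```
+
+With the constraint in place, `/api/john` does not match this route at all, so a later route without the constraint could still handle it.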
+
+## Middleware Chain Execution
+
+Once a matching route is found, Fiber executes the chain of middleware and route handlers sequentially. The process is as follows:
+
+1. Initial Handler Execution: The first handler of the matched route is invoked.
+2. Calling Next(): Each handler calls Ctx.Next() to pass control to the next handler in the chain.
+3. Termination: When no further handlers remain, the chain terminates and the response is sent.
+
+```mermaid
+flowchart TD
+    A[Matched Route]
+    B[Handler 1]
+    C[Handler 2]
+    D[Handler 3]
+    E[Response Generation]
+
+    A --> B
+    B -- "Calls C via Next()" --> C
+    C -- "Calls D via Next()" --> D
+    D -- "No Next() available" --> E
+```
+
+### Explanation
+
+- Each handler in the chain can perform operations (e.g. authentication, logging, transformation) before calling Next() to forward control.
+- This sequential processing ensures that middleware are executed in the order they were registered.
+- If a handler returns without calling Next(), the remaining handlers are skipped and the chain ends early. If a handler returns an error, the registered error handler is invoked.
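+
+The chain mechanics can be modelled with an index into a handler slice. The sketch below is a simplified stand‑in, not Fiber's implementation: `Ctx`, `Handler`, and `run` are hypothetical types and helpers that only imitate the behaviour of Next().
+
+```go
+package main
+
+import "fmt"
+
+// Handler and Ctx are simplified, hypothetical stand-ins for Fiber's types.
+type Handler func(*Ctx) error
+
+type Ctx struct {
+	handlers []Handler
+	index    int
+	Trace    []string // records which handlers ran, for demonstration
+}
+
+// Next runs the next handler in the chain, if there is one.
+func (c *Ctx) Next() error {
+	c.index++
+	if c.index < len(c.handlers) {
+		return c.handlers[c.index](c)
+	}
+	return nil
+}
+
+// run executes the chain starting at the first handler.
+func run(handlers ...Handler) ([]string, error) {
+	c := &Ctx{handlers: handlers, index: -1}
+	err := c.Next()
+	return c.Trace, err
+}
+
+func main() {
+	logger := func(c *Ctx) error {
+		c.Trace = append(c.Trace, "logger")
+		return c.Next()
+	}
+	auth := func(c *Ctx) error {
+		c.Trace = append(c.Trace, "auth")
+		return c.Next()
+	}
+	final := func(c *Ctx) error {
+		c.Trace = append(c.Trace, "final")
+		return nil // no Next(): the chain ends here
+	}
+	trace, err := run(logger, auth, final)
+	fmt.Println(trace, err)
+}
+```
+
+Because each handler decides whether to call Next(), a handler such as `auth` could return an error instead and prevent `final` from running.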
+
+### Observations
+
+Because handlers run in registration order, middleware that must run first (for example, authentication or logging) needs to be registered before the handlers that depend on it.
+
+## Sub-Application Mounting & Grouping
+
+Fiber allows mounting sub‑applications (or sub‑routers) under specific path prefixes. This enables modular design of large APIs. The mounting process works as follows:
+
+1. Defining a Mount Point: A parent application calls `App.Mount()` or a Group calls its own `mount()` method.
+2. Merging Mount Fields: The sub‑app’s mount fields are updated with the prefix of the parent, and its routes are integrated into the parent’s routing structure.
+3. Processing Sub‑App Routes: During startup, the parent app collects routes from mounted sub‑apps and builds a unified route tree.
+
+```mermaid
+flowchart TD
+    A[Parent App]
+    B["Sub-App (Mounted)"]
+    C["Define Mount Point<br/>(e.g. \'/admin\')"]
+    D["Update MountFields<br/>(assign mount path)"]
+    E["Merge Sub-App Routes<br/>(append to Router Stack)"]
+    F[Generate Unified Route Tree]
+
+    A --> C
+    C --> B
+    B --> D
+    D --> E
+    E --> F
+```
+
+### Impact
+
+This mechanism enables large APIs to be broken down into smaller, maintainable modules while still benefiting from Fiber’s optimized routing and request handling.
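+
+The effect of merging a sub‑app's routes under a mount prefix can be illustrated with plain string handling. This sketch only shows how paths combine; `mountRoutes` is a hypothetical helper, and it does not model mount fields, hooks, or handler merging.
+
+```go
+package main
+
+import (
+	"fmt"
+	"strings"
+)
+
+// mountRoutes returns the sub-app's paths as they would appear once
+// mounted under prefix, without duplicate or trailing slashes.
+func mountRoutes(prefix string, subRoutes []string) []string {
+	p := strings.Trim(prefix, "/")
+	merged := make([]string, 0, len(subRoutes))
+	for _, r := range subRoutes {
+		full := "/" + strings.Trim(p+"/"+strings.Trim(r, "/"), "/")
+		merged = append(merged, full)
+	}
+	return merged
+}
+
+func main() {
+	fmt.Println(mountRoutes("/admin", []string{"/", "/users", "/users/:id"}))
+}
+```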
+
+## Route Tree Building
+
+Fiber builds a route tree (the treeStack) to optimize route matching. Routes are grouped by a short prefix of their path (for example, the first 3 characters) so that a request is compared only against the routes in its group.
+
+1. Iterating Over the Router Stack: Each registered route is examined.
+2. Computing the Tree Key: A key is computed from the route’s normalized path (e.g. the first 3 characters).
+3. Grouping Routes: Routes are added to the appropriate branch of the tree.
+4. Sorting: Within each group, routes are sorted based on their registration order (or position) to ensure the correct match is found.
+
+```mermaid
+flowchart TD
+    A["Router Stack<br/>(All Registered Routes)"]
+    B["Compute Tree Key<br/>(e.g. first 3 characters)"]
+    C["Group Routes by Key<br/>(treeStack)"]
+    D["Merge Global Routes<br/>(key \'\' for global matches)"]
+    E[Sort Routes within Groups]
+    F[Optimized Route Tree]
+
+    A --> B
+    B --> C
+    C --> D
+    D --> E
+    E --> F
+```
+
+### Explanation
+
+- Building a route tree is an optimization step that reduces the matching overhead by limiting the search space to a subset of routes that share a common prefix.
+- The tree is rebuilt whenever new routes are registered, ensuring that the latest routing configuration is always used for matching.
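+
+The grouping steps can be sketched as follows, under the assumption that the key is the first 3 characters of the path and that shorter paths fall into a global group with the empty key. The `route` type and the `treeKey` and `buildTree` functions are hypothetical and ignore HTTP methods.
+
+```go
+package main
+
+import (
+	"fmt"
+	"sort"
+)
+
+// route is a hypothetical, minimal route record.
+type route struct {
+	Path string
+	Pos  int // registration order
+}
+
+// treeKey returns the first three characters of the path, or "" for
+// shorter paths, which are treated as global here.
+func treeKey(path string) string {
+	if len(path) >= 3 {
+		return path[:3]
+	}
+	return ""
+}
+
+// buildTree groups routes by key, merges the global routes into every
+// keyed group, and sorts each group by registration order.
+func buildTree(paths []string) map[string][]route {
+	tree := make(map[string][]route)
+	for i, p := range paths {
+		k := treeKey(p)
+		tree[k] = append(tree[k], route{Path: p, Pos: i})
+	}
+	global := tree[""]
+	for k, bucket := range tree {
+		if k == "" {
+			continue
+		}
+		bucket = append(bucket, global...)
+		sort.Slice(bucket, func(a, b int) bool { return bucket[a].Pos < bucket[b].Pos })
+		tree[k] = bucket
+	}
+	return tree
+}
+
+func main() {
+	tree := buildTree([]string{"/", "/api/users", "/api/orders", "/about"})
+	fmt.Println(tree["/ap"])
+}
+```
+
+A lookup then computes `treeKey` for the request path and scans only that group, falling back to the global group when no keyed group exists.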
+
+## Context Lifecycle Management
+
+Fiber minimizes allocations by pooling Context objects. The lifecycle of a Context is as follows:
+
+1. **Acquisition:** When a new HTTP request arrives, a Context is retrieved from the pool via `App.AcquireCtx()`.
+2. **Reset:** The acquired Context is reset with the current `fasthttp.RequestCtx` to clear previous data and initialize new request‑specific values.
+3. **Processing:** The Context is passed along the middleware and handler chain.
+4. **Release:** After processing the request (or when an error occurs), the Context is released back to the pool via `App.ReleaseCtx()`, making it available for reuse.
+
+```mermaid
+flowchart TD
+    A["HTTP Request<br/>(fasthttp)"]
+    B["Acquire Context<br/>(App.AcquireCtx())"]
+    C["Reset Context<br/>(DefaultCtx.Reset())"]
+    D["Process Request<br/>(Handlers & Middleware)"]
+    E["Error Handling<br/>(if needed)"]
+    F["Release Context<br/>(App.ReleaseCtx())"]
+
+    A --> B
+    B --> C
+    C --> D
+    D --> E
+    E --> F
+```
+
+### Key Benefit
+
+Reusing Context objects significantly reduces garbage collection overhead, ensuring Fiber remains fast and memory‑efficient even under heavy load.
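+
+The acquire, reset, process, release cycle maps directly onto `sync.Pool` from the Go standard library. The sketch below is a minimal model of that cycle; `reqCtx`, `ctxPool`, and `serve` are hypothetical names, not Fiber's DefaultCtx or its pool.
+
+```go
+package main
+
+import (
+	"fmt"
+	"sync"
+)
+
+// reqCtx is a hypothetical, minimal per-request context.
+type reqCtx struct {
+	path   string
+	values map[string]string
+}
+
+// reset clears data left over from the previous request.
+func (c *reqCtx) reset(path string) {
+	c.path = path
+	for k := range c.values {
+		delete(c.values, k)
+	}
+}
+
+var ctxPool = sync.Pool{
+	New: func() any { return &reqCtx{values: make(map[string]string)} },
+}
+
+// serve acquires a context, resets it, runs the handler, and releases
+// the context back to the pool even if the handler returns early.
+func serve(path string, handler func(*reqCtx) string) string {
+	c := ctxPool.Get().(*reqCtx)
+	defer ctxPool.Put(c)
+	c.reset(path)
+	return handler(c)
+}
+
+func main() {
+	out := serve("/hello", func(c *reqCtx) string {
+		c.values["user"] = "john"
+		return "handled " + c.path
+	})
+	fmt.Println(out)
+}
+```
+
+Resetting on acquisition rather than relying on a clean release means a recycled context never leaks values from an earlier request into the next one.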
+
+## Preforking Mechanism
+
+To take full advantage of multi‑core systems, Fiber offers a prefork mode. In this mode, the master process spawns several child processes that listen on the same port using OS features such as SO_REUSEPORT (or a fallback to SO_REUSEADDR).
+
+```mermaid
+flowchart LR
+    M["Master Process (App)"]
+    C[Child Processes]
+    GOMAX["Set GOMAXPROCS(1)"]
+    REQ[Handle HTTP Requests]
+    WM["watchMaster()"]
+
+    M -->|Spawns| C
+    C --> GOMAX
+    C -->|Processes| REQ
+    C --> WM
+```
+
+### Explanation
+
+- Master Process: The main process determines the number of available CPU cores and spawns that many child processes.
+- Child Processes: Each child sets GOMAXPROCS(1) to run on a single CPU core and listens on the shared port.
+- watchMaster(): Each child process runs a watchdog routine to monitor the master process; if the master exits (on Unix‑like systems, the child detects this when its own parent process ID becomes 1), the child terminates gracefully.
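+
+The roles of master and child can be outlined with a small sketch. It deliberately omits the actual process spawning and port binding, and the environment variable name `EXAMPLE_PREFORK_CHILD` as well as the helpers `isChild` and `masterGone` are assumptions made for illustration only.
+
+```go
+package main
+
+import (
+	"fmt"
+	"os"
+	"runtime"
+)
+
+// childEnvKey is a hypothetical marker the master would set on children.
+const childEnvKey = "EXAMPLE_PREFORK_CHILD"
+
+// isChild reports whether this process was started as a prefork child.
+func isChild() bool { return os.Getenv(childEnvKey) == "1" }
+
+// masterGone reports whether a child has been orphaned. On Unix-like
+// systems an orphaned process is typically re-parented to PID 1.
+func masterGone(ppid int) bool { return ppid == 1 }
+
+func main() {
+	if isChild() {
+		// Each child restricts itself to a single OS thread for Go code.
+		runtime.GOMAXPROCS(1)
+		fmt.Println("child: master gone =", masterGone(os.Getppid()))
+		return
+	}
+	// The master would start one child per core here (spawning omitted).
+	fmt.Println("master: would spawn", runtime.NumCPU(), "children")
+}
+```
+
+A real watchdog would repeat the `masterGone` check on a timer and exit once it returns true.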
+
+### Detailed Preforking Workflow
+
+Fiber’s prefork mode uses OS‑level mechanisms to allow multiple processes to listen on the same port. Here’s a more detailed look:
+
+1. Master Process Spawning: The master process detects the number of CPU cores and spawns that many child processes.
+2. Child Process Initialization: Each child process sets GOMAXPROCS(1) so that it runs on a single core.
+3. Binding to Port: Child processes use packages like reuseport to bind to the same address and port.
+4. Parent Monitoring: Each child runs a watchdog function (watchMaster()) to monitor the master process; if the master terminates, children exit.
+5. Request Handling: Each child independently handles incoming HTTP requests.
+
+```mermaid
+flowchart TD
+    A[Master Process]
+    B[Determine CPU Cores]
+    C[Spawn Child Processes]
+    D["Child Process Initialization<br/>(GOMAXPROCS(1))"]
+    E["Bind to Port<br/>(reuseport)"]
+    F["Run watchMaster()<br/>(Monitor Parent)"]
+    G[Handle HTTP Requests]
+
+    A --> B
+    B --> C
+    C --> D
+    D --> E
+    E --> F
+    F --> G
+```
+
+#### Explanation
+
+- Preforking improves performance by allowing multiple processes to handle requests concurrently.
+- Using reuseport (or a fallback) ensures that all child processes can listen on the same port without conflicts.
+- The watchdog routine in each child ensures that it exits once the master process is no longer running, so no orphaned children are left holding the port.
+
+## Redirection & Flash Messages
+
+Fiber’s redirection mechanism is implemented via the Redirect struct. This structure allows not only setting a new location for redirection but also passing along flash messages and old input data via a special cookie.
+
+```mermaid
+flowchart LR
+    R[Redirect Struct]
+    RP[redirectPool]
+    FM["Flash Messages \/ Old Inputs"]
+    M["Methods:<br/>To(), Route(), Back()"]
+    LH[Set Location Header]
+    CK["Flash Cookie<br/>(fiber\_flash)"]
+
+    R -->|Acquired from| RP
+    R --> FM
+    R --> M
+    M --> LH
+    FM -->|Serialized| CK
+```
+
+### Explanation
+
+- Redirect Struct: Retrieved from a pool (to minimize allocations), it stores redirection settings such as the HTTP status code (defaulting to 302) and any flash messages.
+- Flash Messages & Old Inputs: These are collected via methods like With() or WithInput() and then serialized and stored in a cookie named fiber_flash.
+- Redirection Methods: The To(), Route(), and Back() methods determine the target URL and set the Location header accordingly.
+
+### Flash Message Handling in Redirection
+
+When performing redirections, Fiber can send flash messages or preserve old input data. This process involves:
+
+1. Collecting Flash Data: When a redirect is initiated, developers can add flash messages via Redirect.With() or old input data via Redirect.WithInput().
+2. Serialization: The flash messages and input data are serialized (using a fast marshalling method) into a byte sequence.
+3. Setting a Cookie: The serialized data is stored in a special cookie (named fiber_flash) that will be sent to the client.
+4. Retrieval & Clearing: On the subsequent request, the flash data is read from the cookie, deserialized, and then cleared.
+
+```mermaid
+flowchart TD
+    A[Initiate Redirect]
+    B["Add Flash Messages<br/>(With(), WithInput())"]
+    C[Serialize Flash Data]
+    D["Set Flash Cookie<br/>(\'fiber\_flash\')"]
+    E[Client Receives Redirect]
+    F[Next Request Reads Flash Cookie]
+    G["Deserialize & Clear Flash Data"]
+
+    A --> B
+    B --> C
+    C --> D
+    D --> E
+    E --> F
+    F --> G
+```
+
+#### Explanation
+
+- Flash messages provide a way to pass transient data (such as notifications or error messages) to the next request after a redirect.
+- The data is stored temporarily in a cookie, which is then read and cleared upon processing the next request.
+- This mechanism is essential for implementing post‑redirect‑get patterns and ensuring a smooth user experience.
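+
+The round trip through a cookie value can be sketched as below. Note that this swaps in `encoding/json` plus URL‑safe base64 purely for illustration; it is not the serialization Fiber uses, and `flashData`, `encodeFlash`, and `decodeFlash` are hypothetical names.
+
+```go
+package main
+
+import (
+	"encoding/base64"
+	"encoding/json"
+	"fmt"
+)
+
+// flashData is a hypothetical container for flash messages and old input.
+type flashData struct {
+	Messages map[string]string `json:"messages"`
+	OldInput map[string]string `json:"old_input"`
+}
+
+// encodeFlash serializes the data into a cookie-safe string.
+func encodeFlash(d flashData) (string, error) {
+	raw, err := json.Marshal(d)
+	if err != nil {
+		return "", err
+	}
+	return base64.RawURLEncoding.EncodeToString(raw), nil
+}
+
+// decodeFlash restores the data from a cookie value on the next request.
+func decodeFlash(cookie string) (flashData, error) {
+	var d flashData
+	raw, err := base64.RawURLEncoding.DecodeString(cookie)
+	if err != nil {
+		return d, err
+	}
+	err = json.Unmarshal(raw, &d)
+	return d, err
+}
+
+func main() {
+	cookie, _ := encodeFlash(flashData{
+		Messages: map[string]string{"status": "Profile saved"},
+		OldInput: map[string]string{"email": "john@example.com"},
+	})
+	restored, err := decodeFlash(cookie)
+	fmt.Println(restored.Messages["status"], err)
+}
+```
+
+After decoding, the server would expire the cookie so the message is shown only once.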
+
+## Hooks, Error Handling & Context Lifecycle
+
+### Hooks
+
+Fiber provides a comprehensive hook system that allows you to run custom functions at key moments:
+
+- OnRoute: Called when a route is registered.
+- OnName: Invoked when a route is assigned a name.
+- OnGroup: Triggered when a group is created.
+- OnListen: Runs when the server starts listening.
+- OnShutdown: Called during graceful shutdown.
+- OnFork: Invoked when a child process is forked.
+- OnMount: Used when a sub‑application is mounted.
+
+```mermaid
+flowchart TD
+    H[Hooks]
+    OR[OnRoute]
+    ON[OnName]
+    OG[OnGroup]
+    OL[OnListen]
+    OS[OnShutdown]
+    OF[OnFork]
+    OM[OnMount]
+
+    H --> OR
+    H --> ON
+    H --> OG
+    H --> OL
+    H --> OS
+    H --> OF
+    H --> OM
+```
+
+#### Explanation
+
+- Hooks provide extension points for developers and maintainers to inject custom logic without modifying the core Fiber code.
+- They are executed at various stages (for example, every time a new route is registered, the OnRoute hooks are executed to allow for logging, validation, or transformation of the route).
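+
+A hook system of this kind amounts to slices of callbacks that the framework runs at fixed points. The sketch below models only the OnRoute case; the `Route`, `OnRouteHandler`, and `hooks` types and the `executeOnRoute` method are simplified, hypothetical versions and do not reproduce Fiber's actual signatures.
+
+```go
+package main
+
+import "fmt"
+
+// Route is a hypothetical, minimal route description.
+type Route struct{ Method, Path string }
+
+// OnRouteHandler is called whenever a route is registered.
+type OnRouteHandler func(Route) error
+
+// hooks stores the registered callbacks.
+type hooks struct{ onRoute []OnRouteHandler }
+
+// OnRoute registers one or more callbacks.
+func (h *hooks) OnRoute(fn ...OnRouteHandler) {
+	h.onRoute = append(h.onRoute, fn...)
+}
+
+// executeOnRoute runs the callbacks in registration order and stops
+// at the first error.
+func (h *hooks) executeOnRoute(r Route) error {
+	for _, fn := range h.onRoute {
+		if err := fn(r); err != nil {
+			return err
+		}
+	}
+	return nil
+}
+
+func main() {
+	var h hooks
+	h.OnRoute(func(r Route) error {
+		fmt.Println("registered", r.Method, r.Path)
+		return nil
+	})
+	if err := h.executeOnRoute(Route{Method: "GET", Path: "/api"}); err != nil {
+		fmt.Println("hook failed:", err)
+	}
+}
+```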
+
+### Error Handling & Context Lifecycle
+
+Fiber’s DefaultCtx (or CustomCtx) represents the per‑request context. The lifecycle is as follows:
+
+- Acquire: A Context is obtained from the pool at the beginning of a request.
+- Processing: The context is passed along to the route handlers and middleware.
+- Error Handling: If an error occurs (e.g., route not found, method not allowed, or a panic in the handler), Fiber calls the registered error handler. Errors such as ErrMethodNotAllowed or a not‑found error carrying StatusNotFound are generated as needed.
+- Release: Once the request is processed, the Context is released back into the pool for reuse.
+
+```mermaid
+flowchart LR
+    AC["Acquire Context<br/>(from Pool)"]
+    HP["Handle Request<br/>(Handlers & Middleware)"]
+    EH["Error Handling<br/>(if needed)"]
+    RC["Release Context<br/>(to Pool)"]
+
+    AC --> HP
+    HP --> EH
+    EH --> RC
+```
+
+#### Explanation
+
+- This lifecycle ensures that Fiber minimizes allocations by reusing Context objects.
+- Errors are propagated and handled consistently, and the context is properly reset after every request.