# Fiber Stream Example
This repository demonstrates a simple implementation of a streaming chat application using the Fiber web framework, built on top of FastHTTP, and the OpenAI API.
## Overview
The application serves a web page where users can initiate a chat with an AI assistant. It uses Fiber to handle HTTP requests and streams responses from the OpenAI API to the client.
> [!NOTE]
> This example requires further improvements, such as reducing latency, before it is used in production.
## Features
- HTML Template Rendering: Uses Fiber's HTML template engine to render the interface (see the setup sketch after this list).
- Streaming API Integration: Connects to the OpenAI API to stream chat responses.
- FastHTTP Performance: Leverages FastHTTP for high-performance HTTP handling.
- Sonic JSON Optimization: Utilizes Sonic for efficient JSON encoding and decoding, reducing latency and improving performance.
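For illustration, here is a minimal sketch of how the template engine and a basic route might be wired up with Fiber v2. The `frontend/views` directory matches the project structure below, while the template name, bind data, and port are assumptions rather than the repository's actual values.

```go
package main

import (
	"log"

	"github.com/gofiber/fiber/v2"
	"github.com/gofiber/template/html/v2"
)

func main() {
	// Load HTML templates from the frontend/views directory (see Project Structure).
	engine := html.New("./frontend/views", ".html")

	app := fiber.New(fiber.Config{
		Views: engine,
	})

	// Render the chat page; the "index" template name is an assumption.
	app.Get("/", func(c *fiber.Ctx) error {
		return c.Render("index", fiber.Map{
			"Title": "Fiber Stream Example",
		})
	})

	log.Fatal(app.Listen(":3000"))
}
```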
## Project Structure
- `cmd/server/run.go`: Entry point of the application; sets up routes and handles the server lifecycle.
- `frontend/views/`: Contains HTML templates for rendering the web interface.
- `ai/openai/`: Contains the client implementation for interacting with the OpenAI API, including optimized JSON handling with Sonic.
## Key Functions
- `StreamChatCompletion`: Handles requests to the `/stream` endpoint, streaming chat responses from the OpenAI API to the client (see the sketch after this list).
- Graceful Shutdown: Listens for OS signals to gracefully shut down the server, ensuring all ongoing requests are completed.
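The snippet below is a simplified sketch of what these two pieces can look like with Fiber v2 and FastHTTP's stream writer. The `fetchChunks` helper is only a stand-in for the real OpenAI client in `ai/openai/`, and the actual code in `cmd/server/run.go` may be organized differently.

```go
package main

import (
	"bufio"
	"fmt"
	"log"
	"os"
	"os/signal"
	"syscall"

	"github.com/gofiber/fiber/v2"
	"github.com/gofiber/fiber/v2/utils"
	"github.com/valyala/fasthttp"
)

// fetchChunks is a placeholder for the real streaming OpenAI client in ai/openai/.
func fetchChunks(prompt string) <-chan string {
	ch := make(chan string)
	go func() {
		defer close(ch)
		ch <- "Hello"
		ch <- ", world! (" + prompt + ")"
	}()
	return ch
}

// StreamChatCompletion streams chat chunks to the client as Server-Sent Events.
func StreamChatCompletion(c *fiber.Ctx) error {
	// Copy the query value: Fiber reuses its buffers after the handler returns,
	// and the stream writer below runs later, while the response is being written.
	prompt := utils.CopyString(c.Query("prompt"))

	c.Set(fiber.HeaderContentType, "text/event-stream")
	c.Set(fiber.HeaderCacheControl, "no-cache")
	c.Set(fiber.HeaderConnection, "keep-alive")

	c.Context().SetBodyStreamWriter(fasthttp.StreamWriter(func(w *bufio.Writer) {
		for chunk := range fetchChunks(prompt) {
			fmt.Fprintf(w, "data: %s\n\n", chunk)
			if err := w.Flush(); err != nil {
				return // client disconnected
			}
		}
	}))
	return nil
}

func main() {
	app := fiber.New()
	app.Get("/stream", StreamChatCompletion)

	// Graceful shutdown: wait for SIGINT/SIGTERM, then let in-flight requests finish.
	go func() {
		quit := make(chan os.Signal, 1)
		signal.Notify(quit, syscall.SIGINT, syscall.SIGTERM)
		<-quit
		if err := app.Shutdown(); err != nil {
			log.Printf("shutdown error: %v", err)
		}
	}()

	if err := app.Listen(":3000"); err != nil {
		log.Fatal(err)
	}
}
```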
## Sonic JSON Integration
- Performance Enhancement: Sonic is used for JSON encoding and decoding, which enhances performance by utilizing a pool of decoders and encoders.
- Stream Handling: Sonic's `StreamDecoder` and `StreamEncoder` allow efficient processing of streaming data, minimizing overhead (a usage sketch follows this list).
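As a rough sketch of the idea (not the repository's actual types or wiring), a streamed upstream body can be decoded chunk by chunk and re-encoded to the client as it arrives. The `chatChunk` struct and function names below are assumptions for illustration only.

```go
package openai

import (
	"io"

	"github.com/bytedance/sonic/decoder"
	"github.com/bytedance/sonic/encoder"
)

// chatChunk is an illustrative shape for a streamed completion delta;
// the real type in ai/openai/ may differ.
type chatChunk struct {
	Content string `json:"content"`
}

// relayChunks decodes JSON chunks from the upstream stream and re-encodes
// them to the client writer as they arrive, avoiding buffering the whole body.
func relayChunks(upstream io.Reader, client io.Writer) error {
	dec := decoder.NewStreamDecoder(upstream)
	enc := encoder.NewStreamEncoder(client)

	for {
		var chunk chatChunk
		if err := dec.Decode(&chunk); err != nil {
			if err == io.EOF {
				return nil // upstream finished
			}
			return err
		}
		if err := enc.Encode(chunk); err != nil {
			return err
		}
	}
}
```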
## Compatibility
- Kubernetes: Supports Horizontal Pod Autoscaler (HPA) when using external AI services (e.g., the current example implementation that interacts with OpenAI). For an in-house AI solution (e.g., running your own model), consider using Vertical Pod Autoscaler (VPA) for resource management.
> [!NOTE]
> Scaling behavior also depends on the ingress configuration. With extensive Kubernetes knowledge (a "Captain Kubernetes", so to speak), it can be managed easily.
> [!TIP]
> For Horizontal Pod Autoscaler (HPA), it's recommended to use custom metrics based on connections instead of the default CPU/memory metrics. This approach can help achieve scalability, such as reaching up to 1,000 nodes to handle 1 billion connections.
## Security Considerations
The current SSE example implementation primarily involves client-side operations (e.g., even if there are vulnerabilities, they only affect the browser of the user who views the page), which inherently carry a lower security risk. However, it's crucial to address potential vulnerabilities to ensure the safety and integrity of the application:
- **Cross-Site Scripting (XSS)**:
  - Ensure that any data rendered in the HTML is properly sanitized to prevent XSS attacks.
  - Validate and sanitize server-side responses before sending them to the client.
- **Content Security Policy (CSP)**:
  - Implement a CSP in HTTP headers to restrict the sources from which resources can be loaded. This helps mitigate XSS and data injection attacks (a sketch at the end of this section shows one way to set this header).
- **Secure Fetch Requests**:
  - Use HTTPS for all API requests to ensure data encryption during transmission.
  - Validate server responses and handle errors gracefully in client-side JavaScript to avoid exposing sensitive information.
- **Access Control**:
  - Protect sensitive endpoints with authentication and authorization mechanisms to ensure only authorized users can access them.
- **Error Handling**:
  - Implement robust error handling to prevent the exposure of sensitive information through error messages.
- **Environment Variables**:
  - Store sensitive data, such as API keys, in environment variables and never hard-code them into the application.
By addressing these considerations, the security of the application can be enhanced while maintaining its performance and functionality.
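For example, a minimal sketch of the CSP and environment-variable points above, assuming Fiber v2; the header value and the `OPENAI_API_KEY` variable name are illustrative, not the repository's actual configuration.

```go
package main

import (
	"log"
	"os"

	"github.com/gofiber/fiber/v2"
)

func main() {
	// Read the API key from the environment instead of hard-coding it
	// (the variable name is illustrative).
	apiKey := os.Getenv("OPENAI_API_KEY")
	if apiKey == "" {
		log.Fatal("OPENAI_API_KEY is not set")
	}

	app := fiber.New()

	// Set a restrictive Content-Security-Policy on every response.
	// The policy below is only an example; tune it for your assets.
	app.Use(func(c *fiber.Ctx) error {
		c.Set("Content-Security-Policy",
			"default-src 'self'; script-src 'self'; connect-src 'self'")
		return c.Next()
	})

	log.Fatal(app.Listen(":3000"))
}
```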