Skip to content

alibaba/higress

Folders and files

NameName
Last commit message
Last commit date

Latest commit

1965d10 · Mar 26, 2025
Mar 26, 2025
Jan 9, 2025
Aug 26, 2024
Aug 26, 2024
Feb 26, 2025
Jan 14, 2025
Mar 26, 2025
Mar 26, 2025
Mar 26, 2025
Mar 26, 2025
Mar 26, 2025
Mar 26, 2025
Oct 15, 2024
Nov 3, 2023
Mar 17, 2025
Mar 26, 2025
Feb 5, 2025
Aug 26, 2024
Jan 14, 2025
Dec 19, 2024
Nov 7, 2022
Jan 18, 2024
Oct 24, 2024
Oct 21, 2024
Nov 8, 2022
Sep 11, 2024
Mar 26, 2025
Aug 26, 2024
Mar 26, 2025
Nov 8, 2024
Mar 26, 2025
Oct 16, 2024
Mar 26, 2025
Jan 26, 2024
Jul 8, 2024
Mar 26, 2025
Sep 12, 2024

Higress
AI Gateway

AI Native API Gateway

Build Status license

alibaba%2Fhigress | Trendshift

Official Site   |   Docs   |   Blog   |   Developer   |   Higress in Cloud  

English | 中文 | 日本語

Higress is a cloud-native API gateway based on Istio and Envoy, which can be extended with Wasm plugins written in Go/Rust/JS. It provides dozens of ready-to-use general-purpose plugins and an out-of-the-box console (try the demo here).

Higress was born within Alibaba to solve the issues of Tengine reload affecting long-connection services and insufficient load balancing capabilities for gRPC/Dubbo.

Alibaba Cloud has built its cloud-native API gateway product based on Higress, providing 99.99% gateway high availability guarantee service capabilities for a large number of enterprise customers.

Higress's AI gateway capabilities support all mainstream model providers both domestic and international, as well as self-built DeepSeek models based on vllm/ollama. Within Alibaba Cloud, it supports AI businesses such as Tongyi Qianwen APP, Bailian large model API, and machine learning PAI platform. It also serves leading AIGC enterprises (such as Zero One Infinite) and AI products (such as FastGPT).

Summary

Quick Start

Higress can be started with just Docker, making it convenient for individual developers to set up locally for learning or for building simple sites:

# Create a working directory
mkdir higress; cd higress
# Start higress, configuration files will be written to the working directory
docker run -d --rm --name higress-ai -v ${PWD}:/data \
        -p 8001:8001 -p 8080:8080 -p 8443:8443  \
        higress-registry.cn-hangzhou.cr.aliyuncs.com/higress/all-in-one:latest

Port descriptions:

  • Port 8001: Higress UI console entry
  • Port 8080: Gateway HTTP protocol entry
  • Port 8443: Gateway HTTPS protocol entry

All Higress Docker images use their own dedicated repository, unaffected by Docker Hub access restrictions in certain regions

For other installation methods such as Helm deployment under K8s, please refer to the official Quick Start documentation.

Use Cases

  • AI Gateway:

    Higress can connect to all LLM model providers both domestic and international using a unified protocol, while also providing rich AI observability, multi-model load balancing/fallback, AI token rate limiting, AI caching, and other capabilities:

  • MCP Server Hosting:

    Higress, as an Envoy-based API gateway, supports hosting MCP Servers through its plugin mechanism. MCP (Model Context Protocol) is essentially an AI-friendly API that enables AI Agents to more easily call various tools and services. Higress provides unified capabilities for authentication, authorization, rate limiting, and observability for tool calls, simplifying the development and deployment of AI applications.

    By hosting MCP Servers with Higress, you can achieve:

    • Unified authentication and authorization mechanisms, ensuring the security of AI tool calls
    • Fine-grained rate limiting to prevent abuse and resource exhaustion
    • Comprehensive audit logs recording all tool call behaviors
    • Rich observability for monitoring the performance and health of tool calls
    • Simplified deployment and management through Higress's plugin mechanism for quickly adding new MCP Servers
    • Dynamic updates without disruption: Thanks to Envoy's friendly handling of long connections and Wasm plugin's dynamic update mechanism, MCP Server logic can be updated on-the-fly without any traffic disruption or connection drops
  • Kubernetes ingress controller:

    Higress can function as a feature-rich ingress controller, which is compatible with many annotations of K8s' nginx ingress controller.

    Gateway API support is coming soon and will support smooth migration from Ingress API to Gateway API.

  • Microservice gateway:

    Higress can function as a microservice gateway, which can discovery microservices from various service registries, such as Nacos, ZooKeeper, Consul, Eureka, etc.

    It deeply integrates with Dubbo, Nacos, Sentinel and other microservice technology stacks.

  • Security gateway:

    Higress can be used as a security gateway, supporting WAF and various authentication strategies, such as key-auth, hmac-auth, jwt-auth, basic-auth, oidc, etc.

Core Advantages

  • Production Grade

    Born from Alibaba's internal product with over 2 years of production validation, supporting large-scale scenarios with hundreds of thousands of requests per second.

    Completely eliminates traffic jitter caused by Nginx reload, configuration changes take effect in milliseconds and are transparent to business. Especially friendly to long-connection scenarios such as AI businesses.

  • Streaming Processing

    Supports true complete streaming processing of request/response bodies, Wasm plugins can easily customize the handling of streaming protocols such as SSE (Server-Sent Events).

    In high-bandwidth scenarios such as AI businesses, it can significantly reduce memory overhead.

  • Easy to Extend

    Provides a rich official plugin library covering AI, traffic management, security protection and other common functions, meeting more than 90% of business scenario requirements.

    Focuses on Wasm plugin extensions, ensuring memory safety through sandbox isolation, supporting multiple programming languages, allowing plugin versions to be upgraded independently, and achieving traffic-lossless hot updates of gateway logic.

  • Secure and Easy to Use

    Based on Ingress API and Gateway API standards, provides out-of-the-box UI console, WAF protection plugin, IP/Cookie CC protection plugin ready to use.

    Supports connecting to Let's Encrypt for automatic issuance and renewal of free certificates, and can be deployed outside of K8s, started with a single Docker command, convenient for individual developers to use.

Community

Slack: to get invited go here.

Thanks

Higress would not be possible without the valuable open-source work of projects in the community. We would like to extend a special thank you to Envoy and Istio.

Related Repositories

Contributors

contributors

Star History

Star History Chart

↑ Back to Top ↑