Official Site | Docs | Blog | Developer | Higress in Cloud
Higress is a cloud-native API gateway based on Istio and Envoy, which can be extended with Wasm plugins written in Go/Rust/JS. It provides dozens of ready-to-use general-purpose plugins and an out-of-the-box console (try the demo here).
Higress was born within Alibaba to solve the issues of Tengine reload affecting long-connection services and insufficient load balancing capabilities for gRPC/Dubbo.
Alibaba Cloud has built its cloud-native API gateway product on Higress, providing a 99.99% gateway availability guarantee for a large number of enterprise customers.
Higress's AI gateway capabilities support all mainstream model providers, both domestic and international, as well as self-hosted DeepSeek models served with vLLM or Ollama. Within Alibaba Cloud, it supports AI businesses such as the Tongyi Qianwen app, the Bailian LLM API, and the PAI machine learning platform. It also serves leading AIGC enterprises (such as Zero One Infinite) and AI products (such as FastGPT).
Higress can be started with just Docker, making it convenient for individual developers to set up locally for learning or for building simple sites:
# Create a working directory
mkdir higress; cd higress
# Start higress, configuration files will be written to the working directory
docker run -d --rm --name higress-ai -v ${PWD}:/data \
-p 8001:8001 -p 8080:8080 -p 8443:8443 \
higress-registry.cn-hangzhou.cr.aliyuncs.com/higress/all-in-one:latest
Port descriptions:
- Port 8001: Higress UI console entry
- Port 8080: Gateway HTTP protocol entry
- Port 8443: Gateway HTTPS protocol entry
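Once the container is up, a quick check can confirm the entries are reachable. This is a minimal sketch assuming the default ports above; the gateway typically returns 404 until a route is configured:
# Check that the console answers on its port
curl -I http://localhost:8001
# Send a request through the gateway's HTTP entry
# (a 404 is expected until a route is configured)
curl -i http://localhost:8080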
All Higress Docker images are hosted in a dedicated registry and are unaffected by Docker Hub access restrictions in certain regions.
For other installation methods such as Helm deployment under K8s, please refer to the official Quick Start documentation.
- AI Gateway:
Using a unified protocol, Higress can connect to all LLM providers, both domestic and international, while also providing rich AI observability, multi-model load balancing/fallback, AI token rate limiting, AI caching, and other capabilities.
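As a rough sketch of what the unified protocol looks like from a client's point of view, the request below assumes a route to an LLM provider has already been configured on the gateway and exposes an OpenAI-compatible chat completions endpoint; the path, model name, and API key are illustrative:
# Illustrative only: assumes an OpenAI-compatible route is configured on the gateway
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <your-provider-api-key>" \
  -d '{
    "model": "qwen-turbo",
    "messages": [{"role": "user", "content": "Hello"}]
  }'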
- MCP Server Hosting:
Higress, as an Envoy-based API gateway, supports hosting MCP Servers through its plugin mechanism. MCP (Model Context Protocol) is essentially an AI-friendly API that enables AI Agents to more easily call various tools and services. Higress provides unified capabilities for authentication, authorization, rate limiting, and observability for tool calls, simplifying the development and deployment of AI applications.
By hosting MCP Servers with Higress, you can achieve:
- Unified authentication and authorization mechanisms, ensuring the security of AI tool calls
- Fine-grained rate limiting to prevent abuse and resource exhaustion
- Comprehensive audit logs recording all tool call behaviors
- Rich observability for monitoring the performance and health of tool calls
- Simplified deployment and management: new MCP Servers can be added quickly through Higress's plugin mechanism
- Dynamic updates without disruption: thanks to Envoy's graceful handling of long-lived connections and the dynamic update mechanism of Wasm plugins, MCP Server logic can be updated on the fly without any traffic disruption or connection drops
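As an illustration of consuming a hosted MCP Server, the snippet below opens a streaming SSE connection through the gateway; the path is a placeholder and depends on how the MCP Server route is configured:
# Illustrative only: /mcp-demo/sse is a placeholder for a configured MCP Server route
curl -N http://localhost:8080/mcp-demo/sse \
  -H "Accept: text/event-stream"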
- Kubernetes ingress controller:
Higress can function as a feature-rich ingress controller and is compatible with many annotations of the Kubernetes NGINX Ingress controller.
Gateway API support is coming soon, enabling smooth migration from the Ingress API to the Gateway API.
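As a sketch of this annotation compatibility, the command below applies an Ingress that reuses a familiar nginx annotation and targets the higress IngressClass; the names, host, and backend service are placeholders, and support for any specific annotation should be checked against the documentation:
# Illustrative only: placeholder names and host
kubectl apply -f - <<EOF
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: demo-ingress
  annotations:
    nginx.ingress.kubernetes.io/rewrite-target: /
spec:
  ingressClassName: higress
  rules:
  - host: demo.example.com
    http:
      paths:
      - path: /api
        pathType: Prefix
        backend:
          service:
            name: demo-service
            port:
              number: 80
EOF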
- Microservice gateway:
Higress can function as a microservice gateway, discovering microservices from various service registries such as Nacos, ZooKeeper, Consul, and Eureka.
It integrates deeply with Dubbo, Nacos, Sentinel, and other microservice technology stacks.
- Security gateway:
Higress can be used as a security gateway, supporting WAF and various authentication strategies such as key-auth, hmac-auth, jwt-auth, basic-auth, and oidc.
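As an illustration, once a key-auth style plugin is enabled on a route, the consumer credential is typically carried in a request header or query parameter; the header name and credential below are placeholders that depend on the plugin configuration:
# Illustrative only: header name and credential depend on the key-auth plugin configuration
curl http://localhost:8080/api/hello \
  -H "x-api-key: <consumer-credential>"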
- Production Grade
Born from Alibaba's internal product with over 2 years of production validation, supporting large-scale scenarios with hundreds of thousands of requests per second.
Completely eliminates the traffic jitter caused by Nginx reloads; configuration changes take effect in milliseconds and are transparent to the business. Especially friendly to long-connection scenarios such as AI businesses.
- Streaming Processing
Supports truly complete streaming processing of request/response bodies; Wasm plugins can easily customize the handling of streaming protocols such as SSE (Server-Sent Events).
In high-bandwidth scenarios such as AI businesses, it can significantly reduce memory overhead.
- Easy to Extend
Provides a rich official plugin library covering AI, traffic management, security protection and other common functions, meeting more than 90% of business scenario requirements.
Focuses on Wasm plugin extensions: sandbox isolation ensures memory safety, multiple programming languages are supported, plugin versions can be upgraded independently, and gateway logic can be hot-updated without traffic loss.
- Secure and Easy to Use
Based on the Ingress API and Gateway API standards, Higress provides an out-of-the-box UI console, with WAF protection and IP/Cookie CC protection plugins ready to use.
It supports Let's Encrypt for automatic issuance and renewal of free certificates, and can be deployed outside of K8s and started with a single Docker command, which is convenient for individual developers.
Slack: to get invited, go here.
Higress would not be possible without the valuable open-source work of projects in the community. We would like to extend a special thank you to Envoy and Istio.
- Higress Console: https://github.com/higress-group/higress-console
- Higress Standalone: https://github.com/higress-group/higress-standalone