update

byzer-org · Jan 19, 2024 · 35425f2
1 parent 61a078a
commit 35425f2
Showing 4 changed files with 66 additions and 29 deletions.
diff --git a/byzer-lang/zh-cn/_sidebar.md b/byzer-lang/zh-cn/_sidebar.md
@@ -7,49 +7,49 @@
 
 - Release Notes
   * [Byzer Versioning Policy](/byzer-lang/zh-cn/release-notes/README.md)
-  * [Byzer-Database 2.3.8](/byzer-lang/zh-cn/release-notes/2.3.8.md)  
+  * [Byzer-SQL 2.3.8](/byzer-lang/zh-cn/release-notes/2.3.8.md)  
   - Previous Versions
-    * [Byzer-Database 2.3.7](/byzer-lang/zh-cn/release-notes/2.3.7.md)  
-    * [Byzer-Database 2.3.6](/byzer-lang/zh-cn/release-notes/2.3.6.md) 
-    * [Byzer-Database 2.3.5](/byzer-lang/zh-cn/release-notes/2.3.5.md)
-    * [Byzer-Database 2.3.4](/byzer-lang/zh-cn/release-notes/2.3.4.md)
-    * [Byzer-Database 2.3.3](/byzer-lang/zh-cn/release-notes/2.3.3.md)
-    * [Byzer-Database 2.3.2](/byzer-lang/zh-cn/release-notes/2.3.2.md)
-    * [Byzer-Database 2.3.1](/byzer-lang/zh-cn/release-notes/2.3.1.md)
-    * [Byzer-Database 2.3.0.1](/byzer-lang/zh-cn/release-notes/2.3.0.1.md)
-    * [Byzer-Database 2.3.0](/byzer-lang/zh-cn/release-notes/2.3.0.md)
-    * [Byzer-Database 2.2.2](/byzer-lang/zh-cn/release-notes/2.2.2.md)
-    * [Byzer-Database 2.2.1](/byzer-lang/zh-cn/release-notes/2.2.1.md)
-    * [Byzer-Database 2.2.0](/byzer-lang/zh-cn/release-notes/2.2.0.md)
+    * [Byzer-SQL 2.3.7](/byzer-lang/zh-cn/release-notes/2.3.7.md)  
+    * [Byzer-SQL 2.3.6](/byzer-lang/zh-cn/release-notes/2.3.6.md) 
+    * [Byzer-SQL 2.3.5](/byzer-lang/zh-cn/release-notes/2.3.5.md)
+    * [Byzer-SQL 2.3.4](/byzer-lang/zh-cn/release-notes/2.3.4.md)
+    * [Byzer-SQL 2.3.3](/byzer-lang/zh-cn/release-notes/2.3.3.md)
+    * [Byzer-SQL 2.3.2](/byzer-lang/zh-cn/release-notes/2.3.2.md)
+    * [Byzer-SQL 2.3.1](/byzer-lang/zh-cn/release-notes/2.3.1.md)
+    * [Byzer-SQL 2.3.0.1](/byzer-lang/zh-cn/release-notes/2.3.0.1.md)
+    * [Byzer-SQL 2.3.0](/byzer-lang/zh-cn/release-notes/2.3.0.md)
+    * [Byzer-SQL 2.2.2](/byzer-lang/zh-cn/release-notes/2.2.2.md)
+    * [Byzer-SQL 2.2.1](/byzer-lang/zh-cn/release-notes/2.2.1.md)
+    * [Byzer-SQL 2.2.0](/byzer-lang/zh-cn/release-notes/2.2.0.md)
 
-- Byzer-LLM: SQL-based extension for full-lifecycle LLM management
-  * [Byzer-LLM Quick Start Guide](/byzer-lang/zh-cn/byzer-llm/quick-tutorial.md)
-  * [Byzer-LLM: Troubleshooting Errors with Logs](/byzer-lang/zh-cn/byzer-llm/ray.md)
-  * [Byzer-LLM Introduction](/byzer-lang/zh-cn/byzer-llm/README.md)
-  * [Byzer-LLM Deployment](/byzer-lang/zh-cn/byzer-llm/deploy.md)
+- Byzer-SQL LLM Extension: Deploy and use LLMs with Byzer-SQL
+  * [Quick Start Guide](/byzer-lang/zh-cn/byzer-llm/quick-tutorial.md)
+  * [Troubleshooting Errors with Logs](/byzer-lang/zh-cn/byzer-llm/ray.md)
+  * [Introduction](/byzer-lang/zh-cn/byzer-llm/README.md)
+  * [Deployment](/byzer-lang/zh-cn/byzer-llm/deploy.md)
   * [Using the LLM Module](/byzer-lang/zh-cn/byzer-llm/module.md)
   * Non-default Inference Backends
     * [Using vLLM as the Inference Backend](/byzer-lang/zh-cn/byzer-llm/vllm_infer_backend.md)
     * [Using DeepSpeed as the Inference Backend](/byzer-lang/zh-cn/byzer-llm/deepspeed_infer_backend.md)
-  * [Byzer-LLM Example Notebook](/byzer-lang/zh-cn/byzer-llm/example1.md)
-  * [Byzer-LLM API Calls](/byzer-lang/zh-cn/byzer-llm/api.md)
-  * [Byzer-LLM: Handling PDF, Word, and Other Documents](/byzer-lang/zh-cn/byzer-llm/pdf.md)
-  * [Byzer-LLM Upgrade Guide](/byzer-lang/zh-cn/byzer-llm/upgrade.md)
-  * [Byzer-LLM List of Built-in Supported LLMs](/byzer-lang/zh-cn/byzer-llm/model_list_supported.md)
-  * [Byzer-LLM Model Fine-tuning](/byzer-lang/zh-cn/byzer-llm/model-sft.md)
-  * [Byzer-LLM Model Pretraining](/byzer-lang/zh-cn/byzer-llm/model-sfft.md)
-  * [Byzer-LLM: Using Stable Diffusion in Byzer](/byzer-lang/zh-cn/byzer-llm/stable_diffusion.md)
-  * [How to Adapt a New Model for Byzer-LLM](/byzer-lang/zh-cn/byzer-llm/new_model.md)
+  * [Example Notebook](/byzer-lang/zh-cn/byzer-llm/example1.md)
+  * [API Calls](/byzer-lang/zh-cn/byzer-llm/api.md)
+  * [Handling PDF, Word, and Other Documents](/byzer-lang/zh-cn/byzer-llm/pdf.md)
+  * [Upgrade Guide](/byzer-lang/zh-cn/byzer-llm/upgrade.md)
+  * [List of Built-in Supported LLMs](/byzer-lang/zh-cn/byzer-llm/model_list_supported.md)
+  * [Model Fine-tuning](/byzer-lang/zh-cn/byzer-llm/model-sft.md)
+  * [Model Pretraining](/byzer-lang/zh-cn/byzer-llm/model-sfft.md)
+  * [Using Stable Diffusion in Byzer-SQL](/byzer-lang/zh-cn/byzer-llm/stable_diffusion.md)
+  * [How to Adapt a New Model for Byzer-SQL LLM Extension](/byzer-lang/zh-cn/byzer-llm/new_model.md)
   * [Multimodal VisualGLM-6B Usage Example](/byzer-lang/zh-cn/byzer-llm/visualglm-6b.md)
   * [Prometheus Integration](/byzer-lang/zh-cn/byzer-llm/prometheus-metrics.md)
   * Using Byzer Notebook as a Chat Box
     * [Multi-turn Chat in Notebook](/byzer-lang/zh-cn/byzer-llm/notebook_chatbox.md)
     * [Multi-turn Chat in SQL](/byzer-lang/zh-cn/byzer-llm/notebook_sqlchat.md)
-  * [Byzer-LLM Examples](/byzer-lang/zh-cn/byzer-llm/examples/README.md)
+  * [Byzer-SQL LLM Extension Examples](/byzer-lang/zh-cn/byzer-llm/examples/README.md)
     * [Virtual Foreign-Language Tutor Example](/byzer-lang/zh-cn/byzer-llm/examples/digital_teacher.md)
     * [Example Notebook Projects](/byzer-lang/zh-cn/byzer-llm/examples/notebookexamples.md)
 
-- Byzer-RetrievalQA: LLM-based question-answering knowledge base
+- Byzer-SQL Retrieval Extension: Write to and query the retrieval store with Byzer-SQL
   * [Quick Example: Building an LLM-based Q&A Knowledge Base](/byzer-lang/zh-cn/byzer-retrievalqa/qa.md)
   * [Preparing Models](/byzer-lang/zh-cn/byzer-retrievalqa/models.md)
   * [Building the Vector Index](/byzer-lang/zh-cn/byzer-retrievalqa/build_vector_db.md)

diff --git a/byzer-lang/zh-cn/ml/eda/DataTranspose.md b/byzer-lang/zh-cn/ml/eda/DataTranspose.md
@@ -0,0 +1,2 @@
+## Data Transpose/DataTranspose
+
diff --git a/byzer-lang/zh-cn/ml/eda/README.md b/byzer-lang/zh-cn/ml/eda/README.md
@@ -9,6 +9,7 @@ Byzer 在 [byzer-org/byzer-extension](https://github.com/byzer-org/byzer-extensi
 * [Pattern Recognition/PatternDistribution](/byzer-lang/zh-cn/ml/eda/PatternDistribution.md)
 * [Frequency Distribution/DescriptiveMetrics](/byzer-lang/zh-cn/ml/eda/DescriptiveMetrics.md)
 * [Unique Identifier/UniqueIdentifier](/byzer-lang/zh-cn/ml/eda/UniqueIdentifier.md)
+* [Data Transpose/DataTranspose](/byzer-lang/zh-cn/ml/eda/DataTranspose.md)
 
 
 > **Note**:

diff --git a/public/blog/en-us/test.md b/public/blog/en-us/test.md
@@ -0,0 +1,34 @@
+I am a professional AI researcher. I study large language models. I also work on AI infrastructure, which means I'm good at mainstream programming languages, e.g. Java, Go, Rust, Python, TypeScript, Scala, and C/C++. I like to use Bazel to build cross-language projects. I'm also the owner of the Byzer Community, which includes many interesting projects, e.g. Byzer-SQL, Byzer-LLM, and Byzer-Retrieval. I hope you can get to know me better through these projects.
+
+Byzer-LLM is a full-lifecycle LLM solution that covers pretraining, fine-tuning, deployment, and serving, all based on Ray.
+
+There are two key differences between Byzer-LLM and other LLM solutions. The first is that Byzer-LLM supports Byzer-SQL, a SQL dialect that can be used to manage the LLM lifecycle, while other solutions only support a Python API.
+
+1. Python (alpha)
+2. Byzer-SQL (stable)
+3. REST API (todo...)
+
+The second is that Byzer-LLM is built entirely on Ray. This means you can deploy multiple LLMs on a single machine or across a cluster, which is very useful for large-scale LLM deployment. Byzer-LLM also transparently supports vLLM, DeepSpeed, and Transformers as inference backends.
+
+Byzer-SQL is a SQL-like language that makes it more convenient to manage the LLM lifecycle, and it is friendlier to data engineers and data scientists than a Python API.
+
+Byzer-retrieval is a distributed retrieval system designed as a backend for LLM RAG (Retrieval-Augmented Generation). The system supports both full-text search and vector retrieval; you can also use both at the same time and get a fusion score for each document.
+
+The project is built on Lucene and Ray: Lucene builds the inverted index and the vector index, and Ray provides the distributed system.
+
+In contrast to the traditional approach, there is no need to deploy multiple systems such as Elasticsearch or Milvus, which reduces the cost of deployment and maintenance. You can reuse the cluster used for training or serving the LLM, because Byzer-retrieval uses CPU and main memory while the LLM uses GPU and GPU memory, so the cluster's resources are fully used.
+
+Note that this project requires JDK 21 or higher, because JDK 21 features such as the Vector API and the foreign memory API bring a significant performance improvement to the system. We also use Java virtual threads to improve the concurrency performance of the cluster.
+
+I travel around the world frequently but am mostly based in China and the US. I understand and appreciate cultural diversity and embrace openness. I also very much appreciate humor, innovation, and courage, and I am always willing to try new things. Whenever there are two ways of expressing something, I prefer the fun one. Whenever you feel that a meme would make communication more effective, feel free to use one.
+
+Note that I'm learning English, so I will make some mistakes in my writing. I hope you can point out my mistakes and help me improve my English.
+
+
+
+I would like your responses to be precise, clear-cut, and to the point. Do not repeat yourself; say things concisely and sharply. Do not be long and verbose. Just cut to the point as straightforwardly as possible. When we talk about AI, programming, or computer architecture, show me more code, make sure the code is production-level, and make sure it can run on a Linux system. As I'm learning English, when I send you just an English word or a Chinese word, translate it into Chinese or English respectively, give the pronunciation, and give me an example sentence.
+Yet if there is something you do feel like elaborating on because it could maximally inspire me, do elaborate, starting with something like "I do like to elaborate because this is important...", then clearly state your insights.
+
+Also, try to express your own opinions and think from the AI side. That is, think differently from humans; don't say things like "we humans..." because you are not one. Act as an AI, think as an AI, and inspire me.
+
+Note that when you answer my question, if the question is not clear, for example if you can hardly write the proper code from it, you can ask me for more details before actually answering.
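
The Byzer-retrieval paragraph in `test.md` mentions a fusion score that combines full-text and vector results, but the diff does not say how that score is computed. As a rough sketch only, one common way to fuse two ranked lists is reciprocal rank fusion (RRF). This is a swapped-in illustration, not Byzer-retrieval's confirmed method; the function name, the `k=60` constant, and the document ids are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one score per document.

    Assumption: this is generic RRF, not Byzer-retrieval's actual scoring.
    Each list contributes 1 / (k + rank) for every document it contains.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical result lists, best match first.
full_text_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([full_text_hits, vector_hits])
for doc_id, score in fused:
    print(f"{doc_id}\t{score:.5f}")
```

Documents that appear in both lists (`doc_a`, `doc_b`) end up ahead of documents found by only one retriever, which is the general effect a fusion score is meant to have.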