
Commit e8c7b8d — "v0" (author: nan; 1 parent: 559cabb)

File tree: 18 files changed (+54 −55 lines)

docs/backend-reference/torch.mdx — 1 addition, 1 deletion

@@ -148,7 +148,7 @@ For quantizing onnx models using tensorrt, the following parameters are availabl
-See [example](https://github.com/torchpipe/torchpipe/tree/main/examples/int8).
+See [example](https://github.com/torchpipe/torchpipe/tree/v0/examples/int8).
 ### Forward Computation

docs/benchmark.mdx — 1 addition, 1 deletion

@@ -67,7 +67,7 @@ Client 10/Compute Backend Instance 1/Timeout 0/Max Batch 1
 | triton-cli | QPS: 15039 <br /> | - |
-[import]: https://github.com/torchpipe/torchpipe/blob/main/libs/commands/import/README.md
+[import]: https://github.com/torchpipe/torchpipe/blob/v0/libs/commands/import/README.md

docs/contribution_guide/modify_the_code.md — 1 addition, 1 deletion

@@ -28,5 +28,5 @@ pytest .
 If necessary, please consider supplementing with [Python tests](https://github.com/torchpipe/torchpipe//test).
 :::note Code Formatting (optional)
-Please configure a formatting plugin to enable [.clang-format](https://github.com/torchpipe/torchpipe/blob/develop/.clang-format).
+Please configure a formatting plugin to enable [.clang-format](https://github.com/torchpipe/torchpipe/blob/v0/.clang-format).
 :::

docs/installation.mdx — 2 additions, 3 deletions

@@ -36,8 +36,7 @@ If you encounter any compilation or runtime issues inside the Docker container,
 First, clone the code:
 ```bash
-git clone https://github.com/torchpipe/torchpipe.git
-# git clone -b main https://github.com/torchpipe/torchpipe.git
+git clone -b v0 https://github.com/torchpipe/torchpipe.git
 cd torchpipe/ && git submodule update --init --recursive
 ```

@@ -146,7 +145,7 @@ For more examples, see [Showcase](./showcase/showcase.mdx).
 ## Customizing Dockerfile {#selfdocker}
-Refer to the [example Dockerfile](https://github.com/torchpipe/torchpipe/blob/main/docker/Dockerfile).
+Refer to the [example Dockerfile](https://github.com/torchpipe/torchpipe/blob/v0/docker/Dockerfile).
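The installation hunk above pins the clone to the `v0` branch via `git clone -b`. A minimal offline sketch of what that flag does, using a throwaway local repository (the repository and names below are illustrative; the documented command targets `https://github.com/torchpipe/torchpipe.git`):

```shell
# Build a throwaway local repo with a v0 branch, then clone it with -b v0.
set -e
tmp=$(mktemp -d)
cd "$tmp"
git init -q origin-repo
git -C origin-repo -c user.email=demo@example.com -c user.name=demo \
    commit -q --allow-empty -m "init"
git -C origin-repo branch v0              # create the branch we want to pin
git clone -q -b v0 origin-repo clone-v0   # -b checks out v0 directly
git -C clone-v0 rev-parse --abbrev-ref HEAD   # prints: v0
```

As in the documented command, a repository with submodules would still need `git submodule update --init --recursive` after the clone.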
docs/python/test.mdx — 1 addition, 1 deletion

@@ -392,7 +392,7 @@ def test_all_files(file_dir:str, num_clients=10, batch_size = 1,
 ### Clients with Different Batch Sizes
-In the example provided [here](https://github.com/torchpipe/torchpipe/blob/main/examples/yolox/yolox_multithreads_test.py), we use ten clients, each requesting different amounts of data per request, ranging from 1 to 10. We validate the consistency of the results in this case.
+In the example provided [here](https://github.com/torchpipe/torchpipe/blob/v0/examples/yolox/yolox_multithreads_test.py), we use ten clients, each requesting different amounts of data per request, ranging from 1 to 10. We validate the consistency of the results in this case.
 Typically, users can iterate through all the data in a directory and repeatedly send requests to verify the stability and consistency of the results.

docs/quick_start_new_user.md — 4 additions, 4 deletions

@@ -7,7 +7,7 @@ type: explainer
 # Trial in 30mins(new users)
-TorchPipe is a multi-instance pipeline parallel library that provides a seamless integration between lower-level acceleration libraries (such as TensorRT and OpenCV) and RPC frameworks. It guarantees high service throughput while meeting latency requirements. This document is mainly for new users, that is, users who are in the introductory stage of acceleration-related theoretical knowledge, know some python grammar, and can read simple codes. This content mainly includes the use of torchpipe for accelerating service deployment, complemented by performance and effect comparisons. The complete code of this document can be found at [resnet50_thrift](https://github.com/torchpipe/torchpipe/blob/develop/examples/resnet50_thrift/)
+TorchPipe is a multi-instance pipeline parallel library that provides a seamless integration between lower-level acceleration libraries (such as TensorRT and OpenCV) and RPC frameworks. It guarantees high service throughput while meeting latency requirements. This document is mainly for new users, that is, users who are in the introductory stage of acceleration-related theoretical knowledge, know some python grammar, and can read simple codes. This content mainly includes the use of torchpipe for accelerating service deployment, complemented by performance and effect comparisons. The complete code of this document can be found at [resnet50_thrift](https://github.com/torchpipe/torchpipe/blob/v0/examples/resnet50_thrift/)
 ## Catalogue
 * [1. Basic knowledge](#1)

@@ -84,7 +84,7 @@ self.classification_engine = torch2trt(resnet50, [input_shape],
-The overall online service deployment can be found at [main_trt.py](https://github.com/torchpipe/torchpipe/blob/develop/examples/resnet50_thrift/main_trt.py)
+The overall online service deployment can be found at [main_trt.py](https://github.com/torchpipe/torchpipe/blob/v0/examples/resnet50_thrift/main_trt.py)
 :::tip
 Since TensorRT is not thread-safe, when using this method for model acceleration, it is necessary to handle locking (with self.lock:) during the service deployment process.

@@ -104,7 +104,7 @@ From the above process, it's clear that when accelerating a single model, the fo
 ![](images/quick_start_new_user/torchpipe_en.png)
-We've made adjustments to the deployment of our service using TorchPipe. The overall online service deployment can be found at [main_torchpipe.py](https://github.com/torchpipe/torchpipe/blob/develop/examples/resnet50_thrift/main_torchpipe.py).
+We've made adjustments to the deployment of our service using TorchPipe. The overall online service deployment can be found at [main_torchpipe.py](https://github.com/torchpipe/torchpipe/blob/v0/examples/resnet50_thrift/main_torchpipe.py).
 The core function modifications as follows:

@@ -219,7 +219,7 @@ std="58.395, 57.120, 57.375" # 255*"0.229, 0.224, 0.225"
 `python clien_qps.py --img_dir /your/testimg/path/ --port 8888 --request_client 20 --request_batch 1`
-The specific test code can be found at [client_qps.py](https://github.com/torchpipe/torchpipe/blob/develop/examples/resnet50_thrift/client_qps.py)
+The specific test code can be found at [client_qps.py](https://github.com/torchpipe/torchpipe/blob/v0/examples/resnet50_thrift/client_qps.py)
 With the same Thrift service interface, testing on a machine with NVIDIA-3080 GPU, 36-core CPU, and concurrency of 10, we have the following results:

docs/showcase/showcase.mdx — 4 additions, 4 deletions

@@ -17,10 +17,10 @@ slug: /showcase
 | [tensorrt's native int8] | | [TensorrtTensor](../backend-reference/torch.mdx#tensorrttensor) | |
-[resnet18]: https://github.com/torchpipe/torchpipe/tree/main/examples/resnet18
-[yolox]: https://github.com/torchpipe/torchpipe/tree/main/examples/yolox
-[PP-OCRv2]: https://github.com/torchpipe/torchpipe/tree/main/examples/ppocr
-[TensorRT's native INT8]: https://github.com/torchpipe/torchpipe/tree/main/examples/int8
+[resnet18]: https://github.com/torchpipe/torchpipe/tree/v0/examples/resnet18
+[yolox]: https://github.com/torchpipe/torchpipe/tree/v0/examples/yolox
+[PP-OCRv2]: https://github.com/torchpipe/torchpipe/tree/v0/examples/ppocr
+[TensorRT's native INT8]: https://github.com/torchpipe/torchpipe/tree/v0/examples/int8
 [torchpipe.utils.cpp_extension.load]: ../python/compile.mdx
 [filter]: ../Inter-node/filter.mdx

docs/tools/quantization.mdx — 11 additions, 11 deletions

@@ -29,7 +29,7 @@ For detection models, you can consider using the [official complete tutorial](ht
 In addition to the pre-training parameters provided by the model for normal training, training-based quantization also requires quantization pre-training parameters provided by post-training quantization (ptq).
-We have integrated [calib_tools](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/calib_tools.py) for reference.
+We have integrated [calib_tools](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/calib_tools.py) for reference.
 - Define calibrator:

@@ -100,17 +100,17 @@ The official training format is very simple and is only used as an example.
 #### Direct Quantization without Modifying Backbone
 Following the official example, we conducted step-by-step experiments on resnet:
-- Download training data: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/download_data.py)
-- Train for 10 epochs to obtain the resnet50 model: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/fp32_train.py), accuracy 98.44%
-- (optional) PyTorch ptq: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/ptq.py), accuracy 96.64% (max)
-- (optional) PyTorch qat: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/qat.py), accuracy 98.26%.
+- Download training data: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/download_data.py)
+- Train for 10 epochs to obtain the resnet50 model: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/fp32_train.py), accuracy 98.44%
+- (optional) PyTorch ptq: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/ptq.py), accuracy 96.64% (max)
+- (optional) PyTorch qat: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/qat.py), accuracy 98.26%.
 #### MSE + Residual Fusion {#mseadd}
 The above resnet training uses the max quantization method and does not fuse the Add layer, resulting in TensorRT running speed not meeting expectations. The following are the results after fusing Add under int8 and switching to the mse mode:
-- ptq: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/ptq_merge_residual.py), accuracy 94.34% (mse)
-- qat: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/qat_merge_residual.py), accuracy 95.82%.
+- ptq: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/ptq_merge_residual.py), accuracy 94.34% (mse)
+- qat: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/qat_merge_residual.py), accuracy 95.82%.
 #### Summary of Results in PyTorch

@@ -124,9 +124,9 @@ The above resnet training uses the max quantization method and does not fuse the
 ### Summary of Test Results in TorchPipe
 The following tests were performed using the onnx generated by TorchPipe:
-- Export onnx: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/export_onnx_merge_residual.py)
-- Load fp32-onnx with TorchPipe and perform ptq: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/torchpipe_ptq_test.py)
-- Test with qat-onnx loaded with TorchPipe: [code](https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/torchpipe_qat_test.py)
+- Export onnx: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/export_onnx_merge_residual.py)
+- Load fp32-onnx with TorchPipe and perform ptq: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/torchpipe_ptq_test.py)
+- Test with qat-onnx loaded with TorchPipe: [code](https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/torchpipe_qat_test.py)
 | Model | Accuracy | Performance | Note |

@@ -135,4 +135,4 @@ The following tests were performed using the onnx generated by TorchPipe:
 | tensorrt's native int8 | 98.26% | - | |
 | qat | 98.67% | - | [Acc. under onnxruntime] is 98.69%. |
-[Acc. under onnxruntime]: https://github.com/torchpipe/torchpipe/blob/develop/examples/int8/qat/onnxruntime_qat_test.py
+[Acc. under onnxruntime]: https://github.com/torchpipe/torchpipe/blob/v0/examples/int8/qat/onnxruntime_qat_test.py

docusaurus.config.js — 3 additions, 3 deletions

@@ -58,11 +58,11 @@ const config = {
 editUrl: ({locale, versionDocsDirPath, docPath}) => {
 // Link to Crowdin for French docs
 if (locale == 'en') {
-return `https://github.com/torchpipe/torchpipe.github.io/edit/main/docs/${docPath}`;
+return `https://github.com/torchpipe/torchpipe.github.io/edit/v0/docs/${docPath}`;
 }
 // Link to GitHub for English docs
-return `https://github.com/torchpipe/torchpipe.github.io/edit/main/i18n/zh/docusaurus-plugin-content-docs/current/${docPath}`;
+return `https://github.com/torchpipe/torchpipe.github.io/edit/v0/i18n/zh/docusaurus-plugin-content-docs/current/${docPath}`;
 },
 sidebarCollapsed: true,
 showLastUpdateTime:true,

@@ -192,7 +192,7 @@ const config = {
 // label: "Nrwl",
 // },
 {
-href: "https://github.com/torchpipe/torchpipe/",
+href: "https://github.com/torchpipe/torchpipe/tree/v0",
 className: "header-github-link",
 "aria-label": "Github repository",
 position: "right",

i18n/zh/docusaurus-plugin-content-docs/current/backend-reference/torch.mdx — 1 addition, 1 deletion

@@ -151,7 +151,7 @@ tensorrt inference engine
-See the [example](https://github.com/torchpipe/torchpipe/tree/main/examples/int8).
+See the [example](https://github.com/torchpipe/torchpipe/tree/v0/examples/int8).
 ### Forward Computation
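Every hunk in this commit is the same mechanical edit: the branch segment of a GitHub `tree/`, `blob/`, or `edit/` link is retargeted from `main` or `develop` to `v0`. The commit contains no script, but the substitution can be sketched like this (the `rebranch` helper is hypothetical, not part of the repository):

```shell
# Hypothetical reproduction of this commit's edit: rewrite the branch
# segment of GitHub tree/blob/edit URLs from main or develop to v0.
rebranch() {
  sed -E 's#(github\.com/[^/ ]+/[^/ ]+/(tree|blob|edit)/)(main|develop)#\1v0#g'
}

echo 'See [example](https://github.com/torchpipe/torchpipe/tree/main/examples/int8).' | rebranch
# prints: See [example](https://github.com/torchpipe/torchpipe/tree/v0/examples/int8).
```

A sweep like this would not cover the bare repository link in docusaurus.config.js (`https://github.com/torchpipe/torchpipe/` became `.../tree/v0`), which needs its own edit.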
