-
Notifications
You must be signed in to change notification settings - Fork 151
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* GPT-2 first support * GPT-2 first support * cover TRT, ORT and PT (GPT-2) * optimize perf * makes ONNX Runtime fast * makes ONNX Runtime fast * remove pycuda dependency use torch tensors as input / output for tensorrt * move ort bindings to lib fix export fix trt gpt2 * fix sentence transfo export add doc * make int32 tensor default * small notebook fixes * change command line add new python generative task * delete old src * fix linter * add cache / no cache XP * make config more modular add gpt2 to the documentation * update Python tests * fix test * config for generative models * refactoring * fix sentence transformer * fix gpt2 generation * fix format * fix test * move infinity demo in a specific folder * fix decoder configuration generation * fix model call * add measures for CPU (gpt2) * fix code * add cache use in GPT-2 notebook * update notebook add tests update docker Triton image update ORT bump VERSION * add getting started section to README add sentence-transformers to Docker improve Makefile update docker version in documentation use tensorrt Python lib 8.2.2 (like Triton Docker image) Add tolerance check message * doc update * docker image includes notebook and pytorch-quantization documentation update change in index titles complete README.md * add social logo * add social logo * update notebook text * update notebook * fix Makefile
- Loading branch information
1 parent
c87a39e
commit ccfeb21
Showing
52 changed files
with
2,982 additions
and
818 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -213,7 +213,7 @@ cython_debug/ | |
|
||
# custom | ||
*.onnx | ||
*.plan | ||
.idea/ | ||
TensorRT/ | ||
triton_models/ | ||
demo/roberta-*/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,9 @@ | ||
FROM nvcr.io/nvidia/tritonserver:21.12-py3 | ||
FROM nvcr.io/nvidia/tritonserver:22.01-py3 | ||
|
||
# see .dockerignore to check what is transfered | ||
COPY . ./ | ||
|
||
RUN pip3 install -U pip && \ | ||
pip3 install nvidia-pyindex && \ | ||
pip3 install -e ".[GPU]" -f https://download.pytorch.org/whl/cu113/torch_stable.html --extra-index-url https://pypi.ngc.nvidia.com --no-cache-dir | ||
pip3 install ".[GPU]" -f https://download.pytorch.org/whl/cu113/torch_stable.html --extra-index-url https://pypi.ngc.nvidia.com --no-cache-dir && \ | ||
pip3 install sentence-transformers notebook pytorch-quantization |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
0.3.0 | ||
0.4.0 |
Oops, something went wrong.