Commit graph

31 commits

Author SHA1 Message Date
James Hoekzema
acc6746114
Support for CodeGen2 (#230)
* add config files, update docs, add codegen2 to setup, small tweaks to converter scripts - JH RJ

* Update documentation/server.md

---------

Co-authored-by: James Hoekzema <james.hoekzema@wwt.com>
Co-authored-by: Fred de Gier <freddegier@me.com>
2024-02-07 14:43:09 +01:00
Geunsik Lim
0e80eefb76
fix: fixed an incorreect if statement (#158)
Resolved #157.
The setup.sh generates the "integer expression expected" issue.
This commit is to fix this issue.

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
Signed-off-by: Geunsik Lim <geunsik.lim@samsung.com>
2023-03-13 10:00:01 +01:00
Parth Thakkar
6c1b7fb404
Merge pull request #137 from leemgs/upstream-python-backend-cc-type
fix: Disable INT8 as a default CC type in Python-backend to support CC 6.0+
2023-02-13 17:29:29 +00:00
Geunsik Lim
1ecd68aeb1 fix: enabled !INT8 as a default CC type in the Python-backend
CC (Compute Capability) of Nvidia GPU supports INT8 from version 9.0.
Therefore, this patch is to change the default SM (Streaming Machine) type to INT8
so that stable operation is compatible with versions between CC 6.0 and CC 9.0.

* Prerequisites: An NVIDIA GPU with Compute Capability >= 6.0
  https://github.com/fauxpilot/fauxpilot#prerequisites

* Architecture of Compute Capability (CC)9.0
  https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#id100

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
2023-01-03 14:51:16 +09:00
Geunsik Lim
c2caa897c2 fix: improved the docker compose statement to handle various docker env
This commit is to handle both "docker-compose" and "docker compose"
environment while running the python_backend function in the setup.sh.

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
Signed-off-by: Geunsik Lim <geunsik.lim@samsuang.com>
2023-01-03 11:28:34 +09:00
Brendan Dolan-Gavitt
e153c17194 Fix bug introduced by Python backend merge
HF_CACHE_DIR needs to be set in .env, even if the FT backend is in
use, or docker compose will complain and fail to start.
2022-11-24 15:44:15 -05:00
Brendan Dolan-Gavitt
f84f040a58 remove spurious common_config and force a container build for the python backend 2022-11-23 16:58:30 -05:00
Parth Thakkar
fa423d1da1 Update location of hf_cache in case user doesn't want to share cache
Signed-off-by: Parth Thakkar <thakkarparth007@gmail.com>
2022-11-08 17:57:43 -06:00
Parth Thakkar
c6be12979e Fix setup issues and add test script
Signed-off-by: Parth Thakkar <thakkarparth007@gmail.com>
2022-10-21 13:23:10 -05:00
Fred de Gier
2a91018792 Resolve merge conflicts and fix issues with setup.sh 2022-10-20 16:09:12 +02:00
fdegier
d3180fd306 Implement code review suggestions 2022-10-19 14:19:37 +02:00
Parth Thakkar
01f1cbb629 Add python backend support
- Modify dockerfile to include bitsandbytes, transformers and latest version of pytorch
- Minor modifications in utils/codegen.py so that same client works with FT and Py-backend
- Minor modifications in launch.sh (no need to name models by GPU)
- Add installation script for adding a new python model (with super simple config_template)
- Modify setup.sh so that it aworks with both FT and Python backend models

Signed-off-by: Parth Thakkar <thakkarparth007@gmail.com>
2022-10-16 22:05:09 -05:00
Fred de Gier
de71bb6ff5 Resole merge conflicts 2022-10-03 14:27:32 +02:00
Fred de Gier
ed1d877b7c Simplify config 2022-10-03 14:20:58 +02:00
Fred de Gier
6739b7c53d Merge branch 'main' into dev 2022-10-03 14:13:39 +02:00
Fred de Gier
87f4f53e27 Simplify config and port handling 2022-10-03 14:13:10 +02:00
Brendan Dolan-Gavitt
9165bae910
Merge pull request #58 from Frederisk/patch-1
Fixed the bug that caused downloading of models to be skipped
2022-09-21 11:31:18 -04:00
Geunsik Lim
be800f4e9d
fix: fixed incorrect return value in setup.sh
This commit fixes an incorrect return value in setup.sh. In this bash program. 
A score of zero (0) indicates that everything went smoothly. Anything else indicates 
a problem. A value of 1 indicates that some type of error has occurred.

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
Signed-off-by: Geunsik Lim <geunsik.lim@samsung.com>
2022-09-21 17:50:44 +09:00
Rowe Wilson Frederisk Holme
f3b3a9cf83
Rewrite allow re-use function 2022-09-21 09:22:45 +08:00
Rowe Wilson Frederisk Holme
dbd75f5ff8
Revert "Allow re-use of converted models"
This reverts commit 6aa53f2eb3.
2022-09-21 09:19:45 +08:00
Brendan Dolan-Gavitt
36a9c52a08
Merge pull request #33 from leemgs/upstream-check-cmd-dep
feat:  checking cmd dependency while running setup.sh
2022-09-19 20:01:26 -04:00
Brendan Dolan-Gavitt
d638ff4598
Merge pull request #52 from leemgs/patch-4
fix: added shebang in the setup.sh
2022-09-16 13:10:23 -04:00
Geunsik Lim
927be7b730
fix: added shebang in the setup.sh
This commit is trivial. It is to append the shebang in the setup.sh file. 

It is not always the case that /bin/bash is the path to the Bash shell.
It is only a different path in extremely rare instances, so it is typically irrelevant.
This commit is to specify the Bash shell (i.e., bash) using the environment variable.

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
2022-09-15 09:14:24 +09:00
Fred de Gier
6aa53f2eb3 Allow re-use of converted models 2022-09-12 12:57:24 +02:00
Rowe Wilson Frederisk Holme
704ef5ad58
Convert MODEL_DIR to an absolute path
To fix issue #41.
2022-09-06 02:51:06 +08:00
Geunsik Lim
cad047ba4f feat: added a function to check dependent cmds
This commit is append a routine that checks if required commands are
installed or not.

Signed-off-by: Geunsik Lim <leemgs@gmail.com>
2022-08-27 13:32:08 +09:00
Josh Soref
a3b8e71150 spelling: converter
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>
2022-08-07 12:49:50 -04:00
Brendan Dolan-Gavitt
9363bcd586 Download preconverted models from HuggingFace when possible
I added the 1 and 2 GPU versions of the converted models to HuggingFace
so that in the most common case we don't have to do the (very RAM
intensive) conversion process at all. Hopefully HuggingFace won't mind
us hosting them there!

This should fix #7.
2022-08-06 00:05:39 -04:00
Brendan Dolan-Gavitt
89fdf6a727 Update with support for CodeGen 2B models 2022-08-03 16:14:12 -04:00
Brendan Dolan-Gavitt
dc5d09b270 Add a readme 2022-08-02 22:13:32 -04:00
Brendan Dolan-Gavitt
a23a1478f7 Initial commit 2022-08-02 21:47:27 -04:00