Data Science Kitchen Sink

Published

May 2023

The ‘Data Science Kitchen Sink (DSKS)’ is an environment with many typical packages for analysis and machine learning. It is intended as a starting point that users can start working with the service without needing to make your own. As their work matures and specialised packages are needed, this can be cloned to serve as a basis for the new environment.

Note that this version was been archived with conda-pack during a migration. The environment can still be used if needed to rerun experiments, but has limitations such as not being compatible with Strudel2. Consider using a later version of DSKS if possible.

Executable Path

/apps/conda-packs/dsks_2023.05/bin/python3 </path/to/script.py>

Activation Path

source /apps/conda-packs/dsks_2023.05/bin/activate

Environment Definition

Code
cat /apps/conda-envs/dsks_2023.05.yml
name: dsks_2023.05
channels:
  - plotly
  - huggingface
  - fastchan
  - rapidsai
  - pytorch
  - nvidia
  - conda-forge

dependencies:
  # Interactivity
  - jupyter
  - jupyterlab
  - dask
  - dask-jobqueue
  - autopep8
  - tqdm
  - matplotlib
  - plotly
  - wandb
  - tensorboard
  
  # Data Science
  - numpy
  - scipy
  - pandas
  - rapids
  - cupy
  
  # Machine Learning
  - cudatoolkit
  - tensorflow
  - pytorch
  - torchvision
  - torchaudio
  - lightning
  - fastai
  - transformers
  - scikit-learn
  - py-xgboost-gpu
  - gensim

Full Package List

Code
!pip list
Package                       Version
----------------------------- ----------------
absl-py                       1.4.0
aiohttp                       3.8.5
aiosignal                     1.3.1
anyio                         3.7.1
appdirs                       1.4.4
argon2-cffi                   21.3.0
argon2-cffi-bindings          21.2.0
arrow                         1.2.3
asttokens                     2.2.1
astunparse                    1.6.3
async-lru                     2.0.4
async-timeout                 4.0.2
attrs                         23.1.0
autopep8                      2.0.2
Babel                         2.12.1
backcall                      0.2.0
backoff                       2.2.1
backports.cached-property     1.0.2
backports.functools-lru-cache 1.6.5
beautifulsoup4                4.12.2
bleach                        5.0.1
blessed                       1.19.1
blinker                       1.6.2
blis                          0.7.10
bokeh                         3.2.1
brotlipy                      0.7.0
build                         0.10.0
CacheControl                  0.12.14
cached-property               1.5.2
cachetools                    5.3.1
catalogue                     2.0.9
certifi                       2023.11.17
cffi                          1.15.1
charset-normalizer            3.2.0
cleo                          2.0.1
click                         8.1.6
cloudpickle                   2.2.1
colorama                      0.4.6
conda-pack                    0.7.1
confection                    0.1.1
contourpy                     1.1.0
crashtest                     0.4.1
croniter                      1.4.1
cryptography                  41.0.3
cupy                          12.1.0
cycler                        0.11.0
cymem                         2.0.7
cytoolz                       0.12.0
dask                          2023.8.1
dask-jobqueue                 0.8.2
dataclasses                   0.8
datasets                      2.14.4
dateutils                     0.6.12
debugpy                       1.6.3
decorator                     5.1.1
deepdiff                      6.3.1
defusedxml                    0.7.1
dill                          0.3.7
distlib                       0.3.6
distributed                   2023.8.1
docker-pycreds                0.4.0
dulwich                       0.21.5
entrypoints                   0.4
exceptiongroup                1.1.2
executing                     1.2.0
fastai                        2.7.12
fastapi                       0.101.0
fastcore                      1.5.29
fastdownload                  0.0.7
fastjsonschema                2.18.0
fastprogress                  1.0.3
fastrlock                     0.8
filelock                      3.12.2
flatbuffers                   23.5.26
flit_core                     3.7.1
fonttools                     4.42.0
fqdn                          1.5.1
frozenlist                    1.4.0
fsspec                        2023.6.0
gast                          0.4.0
gensim                        4.3.1
gitdb                         4.0.10
GitPython                     3.1.32
gmpy2                         2.1.2
google-auth                   2.22.0
google-auth-oauthlib          1.0.0
google-pasta                  0.2.0
grpcio                        1.54.3
h11                           0.14.0
h5py                          3.9.0
html5lib                      1.1
huggingface-hub               0.16.4
hupper                        1.12
idna                          3.4
importlib-metadata            6.8.0
importlib-resources           6.0.1
inquirer                      3.1.3
installer                     0.7.0
ipykernel                     6.16.2
ipython                       8.14.0
ipython-genutils              0.2.0
ipywidgets                    8.0.3
isoduration                   20.11.0
itsdangerous                  2.1.2
jaraco.classes                3.3.0
jax                           0.4.14
jaxlib                        0.4.14
jedi                          0.19.0
jeepney                       0.8.0
Jinja2                        3.1.2
joblib                        1.3.2
json5                         0.9.14
jsonpointer                   2.0
jsonschema                    4.19.0
jsonschema-specifications     2023.7.1
jupyter                       1.0.0
jupyter_client                7.4.8
jupyter-console               6.4.4
jupyter_core                  5.3.1
jupyter-events                0.7.0
jupyter-lsp                   2.2.0
jupyter_server                2.7.0
jupyter-server-mathjax        0.2.6
jupyter_server_terminals      0.4.4
jupyterlab                    4.0.5
jupyterlab_git                0.50.0
jupyterlab-pygments           0.2.2
jupyterlab_server             2.24.0
jupyterlab-widgets            3.0.8
keras                         2.12.0
Keras-Preprocessing           1.1.2
keyring                       23.13.1
kiwisolver                    1.4.4
langcodes                     3.3.0
lightning                     2.0.7
lightning-cloud               0.5.37
lightning-utilities           0.9.0
locket                        1.0.0
lockfile                      0.12.2
lz4                           4.3.2
Markdown                      3.4.4
markdown-it-py                3.0.0
MarkupSafe                    2.1.3
matplotlib                    3.7.2
matplotlib-inline             0.1.6
mdurl                         0.1.0
mistune                       0.8.4
ml-dtypes                     0.2.0
more-itertools                10.1.0
mpmath                        1.3.0
msgpack                       1.0.5
multidict                     6.0.4
multiprocess                  0.70.15
munkres                       1.1.4
murmurhash                    1.0.9
nbclient                      0.6.6
nbconvert                     6.5.0
nbdime                        4.0.1
nbformat                      5.9.2
nest-asyncio                  1.5.6
networkx                      3.1
notebook                      6.4.12
notebook_shim                 0.2.3
numpy                         1.25.2
oauthlib                      3.2.2
opt-einsum                    3.3.0
ordered-set                   4.1.0
orjson                        3.9.3
overrides                     7.4.0
packaging                     23.1
pandas                        2.0.3
pandocfilters                 1.5.0
parso                         0.8.3
partd                         1.3.0
PasteDeploy                   3.0.1
pathtools                     0.1.2
pathy                         0.10.2
pexpect                       4.8.0
pickleshare                   0.7.5
Pillow                        10.0.0
pip                           23.2.1
pkginfo                       1.9.6
pkgutil_resolve_name          1.3.10
plaster                       1.0
plaster-pastedeploy           0.7
platformdirs                  3.5.1
plotly                        5.16.1
ply                           3.11
poetry                        1.5.1
poetry-core                   1.6.1
poetry-plugin-export          1.4.0
pooch                         1.7.0
preshed                       3.0.8
prometheus-client             0.14.1
prompt-toolkit                3.0.39
protobuf                      4.21.12
psutil                        5.9.5
ptyprocess                    0.7.0
pure-eval                     0.2.2
pyarrow                       12.0.1
pyasn1                        0.4.8
pyasn1-modules                0.2.7
pycodestyle                   2.11.0
pycparser                     2.21
pydantic                      1.10.12
Pygments                      2.16.1
PyJWT                         2.8.0
pyOpenSSL                     23.2.0
pyparsing                     3.0.9
pyproject_hooks               1.0.0
PyQt5                         5.15.9
PyQt5-sip                     12.12.2
pyramid                       2.0.1
PySocks                       1.7.1
python-dateutil               2.8.2
python-editor                 1.0.4
python-json-logger            2.0.7
python-multipart              0.0.6
pytorch-lightning             2.0.6
pytz                          2023.3
pyu2f                         0.1.5
PyYAML                        6.0
pyzmq                         24.0.1
qtconsole                     5.3.1
QtPy                          2.1.0
rapidfuzz                     2.15.1
rapids                        0.0.1
readchar                      4.0.5.dev0
referencing                   0.30.2
regex                         2023.8.8
requests                      2.31.0
requests-oauthlib             1.3.1
requests-toolbelt             1.0.0
rfc3339-validator             0.1.4
rfc3986-validator             0.1.1
rich                          13.5.1
rpds-py                       0.9.2
rsa                           4.9
sacremoses                    0.0.43
safetensors                   0.3.2
scikit-learn                  1.3.0
scipy                         1.11.2
SecretStorage                 3.3.3
Send2Trash                    1.8.0
sentry-sdk                    1.29.2
setproctitle                  1.3.2
setuptools                    65.6.3
shellingham                   1.5.1
sip                           6.7.11
six                           1.16.0
smart-open                    5.2.1
smmap                         3.0.5
sniffio                       1.3.0
sortedcontainers              2.4.0
soupsieve                     2.3.2.post1
spacy                         3.6.1
spacy-legacy                  3.0.12
spacy-loggers                 1.0.4
srsly                         2.4.7
ssossh                        0.0.4
stack-data                    0.6.2
starlette                     0.27.0
starsessions                  1.3.0
sympy                         1.12
tblib                         1.7.0
tenacity                      8.2.2
tensorboard                   2.12.3
tensorboard-data-server       0.7.0
tensorflow                    2.12.1
tensorflow-estimator          2.12.0
termcolor                     2.3.0
terminado                     0.15.0
thinc                         8.1.11
threadpoolctl                 3.2.0
tinycss2                      1.1.1
tokenizers                    0.13.3
toml                          0.10.2
tomli                         2.0.1
tomlkit                       0.12.1
toolz                         0.12.0
torch                         2.0.0.post200
torchaudio                    2.0.0
torchmetrics                  1.0.3
torchvision                   0.15.2a0+072ec57
tornado                       6.3.2
tqdm                          4.66.1
traitlets                     5.9.0
transformers                  4.31.0
translationstring             1.4
trove-classifiers             2023.8.7
typer                         0.9.0
typing_extensions             4.5.0
typing-utils                  0.1.0
tzdata                        2023.3
unicodedata2                  15.0.0
uri-template                  1.3.0
urllib3                       1.26.15
uvicorn                       0.23.2
venusian                      3.0.0
virtualenv                    20.24.1
wandb                         0.15.8
wasabi                        1.1.2
wcwidth                       0.2.6
webcolors                     1.13
webencodings                  0.5.1
WebOb                         1.8.7
websocket-client              1.6.1
websockets                    11.0.3
Werkzeug                      2.3.6
wheel                         0.41.1
widgetsnbextension            4.0.4
wrapt                         1.15.0
xgboost                       1.7.6
xxhash                        0.0.0
xyzservices                   2023.7.0
yarl                          1.9.2
zict                          3.0.0
zipp                          3.16.2
zope.deprecation              4.4.0
zope.interface                6.0