OpenFold3 Setup¶

Installation¶

Pre-requisites¶

OpenFold3 inference requires a system with a GPU with a minimum of CUDA 12.1 and 32GB of memory. Most of our testing has been performed on A100s with 40GB of memory.

Modern conda environments with pixi (recommended)¶

Although OpenFold3 can be installed directly with:

pip install openfold3

for a more reproducible and streamlined setup, especially when installing OpenFold3 with its full set of dependencies, we recommend using pixi. Using pixi makes it easier to choose the right pre-configured environment for your hardware, such as CPU-only, NVIDIA CUDA, or AMD ROCm.

First, install pixi by following the official installation instructions:

# You only need to do this once, and then you can use pixi for future projects.
curl -fsSL https://pixi.sh/install.sh | sh

# Restart your shell after installation.
# Optionally, you can also install shell completions for pixi.

Next, clone the OpenFold3 repository:

git clone git@github.com:aqlaboratory/openfold-3.git
cd openfold-3

OpenFold3 includes several pre-configured pixi environments for different systems and hardware configurations.

To set up and run OpenFold3 using the CPU environment, run:

pixi run -e openfold3-cpu setup_openfold
pixi run -e openfold3-cpu run_openfold

Available pixi environments:

Environment	Supported platforms	Description
`openfold3-cpu`	`linux-64`, `linux-aarch64`, `osx-64`, `osx-arm64`	CPU-only environment for running OpenFold3 without GPU acceleration.
`openfold3-cuda12`	`linux-64`, `linux-aarch64`	NVIDIA GPU environment using CUDA 12.
`openfold3-cuda13`	`linux-64`, `linux-aarch64`	NVIDIA GPU environment using CUDA 13.
`openfold3-rocm7`	`linux-64`	AMD GPU environment using ROCm 7.

Choose the environment that matches your system. For example, use openfold3-cpu for CPU-only installations, openfold3-cuda12 or openfold3-cuda13 for NVIDIA GPU systems, and openfold3-rocm7 for AMD ROCm systems.

For more information, including rationale, tips and tricks, see Modern Conda Environments with Pixi.

Installation via pip¶

[Optional] Create a fresh environment with python. Python versions 3.10 - 3.13 are supported

conda create -n openfold3 python=3.13 

Install openfold3 the pypi server:

pip install openfold3

Installation with kernels¶

Using Attention kernels compatible with your system can provide 4-5x speedup for inference on long (>1000 residue) sequences. We currently support 3 different kernels

Deepspeed Installation of Deepspeed requires installation of Nvidia CUTLASS

Nvidia CUTLASS may be installed directly from source.

Alternatively, a pypi package of nvidia-cutlass is available, but it is no longer being maintained after version 4.2.0. To use this package, run pip install nvidia-cutlass && export CUTLASS_PATH=DS_USE_CUTLASS_PYTHON_BINDINGS

Once nvidia cutlass is installed, the deepspeed extension may be installed with

pip install openfold3[deepspeed]

cuEquivariance To install GPU accelerated cuEquivariance attention kernels, use:

pip install openfold3[cuequivariance]

Triton To use AMD ROCm-compatible Triton kernels, first install the ROCm PyTorch wheel (which bundles ROCm Triton), then install openfold3:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm7.2
pip install openfold3

For AMD system installation: After installation, verify your ROCm environment is correctly configured:

validate-openfold3-rocm

Environment variables¶

Note: This may need a revision given the pixi managed envs above (JD).

OpenFold may need a few environment variables set so CUDA, compilation, and JIT-built extensions can be found correctly.

CUDA_HOME should point to the CUDA installation. On many HPC clusters you will this can be set by loading the appropriate toolchain using environment modules, for example module load cuda. If you do not set this you will likely get a No such file or directory: '/usr/local/cuda/bin/nvcc' error.

CUTLASS_PATH will need to be set for most systems. If you do not set this you will get Deepspeed related errors such as Error: Unable to JIT load the evoformer_attn op. Generally this can be set using

# Start your environment which as openfold3 installed
source .venv/bin/activate
# Set CUTLASS_PATH using the resolved path  
export CUTLASS_PATH=$(python - << 'PY'
import cutlass_library, pathlib
print(pathlib.Path(cutlass_library.__file__).resolve().parent.joinpath("source"))
PY
)

LD_LIBRARY_PATH may need to be set to the matching CUDA directories. How to set this will depend on the system.
- Example: export LD_LIBRARY_PATH="$CUDA_HOME/targets/x86_64-linux/lib:${LD_LIBRARY_PATH:-}"
- You can often run find "$CUDA_HOME" -name 'libcurand.so*' 2>/dev/null to find the CUDA layout of your system.
If you get a /usr/bin/ld: cannot find -lcurand error, this usually means the CUDA math libraries (which include libcurand) are not on your library search path. You may need to add the appropriate CUDA library directory to LIBRARY_PATH.
- Example: export LIBRARY_PATH="$(echo "$CUDA_HOME" | sed 's|/cuda/|/math_libs/|')/targets/sbsa-linux/lib:${LIBRARY_PATH:-}"

OpenFold3 Docker Image¶

Dockerhub¶

The OpenFold3 Docker Image is now available on Docker Hub: openfoldconsortium/openfold3

To get the latest stable version, you can use the following command

docker pull openfoldconsortium/openfold3:stable

GitHub Container Registry (GHCR)¶

You can download the openfold3 docker image from GHCR, you’ll need to install ‘gh-cli’ first, instructions here.

You’ll need to authenticate with GitHub, make sure you request the read:packages scope.

gh auth login --scopes read:packages

Verify that login succeeded and scope is assigned

gh auth status 
github.com
  ✓ Logged in to github.com account ******* (/home/ubuntu/.config/gh/hosts.yml)
  - Active account: true
  - Git operations protocol: ssh
  - Token: gho_************************************
  - Token scopes: 'admin:public_key', 'gist', 'read:org', 'read:packages', 'repo'

Let’s inject the GitHub token into the docker config. Note this will expire.

gh auth token | docker login ghcr.io -u $(gh api user --jq .login) --password-stdin

Pull the image itself

docker pull ghcr.io/aqlaboratory/openfold-3/openfold3-docker:0.4.2

Building the OpenFold3 Docker Image¶

If you would like to build an OpenFold docker image locally, we provide a dockerfile. You may build this image with the following command:

docker build -f Dockerfile -t openfold-docker .

Downloading OpenFold3 model parameters¶

On the first inference run, default model parameters will be downloaded to the $HOME/.openfold3. To customize your checkpoint download path, you use one of the following options:

Using `setup_openfold`¶

We provide a one-stop binary that sets up openfold and runs integration tests.

Usage: setup_openfold [OPTIONS]

Options:
  --non-interactive  Non-interactively run setup using all default config
                     values.
  --config FILE      Path to a JSON file containing an OpenFoldSetupConfig.
  --help             Show this message and exit.

This script will:

Create an $OPENFOLD_CACHE environment [Optional, default: ~/.openfold3]
Setup a directory for OpenFold3 model parameters [default: ~/.openfold3]
- Writes the path to $OPENFOLD_CACHE/ckpt_root
Download the model parameters, if the parameter file does not already exist. You will have the option to download one set of parameters or all parameters. See OpenFold3 Parameters for more information on available parameters.
Download and setup the Chemical Component Dictionary (CCD) with Biotite
Optionally run an inference integration test on two samples, without MSA alignments (~5 min on A100)
- N.B. To run the integration tests, pytest must be installed.

Downloading the model parameters manually

If preferred, the model parameters (~2GB) for the trained OpenFold3 model can be downloaded from our AWS RODA bucket using the AWS CLI as follows:

aws s3 cp s3://openfold/staging/of3-p2-155k.pt <dst_path> --no-sign-request

To use these checkpoints with OpenFold3, it is then necessary to pass in the full path to the parameters through the command line arguments, e.g. --inference_ckpt_path. See Inference instructions for more details.

Setting OpenFold3 Cache environment variable¶

You can optionally set your OpenFold3 Cache path as an environment variable:

export OPENFOLD_CACHE=`/<custom-dir>/.openfold3/`

This can be used to provide some default paths for model parameters (see section below).

Running OpenFold Tests¶

OpenFold tests require pytest, which can be installed with:

mamba install pytest

Once installed, tests can be run using:

pytest openfold3/tests/

To run the inference verification tests, run:

pytest tests/ -m "inference_verification"

Note: To build deepspeed, it may be necessary to include the environment $LD_LIBRARY_PATH and $LIBRARY_PATH, which can be done via the following

export LIBRARY_PATH=$CONDA_PREFIX/lib:$LIBRARY_PATH
export LD_LIBRARY_PATH=$CONDA_PREFIX/lib:$LD_LIBRARY_PATH