Migrate a Git repo into Git LFS with BFG

Using Git LFS can help you to reduce the size of your Git repository and improve its performance.

However, simply adding the large files that are already in your repository to Git LFS, will not actually reduce the size of your repository because the files are still referenced by previous commits.

Through the method described on this document, first migrate to Git LFS with BFG through a mirror repo, then clean up the repository's history, and lastly create LFS tracking rules to prevent new binary files from being added.

This tutorial was inspired by the guide Use BFG to migrate a repo to Git LFS. For more information on Git LFS, see the references below.

CAUTION: Warning: The method described on this guide rewrites Git history. Make sure to back up your repo before beginning and use it at your own risk.

Requirements

Before beginning, make sure:

  • You have enough LFS storage for the files you want to convert. Storage is required for the entire history of all files.
  • All the team members you share the repository with have pushed all changes. Branches based on the repository before applying this method cannot be merged. Branches based on the repo before applying this method cannot be merged.

To follow this tutorial, you'll need:

  • Maintainer permissions to the existing Git repository you'd like to migrate to LFS with access through the command line.

  • Git and Java Runtime Environment (Java 7 or above) installed locally.

  • BFG installed locally:

    brew install bfg
  • Git LFS installed locally:

    brew install git-lfs

NOTE: Note: This guide was tested on macOS Mojave.

Steps

Consider an example upstream project, git@gitlab.com:gitlab-tests/test-git-lfs-repo-migration.git.

  1. Back up your repository:

    Create a copy of your repository so that you can recover it in case something goes wrong.

  2. Clone --mirror the repo:

    Cloning with the mirror flag will create a bare repository. This ensures you get all the branches within the repo.

    It creates a directory called <repo-name>.git (in our example, test-git-lfs-repo-migration.git), mirroring the upstream project:

    git clone --mirror git@gitlab.com:gitlab-tests/test-git-lfs-repo-migration.git
  3. Convert the Git history with BFG:

    bfg --convert-to-git-lfs "*.{png,mp4,jpg,gif}" --no-blob-protection test-git-lfs-repo-migration.git

    It is scanning all the history, and looking for any files with that extension, and then converting them to an LFS pointer.

  4. Clean up the repository:

    # cd path/to/mirror/repo:
    cd test-git-lfs-repo-migration.git
    # clean up the repo:
    git reflog expire --expire=now --all && git gc --prune=now --aggressive

    You can also take a look on how to further clean the repo, but it's not necessary for the purposes of this guide.

  5. Install Git LFS in the mirror repository:

    git lfs install
  6. Unprotect the default branch, so that we can force-push the rewritten repository:

    1. Navigate to your project's Settings > Repository and expand Protected Branches.
    2. Scroll down to locate the protected branches and click Unprotect the default branch.
  7. Force-push to GitLab:

    git push --force
  8. Track the files you want with LFS:

    # cd path/to/upstream/repo:
    cd test-git-lfs-repo-migration
    # You may need to reset your local copy with upstream's `master` after force-pushing from the mirror:
    git reset --hard origin/master
    # Track the files with LFS:
    git lfs track "*.gif" "*.png" "*.jpg" "*.psd" "*.mp4" ".gitattributes" "img/"

    Now all existing the files you converted, as well as the new ones you add, will be properly tracked with LFS.

  9. Re-protect the default branch:

    1. Navigate to your project's Settings > Repository and expand Protected Branches.
    2. Select the default branch from the Branch dropdown menu, and set up the Allowed to push and Allowed to merge rules.
    3. Click Protect.

References