Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/b1f6c1c4/git-get

Blazingly fast git clone alternative
https://github.com/b1f6c1c4/git-get

git git-clone git-clone-partial git-shallow-clone

Last synced: 3 months ago
JSON representation

Blazingly fast git clone alternative

Awesome Lists containing this project

README

        

# `git-get`: Blazingly fast, incredibly handy `git clone`

[![Appveyor](https://img.shields.io/appveyor/build/b1f6c1c4/git-get?style=flat-square)](https://ci.appveyor.com/project/b1f6c1c4/git-get/build/master)
[![Coveralls](https://img.shields.io/coveralls/github/b1f6c1c4/git-get?style=flat-square)](https://coveralls.io/github/b1f6c1c4/git-get)

- ✅ Automatic shallow clone
- ✅ Automatic partial clone
- ✅ Single file/directory clone, even crossing submodule boundaries
- ✅ Arbitrary clone given commit SHA1
- ✅ Fast parallel submodules clone
- ✅ Interactive submodules clone selection
- ✅ Full `git sparse-checkout` support
- ✅ Optional single branch/tag clone
- ✅ Tag file `VERSION`
- ✅ Automatic origin + upstream clone (for GitHub only)

## TL;DR

- Download a file:
```bash
# Method 1: Paste the original URL into the terminal:
git get https://github.com/b1f6c1c4/git-get/blob/master/README.md
# Method 2: Of course, a full URL is acceptable:
git get [email protected]:b1f6c1c4/git-get.git -- README.md
# Method 3a: Type a few words in the terminal:
git get b1f6c1c4/git-get -- README.md
# Method 3b: If the above doesn't work because of SSH, use HTTPS:
git get -H b1f6c1c4/git-get -- README.md
```
- Download a folder:
```bash
# The same as before:
git get https://github.com/b1f6c1c4/git-get/tree/master/tests
git get b1f6c1c4/git-get -- tests
# Optionally, you may want a VERSION file to record the commit SHA1:
git get -t ...
```
- Download a repo:
```bash
git get[s] [-X|-Y] https://github.com/b1f6c1c4/git-get
git get[s] [-X|-Y] https://github.com/b1f6c1c4/git-get/tree/example-repo2
git get[s] [-X|-Y] https://github.com/b1f6c1c4/git-get/commit/2dd50b6
git get[s] [-X|-Y] b1f6c1c4/git-get
git get[s] [-X|-Y] b1f6c1c4/git-get example-repo2
git get[s] [-X|-Y] b1f6c1c4/git-get 2dd50b6
```
- __`s`__ to include submodules
- Depending on the scenario, use __one__ of the following:
- __`-X`__: clone a repo and make changes
- __`-Y`__: download a repo to compile it
- You already have a cloned repo, and you want its submodules:
```bash
git gets # Just give me all
git gets -c # Let me choose
git gets --no-init # Only those with 'git submodule init ...'
```

## Performance

```bash
git get -t
# is 1x~10000x faster than:
# git clone repo
# git -C repo rev-parse HEAD > repo/VERSION
# rm -rf repo/.git

git get -o --
# is 1x~1000000x faster than:
# git clone repo
# git -C repo submodule update --init --recursive
# cp repo/ && rm -rf repo

git get --
# is 1x~1000000000x faster than:
# git clone --mirror repo
# git -C repo switch --detach
# rm -rf repo/.git

git gets -P
# is 1x~10000000x faster than:
# git clone --mirror repo
# git -C repo switch --detach
# git -C repo submodule update --init --recursive
# rm -rf repo/**/.git

# If you already have a repo and want to inflate all its submodules:
git gets
# is 1x~10000000x faster than (and 8x shorter to type):
# git submodule update --init --recursive
```

## Why we need it, and why is it so fast?

### A brief but lengthened story of `git clone` performance

So many times we want to download something hosted on GitHub.
What we actually want is a complete working copy of the code and configurations,
without any development history or irrelavent informations.
Once upon a time, there were only two ways to retrieve data from git/GitHub:

1. Simply call `git clone`.
Well, by default `git` downloads the entire development history.
Some huge projects can have 100,000 commits, each with 1GiB files.
Even though there are some duplicated files that saves some space,
this is **totally undesirable unless you are one of the developer**.

1. On GitHub, click `Clone or download`/`Download ZIP`.
OK, now we only have what we want downloaded.
But is that complete? What about `git submodule`s?
Some huge projects can have more than 30 nested submodules.
Are you willing to download one by another with your own hands?
This is **only applicable if there is no or very few submodules**.

1. On GitHub, click `Raw` button when displaying a file.
OK, but it only works for a single file.
Are you willing to download one by another with your own hands?
This is **only applicable if you need up to a few files**.

Some time later, `git` has improved.

1. We now have `git clone --depth=1`: to clone the very first commit of a branch.
But some problems also arose:
We cannot get a specific commit buried inside the development history.
This may not be a problem for big matured project where there
people only need to look for its tags and branches.
However, we frequently need to retrieve a specified version of a repo,
especially when we are using `git submodule`.
Long words short, `--depth=1` works well for the parent repo,
but dysfunctions so frequently when working with submodules.

And even later, in 2018, `git` improved again.

1. We now have `git clone --filter tree:0`: to clone commits eagerly but files lazily.
That's a great improvement!
But GitHub hadn't been offering support for `--filter` until 2019.
So, now, we have all the tools necessary to download whatever you what from GitHub!

### Benefits of using `git-get`

1. It leverages both `--depth` and `--filter` to save bandwidth.
Only the files you actually want (that commit that file) are downloaded.
No entire development history.
No entire repository folder.
Remember, this applies to the parent repo as well as all sub repos.
1. It handles `git submodule`s very well.
Just tell `git-get` the path of your file with respect to the parent repo.
`git-get` will recursively scan through the submodule chain and grab the file for you.
1. It handles optional dependencies also pretty well:
Some project specifies optional dependencies as submodules.
If you want to download some submodules but not the others,
just add `-c|--confirm` to `git-gets` and you can
interactively choose which dependency you want to install.

## Basic Usage

The CLI is pretty self-explanatory:

```bash
# There are multiple ways to specify what you want to download:
:=

| / [|]
| https://github.com///
| https://github.com///commit/
| https://github.com///tree/[/]
| https://github.com///blob/[/]

# Download a single repo (or part of):
git-get [-v|--verbose|-q|--quiet] [-s|--ssh | -H|--https] [-X|-Y]
[-o ] [-f|--force] [-F|--rm-rf]
(-x [-B] [-T] | [-t|--tag] [-- []])

# Download a repo and its submodules:
git gets [-v|--verbose|-q|--quiet] [-s|--ssh | -H|--https] [-X|-Y]
[-P|--parallel] [-c|--confirm] [--no-recursive]
[-o ] [-F|--rm-rf]
(-x [-B] [-T] | [-t|--tag])

# Download submodules of an existing repo:
git gets [-v|--verbose|-q|--quiet] [-s|--ssh | -H|--https] [-X|-Y]
[-P|--parallel] [-c|--confirm] [--no-recursive] [--no-init]
```

Some comments:

* `-X`=`-xuBTP` for keep repo and make changes; `-Y`=`-tP` for compiling.

* `-s|--ssh` and `-H|--https`:
Override using HTTPS or SSH when accesssing github.com and gist.github.com
in the case when you don't have a ready-to-use SSH or HTTPS set-up,

* `-f|--force` and `-F|--rm-rf`:
Override existing file with `-f|--force`.
Override existing directory with `-F|--rm-rf`.

* For `git-get`, leaving an empty `--` at the end creates a
[sparse checkout](https://git-scm.com/docs/git-sparse-checkout)
repo, cone mode.

* `-x`, `-B|--single-branch`, and `-T|--no-tags`:
`-x` will keep the `.git` so you can make changes.
The repository is NOT 100% the same as a regular `git-clone`'d one,
as only commits are fetched but not file contents.
You cannot use it together with `-t|--tag`.
To take a deeper look at the difference, please read the following reference:
[git partial clone](https://git-scm.com/docs/partial-clone).
For repos with many branches / git tags, specifying `-B` and/or `-T` will
remove unused branches / git tags.

* `-t|--tag`:
Instead of keeping a respository, generate a single file called `VERSION`
that contains the SHA-1 of the commit you accessed.
Put it along side with your downloaded file or inside your downloaded directory
so you will know from where the file/dir is obtained.
You cannot use it together with `-x`.

Not all options are shown here.
For additional ones, refer to `man git-get` and `man git-gets`.

## Install and Upgrade

(The upgrading process and install process are identical.)

- Arch Linux

It's on [AUR](https://aur.archlinux.org/packages/git-get):
```bash
yay install git-get
rua install git-get
...
```

- Linux but not Arch Linux

We recommend that you download the latest release and untar the files:
```bash
# Install git-get(1) globally:
curl -fsSL https://github.com/b1f6c1c4/git-get/releases/latest/download/git-get.tar.xz | sudo tar -C /usr -xJv
# Or, locally:
mkdir -p ~/.local/
curl -fsSL https://github.com/b1f6c1c4/git-get/releases/latest/download/git-get.tar.xz | tar -C ~/.local/ -xJv
```

- MacOS

```bash
# Install dependencies, including realpath(1):
brew install coreutils
# Install git-get(1) globally:
curl -fsSL https://github.com/b1f6c1c4/git-get/releases/latest/download/git-get.tar.xz | sudo tar -C /usr/local -xJv
# Or, locally:
mkdir -p ~/.local/bin/
curl -fsSL https://github.com/b1f6c1c4/git-get/releases/latest/download/git-get.tar.xz | tar -C ~/.local/ -xJv
```

- Windows

Similar as above, but you need to manually download the two files [git-get](https://github.com/b1f6c1c4/git-get/blob/master/git-get) and [git-gets](https://github.com/b1f6c1c4/git-get/blob/master/git-gets) and put it in `PATH`. As for the documentation, you will need to browse it online.

You DO NOT need to setup `git config alias.get '!git-get'`.
In fact, git is so smart that, as long as `git-get` is in `PATH`, `git ` will be interpreted as `git-`.

## Requirements

* `bash`, can be `GNU bash` on Linux / MacOS, or `Git bash` on Windows
* `git` **2.20+**, the newer the better
* `sed`, and `grep`
* On Linux: You should already have them installed.
* On MacOS: You should already have them installed.
* On Windows:
```bash
choco install grep sed
```
* (optional) `curl` for `-u|--upstream` functionality
* (optional) `kcov` for checking coverage

## License

MIT