descriptionPrograms to feed git-fast-import
ownerfrej.drejhammar@gmail.com
last changeSat, 11 Aug 2018 13:49:08 +0000 (11 15:49 +0200)
content tags
add:
README.md

hg-fast-export.(sh|py) - mercurial to git converter using git-fast-import

Most hg-* scripts are licensed under the [MIT license] (http://www.opensource.org/licenses/mit-license.php) and were written by Rocco Rutte <pdmef@gmx.net> with hints and help from the git list and #mercurial on freenode. hg-reset.py is licensed under GPLv2 since it copies some code from the mercurial sources.

The current maintainer is Frej Drejhammar <frej.drejhammar@gmail.com>.

Support

If you have problems with hg-fast-export or have found a bug, please create an issue at the [github issue tracker] (https://github.com/frej/fast-export/issues). Before creating a new issue, check that your problem has not already been addressed in an already closed issue. Do not contact the maintainer directly unless you want to report a security bug. That way the next person having the same problem can benefit from the time spent solving the problem the first time.

System Requirements

This project depends on Python 2.7 and the Mercurial 4.6 package. If Python is not installed, install it before proceeding. The Mercurial package can be installed with pip install mercurial.

If you're on Windows, run the following commands in git bash (Git for Windows).

Usage

Using hg-fast-export is quite simple for a mercurial repository <repo>:

mkdir repo-git # or whatever
cd repo-git
git init
hg-fast-export.sh -r <local-repo>
git checkout HEAD

Please note that hg-fast-export does not automatically check out the newly imported repository. You probably want to follow up the import with a git checkout-command.

Incremental imports to track hg repos is supported, too.

Using hg-reset it is quite simple within a git repository that is hg-fast-export'ed from mercurial:

hg-reset.sh -R <revision>

will give hints on which branches need adjustment for starting over again.

When a mercurial repository does not use utf-8 for encoding author strings and commit messages the -e <encoding> command line option can be used to force fast-export to convert incoming meta data from <encoding> to utf-8. This encoding option is also applied to file names.

In some locales Mercurial uses different encodings for commit messages and file names. In that case, you can use --fe <encoding> command line option which overrides the -e option for file names.

As mercurial appears to be much less picky about the syntax of the author information than git, an author mapping file can be given to hg-fast-export to fix up malformed author strings. The file is specified using the -A option. The file should contain lines of the form "<key>"="<value>". Inside the key and value strings, all escape sequences understood by the python string_escape encoding are supported. (Versions of fast-export prior to v171002 had a different syntax, the old syntax can be enabled by the flag --mappings-are-raw.)

The example authors.map below will translate User <garbage<tab><user@example.com> to User <user@example.com>.

-- Start of authors.map --
"User <garbage\t<user@example.com>"="User <user@example.com>"
-- End of authors.map --

Tag and Branch Naming

As Git and Mercurial have differ in what is a valid branch and tag name the -B and -T options allow a mapping file to be specified to rename branches and tags (respectively). The syntax of the mapping file is the same as for the author mapping.

Content filtering

hg-fast-export supports filtering the content of exported files. The filter is supplied to the --filter-contents option. hg-fast-export runs the filter for each exported file, pipes its content to the filter's standard input, and uses the filter's standard output in place of the file's original content. The prototypical use of this feature is to convert line endings in text files from CRLF to git's preferred LF:

-- Start of crlf-filter.sh --
#!/bin/sh
# $1 = pathname of exported file relative to the root of the repo
# $2 = Mercurial's hash of the file
# $3 = "1" if Mercurial reports the file as binary, otherwise "0"

if [ "$3" == "1" ]; then cat; else dos2unix; fi
-- End of crlf-filter.sh --

Notes/Limitations

hg-fast-export supports multiple branches but only named branches with exactly one head each. Otherwise commits to the tip of these heads within the branch will get flattened into merge commits.

As each git-fast-import run creates a new pack file, it may be required to repack the repository quite often for incremental imports (especially when importing a small number of changesets per incremental import).

The way the hg API and remote access protocol is designed it is not possible to use hg-fast-export on remote repositories (http/ssh). First clone the repository, then convert it.

Design

hg-fast-export.py was designed in a way that doesn't require a 2-pass mechanism or any prior repository analysis: if just feeds what it finds into git-fast-import. This also implies that it heavily relies on strictly linear ordering of changesets from hg, i.e. its append-only storage model so that changesets hg-fast-export already saw never get modified.

Submitting Patches

Please use the issue-tracker at github https://github.com/frej/fast-export to report bugs and submit patches.

shortlog
9 days ago Daniel Smallhg 4.7: Replace call to util.email with templatefilters... master
9 days ago Frej DrejhammarAdhere to PEP 394
2018-06-22 Frej DrejhammarMerge branch 'atykhyy-as-binary'
2018-06-17 Anton TykhyyAdd --filter-contents
2018-06-10 Frej DrejhammarAdapt to changes in Mercurial 4.6v180610
2018-06-10 GabrielAdd a section about system requirements to the README
2018-02-13 ceqiUpdate usage section example commandsv180317
2018-02-01 Frej DrejhammarWarn contributors from doing work that will no be merged
2018-01-24 Martin FreundQuote $PYTHON variable to support paths with spacesv180126
2017-10-02 Frej DrejhammarChange syntax of mapping filesv171002
2017-09-29 Frej DrejhammarRefactor load_mapping() to move line parsing to inner...
2017-08-25 Frej DrejhammarAvoid nuisance error printout from readlink testv170826
2017-08-18 Frej DrejhammarDon't break if the destination directory name contains... add-missing-quotev170818
2017-06-24 Frej DrejhammarEliminate bashismv170624
2017-06-05 Frej DrejhammarBehave nicely when the found readlink does not understa... v170617
2017-06-02 Frej Drejhammardocumentation: Point users to the issue tracker for... v170604
...
tags
2 months ago v180610 v180610
5 months ago v180317 v180317
6 months ago v180126 v180126
10 months ago v171002 v171002
11 months ago v170826 v170826
12 months ago v170818 v170818
13 months ago v170624 v170624
14 months ago v170617 v170617
14 months ago v170604 v170604
19 months ago v170101 v170101
23 months ago v160914 v160914
23 months ago v160415 Tag version 160415
heads
9 days ago master
12 months ago add-missing-quote
11 years ago git-p4
11 years ago hanwen
forks
Cached version (4625s old)
 source.codes 
fast-export/rorcz.git Local repo.or.cz fork with needed fixes mackyle@gmail.com 21 months ago
fast-export/benizi.git repo.or.cz@benizi.com 6 years ago
fast-export/fast-export-unix-compliant.git dereckson@espace... 6 years ago
fast-export/dharding.git dharding@gmail.com 6 years ago
fast-export/barak.git tweaks and maybe debian packaging info barak@cs.nuim.ie 6 years ago