How do I convert simple non source controlled project backups into a versioned git repository?

前端 未结 5 704
孤独总比滥情好
孤独总比滥情好 2021-01-04 22:15

I have been extremely naughty. I have been developing a piece of software (I\'m the only developer) for a little while (O.K., it\'s over the course of a few years), but have

相关标签:
5条回答
  • 2021-01-04 22:47

    There might be an already automated way of doing this, but git should be smart enough to let you git init in your oldest backup and then repeatedly copy the .git folder to incrementally newer backups and create a commit with everything for each one. Scripting this should be pretty easy.

    Even better, you could pass a --git-dir= option or $GIT_DIR environment variable to git and have it use a repository, saving the copying step.

    Something like this:

    cd $FINAL_DIR
    git init
    
    export GIT_DIR=$FINAL_DIR/.git
    
    cd $NEXT_BACKUP
    git add -A .
    git commit
    # rinse and repeat
    
    0 讨论(0)
  • 2021-01-04 22:56

    If you are using import-tars importer (in contrib/fast-import/), know that it used to create phony files at the top-level of the repository when the archive contains global PAX headers (to register commits), which made its own logic to detect and omit the common leading directory ineffective, which has been corrected with Git 2.27 (Q2 2020).

    See commit c839fcf (24 Mar 2020) by Johannes Schindelin (dscho).
    (Merged by Junio C Hamano -- gitster -- in commit 8633f21, 22 Apr 2020)

    import-tars: ignore the global PAX header

    Signed-off-by: Johannes Schindelin

    The tar importer in contrib/fast-import/import-tars.perl has a very convenient feature: if all paths stored in the imported .tar start with a common prefix, e.g. git-2.26.0/ in the tar at https://github.com/git/git/archive/v2.26.0.tar.gz, then this prefix is stripped.

    This feature makes a ton of sense because it is relatively common to import two or more revisions of the same project into Git, and obviously we don't want all files to live in a tree whose name changes from revision to revision.

    Now, the problem with that feature is that it breaks down if there is a pax_global_header "file" located outside of said prefix, at the top of the tree.

    This is the case for .tar files generated by Git's very own git archive command: it inserts that header, and git archive allows specifying a common prefix (that the header does _not_ share with the other files contained in the archive) via --prefix=my-project-1.0.0/.

    Let's just skip any global header when importing .tar files into Git.

    Note: this global header might contain useful information.
    For example, in the output of git archive, it lists the original commit, which _is_ useful information.

    A future improvement to the import-tars.perl script might be to include that information in the commit message, or do other things with the information (e.g. use mtime information contained in the global header as date of the commit).
    This patch does not prevent any future patch from making that happen, it only prevents the header from being treated as if it was a regular file.

    0 讨论(0)
  • 2021-01-04 22:58

    You can use example git-fast-import based tools distributed in git.git repository: import-zips.py (in Python) or import-tars.perl (in Perl), or use those script as a base of your own import script. You can find those scripts in contrib/fast-import/ directory.

    0 讨论(0)
  • 2021-01-04 23:04

    I don't quite know why you don't want to just commit all snapshots individually. I mean, a shell script (or Perl, Python, Ruby, Tcl, whatever) to do that, is probably less than 5 lines of code and less than 10 minutes of work.

    Also, there is git load-dirs, which would allow you to cut that down to maybe 3 lines and 5 minutes. But you still have to load every dir indvidually.

    But, if you are so inclined, there is the git fast-import tool which is intended to make writing repository converters and importers easier. According to the manpage, you could write an importer in about 100 lines and a couple of hours.

    However, all this ignores the biggest problem: the value of a VCS lies not in the contents – you could just as well use regular backups for that – but in the commit messages. And no magic tool is going to help you there, you'll have to type them all in yourself … and more importantly, you'll have to remember exactly why you made every single little change over the last years.

    0 讨论(0)
  • 2021-01-04 23:06

    Also check out the "A Custom Importer" section in the Migrating to Git chapter. which talks about this exact issue.

    0 讨论(0)
提交回复
热议问题