POC: GitLab Migration - compress exported relations using zip

George Koltsov requested to merge georgekoltsov/bulk-imports-export-zip into master

What does this MR do and why?

This MR updates GitLab Migration to compress exported relations using zip instead of gz (with tar for archiving). It brings a few benefits:

  1. Fewer system calls. We can do all zipping in Ruby.
  2. We can read individual file contents directly from object storage instead of downloading the whole archive. This is future work: for now, this POC still downloads the whole file. It can be beneficial when importing a collection of heavy uploads that take up a lot of disk space.

To implement point 2 and download individual files directly from object storage, we need to know their filenames upfront. That would require additional changes to the exported data, e.g. a metadata.json describing the exported files, their filenames, and sizes, so that we can read them from object storage. We can iterate on this once we have transitioned to zip.
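
As a rough illustration of the metadata.json idea, here is a minimal sketch using only the Ruby standard library. It is not part of this MR: the helper names (`build_export_metadata`, `write_export_metadata`) and the `files` / `filename` / `size` layout are assumptions made for the example, and the real format would need to be agreed on separately.

```ruby
require "json"

# Hypothetical helper (not from this MR): list every regular file under
# export_dir with its path relative to export_dir and its size in bytes.
def build_export_metadata(export_dir)
  files = Dir.glob("**/*", base: export_dir)
             .select { |relative| File.file?(File.join(export_dir, relative)) }
             .sort
             .map do |relative|
               {
                 "filename" => relative,
                 "size" => File.size(File.join(export_dir, relative))
               }
             end

  # Assumed layout: a single top-level "files" array.
  { "files" => files }
end

# Hypothetical helper: write the manifest next to the exported files and
# return the metadata hash that was written.
def write_export_metadata(export_dir, metadata_name = "metadata.json")
  metadata = build_export_metadata(export_dir)
  File.write(File.join(export_dir, metadata_name), JSON.pretty_generate(metadata))
  metadata
end
```

With a manifest like this, an importer could look up filenames and sizes before deciding which entries to fetch, rather than downloading and unpacking the whole archive first.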

Mentions #376261 (closed)

Edited by George Koltsov