This project is solving the Syncing NASA Open Source Projects challenge.

nasa open source project page scraper: aggregator of nasa's open source code across multiple sources (nasa's open source website, github, apache svn, sourceforge) and combining them into one easy to browse repository.

We are addressing the problem of having code that is distributed across multiple Source Versioning Repositories. The goal is to have a server program that will periodically retrieve the source in these repositories and save them in a GITHUB location. This will serve as both backup and version control.

Our approach is to first use a screen scraper to retrieve the code sources from the website provided in the problem definition and feed our extensible retrieval engine. The engine itself will be built using Node JS and crafted to allow for the creation of modules that will connect to and convert other code repositories to GITHUB.

Project Information

License: GNU General Public License
Source Code/Project URL:


example run -
source code for the site scraper -