Surch searches GitHub organizations, a list of user repositories or single repositories for strings
Surch iterates through a single GitHub repository or a whole GitHub organization for different strings based on user input. Provided a repository, all commits and branches will be searched in.
The output is a file containing the blob url and additional information in which one of the strings was found.
The initial idea behind Surch was to look for secrets but it can be used to search for just about anything.
While Gitrob provides mostly the same functionality (plus a whole plethora of additional features), we wanted something that would be lightweight and won't require a PostgreSQL server and other dependencies behind it. To that end, Surch requires no dependencies whatsoever aside from Python.
It's important to note that currently, Surch does not tell you what it found where. It simply tells you that it found one of the strings you searched for and in which commit it was found.
NOTE: For now we support in python 2.7 in the future we plain to support python 3
pip install surch
# latest development version
pip install http://github.com/cloudify-cosmo/surch/archive/master.tar.gz
$ surch --help
Usage: surch [OPTIONS] COMMAND [ARGS]...
Options:
--help Show this message and exit.
Commands:
org Search all or some repositories in an...
repo Search a single repository
user Search all or some repositories for a user
$ surch repo http://github.com/cloudify-cosmo/surch --string Surch --string Burch
...
2016-07-14 08:41:57,769 - surch - INFO - Pulling repo: surch...
2016-07-14 08:41:58,540 - surch - INFO - Scanning repo surch for 2 string(s)...
2016-07-14 08:41:59,579 - surch - INFO - Writing results to: ~/.surch/results/results.json...
2016-07-14 08:42:13,008 - surch - INFO - Found 603 results in 123 commits.
...
$ cat ~/.surch/results/results.json
...
{
"_default": {
"1": {
"blob_url": "https://github.com/cloudify-cosmo/surch/blob/46a5321e902c0bad927458f94825ec7ca0aab128/README.md",
"commit_sha": "46a5321e902c0bad927458f94825ec7ca0aab128",
"commit_time": "Tue Jul 12 10:15:30 2016",
"email": "[email protected]",
"filepath": "README.md",
"organization_name": "cloudify-cosmo",
"repository_name": "surch",
"username": "haviv"
},
"2": {
"blob_url": "https://github.com/cloudify-cosmo/surch/blob/46a5321e902c0bad927458f94825ec7ca0aab128/README.rst",
"commit_sha": "46a5321e902c0bad927458f94825ec7ca0aab128",
"commit_time": "Tue Jul 12 10:15:30 2016",
"email": "[email protected]",
"filepath": "README.rst",
"organization_name": "cloudify-cosmo",
"repository_name": "surch",
"username": "haviv"
},
...
}
}
NOTE: to search in an organization, replace user
with org
$ surch user havivw --string surch
...
2016-07-14 08:47:16,294 - surch - WARNING - Choosing not to provide GitHub credentials limits requests to GitHub to 60/h. This might affect cloning.
2016-07-14 08:47:16,294 - surch - INFO - Retrieving repository information for this user:havivw...
2016-07-14 08:47:17,727 - surch - INFO - Previous results file found. Backing up to ~/.surch/results/results.json.20160714T084717
2016-07-14 08:47:17,729 - surch - INFO - Cloning repo cloudify-interactive-tutorial from org Havivw to ~/.surch/clones/cloudify-interactive-tutorial...
2016-07-14 08:47:22,677 - surch - INFO - Scanning repo cloudify-interactive-tutorial for 1 string(s)...
2016-07-14 08:47:23,215 - surch - INFO - Writing results to: ~/.surch/results/results.json...
2016-07-14 08:47:23,215 - surch - INFO - Found 0 results in 45 commits.
...
- Cloned repositories are stored under ~/.surch/clones
- Result files are stored under ~/.surch/results
NOTE: Running the tests require an internet connection
git clone [email protected]:cloudify-cosmo/surch.git
cd surch
pip install tox
tox
..are always welcome.