Add API support for downloading git based urls #5

TG1999 · 2019-11-24T16:44:28Z

Partially solves issue #1
Signed-off-by: unknown <tushar.goel.dav.gmail.com>

TG1999 · 2019-11-25T11:08:59Z

@pombredanne please have a look on it

pombredanne

Thanks and sorry for the time it took! to review this!

fetchcode/giturl.py

pombredanne · 2019-12-03T17:16:34Z

fetchcode/giturl.py

+	if dest:
+		dest_dir = os.path.join(dest, branch_name)
+	else:
+		dest_dir = os.path.join(os.environ.get('CHARM_DIR'), "fetched",


Whats is CHARM_DIR?

Since you copied this code from https://github.com/juju/charm-helpers/blob/master/charmhelpers/fetch/giturl.py or similar we absolutely need to track its origin and license AND keep all original notice.

We never borrow or copy code without tracking it and documenting where it comes from.

Now, why not reusing charm-helpers as a library directly?

I think instead of using their module I should code mine with some little tweaks and how can I track its origin, please help me in that

fetchcode/giturl.py

pombredanne · 2019-12-03T17:18:19Z

fetchcode/giturl.py

+		return True
+
+def clone(source, dest, branch='master', depth=None):
+	if not canHandle(source):


May be a small docstring would help? after all you are not only cloning but also pulling

fetchcode/giturl.py

TG1999 · 2020-01-26T18:22:01Z

I got your point @pombredanne will do the changes in 1-2 days

pombredanne

Thank you and sorry for taking so long to review this!
See my comments inline. You also really want to start adding some tests too.

fetchcode/giturl.py

TG1999 · 2020-02-21T12:01:31Z

@pombredanne changes are done, please check them.

steven-esser

See my feedback.

Additionally, you need many more test URLs to make sure that your code will work in all cases. We need tests for Null URLs, '' URLs, git:// structured URLs, svn:// structured URLs, ftp:// structured URLs and many others.

fetchcode/giturl.py

steven-esser · 2020-02-21T19:58:23Z

fetchcode/giturl.py

+	Returns destination directory  
+	"""
+	url_parts = urlparse(source)
+	repo_name = url_parts.path.strip('/').split('/')[-1].split('.')[0]


This is very ugly and I do not know what it does. You should use urllib.parse to parse URLs instead of homemade string manipulation.

I am already using urllib.parse, but it only gives path, Then I have to parse that path into something meaningful. I agree it looks ugly, that's why I am now using multiple lines and explained what steps I am using. I hope that will work :)

Hmm, maybe you could use https://docs.python.org/3/library/pathlib.html to handle the various pieces of the path?

The point here being, parsing of paths has been done before by others. I would much rather use a tested library instead of splitting path strings, when possible.

TG1999 · 2020-02-29T06:22:19Z

Hi @MaJuRG, can you give me some sample URLs that I have to handle for test cases using git

steven-esser · 2020-03-01T15:46:38Z

@TG1999 Here is a list of possible git urls combos: https://stackoverflow.com/questions/31801271/what-are-the-supported-git-url-formats

You can craft samples from these skeletons.

TG1999 · 2020-03-03T16:14:40Z

Hey @MaJuRG , Thanks for your guidance I think now this PR is good to go

All the tests are covered(if any is left by chance, please provide me the URL of that case I will ad it also).
I have now used pathlib for taking out repo name.

steven-esser

Left some formatting comments.

A quick note: Our convention is to indent with spaces, not tabs. Our default is 4 spaces per line of indentation.

On a side note, have you ran the entire test suite for fetchcode? Since we have no CI at the moment, I will have to check later and see if everything has passed tests.

steven-esser · 2020-03-03T16:29:54Z

tests/test_giturl.py

+    """
+    Testing https based URLs
+    """
+


Remove this line break

steven-esser · 2020-03-03T16:30:06Z

tests/test_giturl.py

+    """
+    Testing git based URLs
+    """
+


Remove this line break

steven-esser · 2020-03-03T16:30:12Z

tests/test_giturl.py

+    """
+    Testing git+ssh based URLs
+    """
+


Remove this line break

steven-esser · 2020-03-03T16:30:18Z

tests/test_giturl.py

+    """
+    Testing git+https based URLs
+    """
+


Remove this line break

steven-esser · 2020-03-03T16:30:23Z

tests/test_giturl.py

+    """
+    Testing ssh based URLs
+    """
+


Remove this line break

steven-esser · 2020-03-03T16:32:02Z

fetchcode/giturl.py

+	"""
+    url_parts = urlparse(source)
+    if url_parts.scheme in ('https', 'git', 'git+ssh', 'ssh', 'git+https'):
+	    return True