Adagrams: Snow Leopards ( Kae & Diana) #66

sorindevops · 2022-09-29T23:07:11Z

No description provided.

Co-authored-by: sorindevops <[email protected]>

kaesluder · 2022-09-29T23:27:20Z

This passes all tests on 3.9, but fails on Learn using 3.8.8

anselrognlie

Nice job!

All your tests are passing and your code is laid out well overall. Nice use of using some built-ins as well your own approaches. And great job catching the warnings from sample. Of course, we always recommend rolling your own approach to practice breaking down problems on your own, so even if you have found a library function to help out, do consider implementing it yourself for the practice.

anselrognlie · 2022-10-01T19:11:48Z

README.md

@@ -1,5 +1,16 @@
 # AdaGrams

+## Collaboration Plan: 


Nice documentation of your collaboration plan.

anselrognlie · 2022-10-01T19:28:24Z

tests/test_wave_01.py

@@ -65,3 +68,64 @@ def test_letter_not_selected_too_many_times():
        # Assert
        for letter in letters:
            assert letter_freq[letter] <= LETTER_POOL[letter]
+
+def test_chi_square_frequencies():


I DO NOT have stats experience, so beyond being aware of the chi-square test sounding like the right thing to do, I can only say that your approach makes sense to me: tally up the observed letter counts over a "large" number of runs, and check whether the distribution looks like the expected distribution from the tile counts.

The test that we supplied really just checks that no impossible hands can be drawn, and while it would be possible for that test to be "defeated" by a faulty implementation (i.e., the implementation might not be properly enforceing the limits, but for all of the 1000 test runs, it never happens to exceed any), it will never yield a false negative. It will never fail as long as the implementation is correct.

Statistical based tests are a touchy subject. We want to be sure that unit tests NEVER have a chance of failing for working code. EVen worse if it fails sporadically. So any statistical test would need to be careful tuned to ensure that the tolerances are such that they would NEVER cause the test to fail. If there is a test that sporadically fails, that trains the programmers to ignore the test. If it fails, "it's just that test that fails sometimes," and that can mean that if it truly is failing, no one will take any notice.

Code that depends on randomness does present a problem for testing. Another approach is to avoid the randomness entirely in the test. In our code, rather than directly relying on a library function that has randomness baked in (like randint or sample) we might have our code depend on a passed in supplier of randomness. In the real code, we can provide an implementation that uses the typical provider of randomness, but in the test, we might pass in a more deterministic provider that we know will always produce a particular result, turning the test from a check of statistical probability more towards a test of expected stable behavior.

If we do still want to run some statistical tests (probably still a good idea), we might store them in a different suite of tests. There are more kinds of tests than just unit tests (short, small tests that we run frequently while developing to check basic behaviors), like integration testing, smoke testing, or fuzz testing. These aren't usually run by devs all the time, and hence, might provide a better situation for the test runner to spend more time looking into any failures without it becoming a blocker to the regular devlopment cycle. As it is, this test actually did fail for me once in the few times I ran it! 😱

anselrognlie · 2022-10-01T19:57:59Z

adagrams/game.py

+# import pprint
+
+
+LETTER_POOL = {


👍 I like that you pulled these large data objects outside the functions. This lets the functions just have the logic, rather than being dominated by the data definitions.

When using strings as the dictionary keys, I like to the use the dict() function-stlye of making a dictionary. It uses keyword argument calling to let us set string keys without needing to quote them. For example

LETTER_POOL = dict( A=9, B=2, C=2, ... )

anselrognlie · 2022-10-01T20:19:27Z

adagrams/game.py

+
+    # get the lists of letter and count for each letter
+    # as list (preferred by random.sample)
+    letter_list = list(LETTER_POOL.keys())


👍 Nice. Without doing this, sample raises a deprecation warning.

anselrognlie · 2022-10-01T20:20:55Z

adagrams/game.py

+    # to determine the frequency of each item. 
+    # https://docs.python.org/3/library/random.html#random.sample
+    # requires python version 3.9 or greater 
+    this_hand = sample(letter_list, k=10, counts=letter_count)


Really nice, concise approach. Thanks for including a comment about how you understand the function to work, and how you reasearched it. Now that we've discussed Big O, it's also useful to think about what the time/space complexity of the function might be. Thinking about how we can implement this functionality ourselves can help us gain insight into what that might be.

Remember that not every language has the same built-in functions, so we should be able to implement this same functionality ourselves. In the projects, we don't ask for any features that we don't think you should be able to write yourself. For drawing a random hand, the only external function that would be "required" is a way to pick a random number (such as randint). At this stage in our coding journeys, it's often more valuable to re-implement things by hand to practice thinking about how to breakdown and approach the problem than to use a single library function to solve the problem for us.

anselrognlie · 2022-10-01T21:08:02Z

adagrams/game.py

+
+    letters_from_word = list(word)
+
+    if 7 <= len(word) <= 10:    


anselrognlie · 2022-10-01T21:08:25Z

adagrams/game.py

+        if letter in SCORE_CHART.keys():
+            score_total += SCORE_CHART[letter]
+
+    return(score_total)


No need for () around the value being returned.

anselrognlie · 2022-10-01T21:09:18Z

adagrams/game.py

+    return(score_total)
+
+
+def _high_score_key_helper(word):


Did you intentionally start the function with _ to indicate this was intended for internal use?

anselrognlie · 2022-10-01T21:10:09Z

adagrams/game.py

+        score_total += extra_points
+
+    for letter in letters_from_word: 
+        if letter in SCORE_CHART.keys():


You did a check to see whether a letter was in score table. This is fine and has the affect that any unexpected tiles in the word don't contribute to the score. But where would those unexpected tiles have come from? And should it be the scoring function's job to handle those? We might expect some other part of the game to validate that any words played by a player include only tiles from the tile bank, in which case this function could safely ignore the in check. In fact, we might want to let python raise a KeyError so that the program can find out that, somehow, bad data got into our word.

anselrognlie · 2022-10-01T21:32:53Z

adagrams/game.py

+    # use our helper function to find the highest ranked
+    # word using max
+    high_word = max(word_list, key=_high_score_key_helper)


This is really clever! I can't decide whether I think this is more or less understandable than a more typical approach where we write the loop ourself, performing the comparison and tie handling behavior explcitly.

Writing it out is visually more noisy, but let's us see the decisions being made a bit more explicitly.

This approach depends on the reader understanding that tuples values are compared position by position, and that we'll only move on to the next value if the early one is a tie. So by building a tuple with the score, whether it has 10 tiles, and ultimately its length, we are providing the three criteria for the comparison in weighted order. So a higher score always wins. If there's a tied score, then 10 letter word wins (because True > False), but so long as everything had been False, we go on and check the length, and the shortest wins since you flipped the sign of the length.

So really great on concision and a fantastic use of the max higher order function, but it does leave a lot up to the reader to put it together about what's going on.

Having sat with this a bit more, I'm going to go with I like this better than the explicit approach in the long run, and that my initial hesitation has more to do with the languages I've used before. But this is Python, so when in Rome...

My main hesitation come from the fact that in other languages, the way we tend to provide customization for functions like min, max, and sort comes from providing a comparison function. We explicitly write the logic to compare two instances of the data in the list. Python does support this style of customization through the use of an adapter type (functools.cmp_to_key) that can turn a comparison function into something that will work as the key function. But if we embrace the key function fully, it can effectively give us a compound key, comparing several aspects as needed.

It still feels odd to me to compare True and False in a relative sense (Java, for instance does not allow this, but we could simply map true to 1 and false to 0), but Python definitely allows it.

So again, really clever implementation, and great use of the tools of the language!

I got really confused trying to figure out the if/else tree and thought about putting it into a comparison function so that we could test how the logic works just word vs. word. But python3 doesn't have comparison functions anymore, just key functions, and somewhere tuple sorting clicked analogous to (LastName, FirstName, MiddleInitial).

I agree that sorting on a Boolean feels a bit weird though.

kaesluder and others added 16 commits September 26, 2022 14:08

a test line to test the rep

350753d

Collaboration plan

a6182d0

Co-authored-by: sorindevops <[email protected]>

Checked off todo list

e7791b2

passing version of wave 1 using random.sample

22c668d

chi square test of sampling.

1a544b0

wave 2 started

f4b52a8

testing wave 1 chi-square test

23647b0

uses available letters: tests passed

4161970

start of Wave3

64e8efd

w3 needs to be test

b5a3713

minor bug fixes to pass tests

1fbdf5f

refactored test to use a dict comprehension

62a8ef1

high score code for wave 4

12e33fe

w4 started

af537eb

complete refactor of ranking words, passes tests

ec8f1cd

cleanup of print statements and variables

e069ce4

refactor to remove second call to len

6c37747

anselrognlie reviewed Oct 1, 2022

View reviewed changes

sorindevops added 2 commits December 13, 2022 14:41

try to pull

c2f1b4e

Merge branch 'master' of https://github.com/sorindevops/adagrams-py

cc7a91d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adagrams: Snow Leopards ( Kae & Diana) #66

Adagrams: Snow Leopards ( Kae & Diana) #66

sorindevops commented Sep 29, 2022

kaesluder commented Sep 29, 2022

anselrognlie left a comment

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 1, 2022

anselrognlie Oct 4, 2022

kaesluder Oct 4, 2022

Adagrams: Snow Leopards ( Kae & Diana) #66

Are you sure you want to change the base?

Adagrams: Snow Leopards ( Kae & Diana) #66

Conversation

sorindevops commented Sep 29, 2022

kaesluder commented Sep 29, 2022

anselrognlie left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment