From 731f3c2c63d9b57a7cb2b32ed33b868cf1f4ff57 Mon Sep 17 00:00:00 2001 From: Brendan Patrick Purdy <72283450+bpurdy-ds@users.noreply.github.com> Date: Wed, 25 Oct 2023 10:11:45 -0700 Subject: [PATCH] !, verbiage --- index.ipynb | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/index.ipynb b/index.ipynb index e6e457e..03d9af5 100644 --- a/index.ipynb +++ b/index.ipynb @@ -888,7 +888,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Great, step 1 complete! We have unique identifiers (names) for each of our records (countries) that we will be able to use to connect 2018 World Cup performance to 2018 population." + "Step 1 complete. We have unique identifiers (names) for each of our records (countries) that we will be able to use to connect 2018 World Cup performance to 2018 population." ] }, { @@ -1233,7 +1233,7 @@ "source": [ "### Analysis of Wins\n", "\n", - "While we could try to understand all 32 of those numbers just by scanning through them, let's use some descriptive statistics and data visualizations instead!\n", + "While we could try to understand all 32 of those numbers just by scanning through them, let's use some descriptive statistics and data visualizations instead\n", "\n", "#### Statistical Summary of Wins\n", "\n", @@ -1378,7 +1378,7 @@ "\n", "> Add to the existing data structure so that it also connects each country name to its 2018 population, and create visualizations comparable to those from step 2.\n", "\n", - "Now we're ready to add the 2018 population to `combined_data`, finally using the CSV file!\n", + "Now we're ready to add the 2018 population to `combined_data`, finally using the CSV file\n", "\n", "Recall that `combined_data` currently looks something like this:\n", "```\n", @@ -1647,7 +1647,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Ohhhh...we have a data normalization issue! One dataset refers to this country as `'Iran, Islamic Rep.'`, while the other refers to it as `'Iran'`. This is a common issue we face when using data about countries and regions, where there is no universally-accepted naming convention.\n", + "Ohhhh...we have a data normalization issue. One dataset refers to this country as `'Iran, Islamic Rep.'`, while the other refers to it as `'Iran'`. This is a common issue we face when using data about countries and regions, where there is no universally-accepted naming convention.\n", "\n", "### Normalizing Locations in Population Data\n", "\n", @@ -1793,7 +1793,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "Great, now we should have 32 records instead of 27!\n", + "Great, now we should have 32 records instead of 27.\n", "\n", "### Type Conversion of Population Data\n", "\n", @@ -1924,7 +1924,7 @@ "source": [ "### Adding Population Data\n", "\n", - "Now it's time to add the population data to `combined_data`! Recall that the data structure currently looks like this:" + "Now it's time to add the population data to `combined_data`. Recall that the data structure currently looks like this:" ] }, { @@ -2472,7 +2472,7 @@ "- Remember that correlation is not causation. Even though we\n", " detected a weak positive correlation, that is not saying\n", " that increasing the population of a country would necessarily\n", - " improve their World Cup performance!\n", + " improve their World Cup performance.\n", "- At the same time, we have not *disproven* this causal link.\n", " In later lessons we will introduce additional statistical\n", " methods for detecting relationships between variables, but\n", @@ -2516,7 +2516,7 @@ "source": [ "## Summary\n", "\n", - "Congratulations! That was a long lab, pulling together a lot of material. You read data into Python, extracted the relevant information, cleaned the data, and combined the data into a new format to be used in analysis. While we will continue to introduce new tools and techniques, these essential steps will be present for the rest of your data science projects from here on out!" + "That was a long lab, pulling together a lot of material. You read data into Python, extracted the relevant information, cleaned the data, and combined the data into a new format to be used in analysis. While we will continue to introduce new tools and techniques, these essential steps will be present for the rest of your data science projects from here on out." ] } ],