"
+ ],
+ "text/plain": [
+ " id tratamento tempo dose ... c-96 c-97 c-98 c-99\n",
+ "0 id_000644bb2 com_droga 24 D1 ... -0.3981 0.2139 0.3801 0.4176\n",
+ "1 id_000779bfc com_droga 72 D1 ... 0.1522 0.1241 0.6077 0.7371\n",
+ "2 id_000a6266a com_droga 48 D1 ... -0.6417 -0.2187 -1.4080 0.6931\n",
+ "3 id_0015fd391 com_droga 48 D1 ... -1.6210 -0.8784 -0.3876 -0.8154\n",
+ "4 id_001626bd3 com_droga 72 D2 ... 0.1094 0.2885 -0.3786 0.7125\n",
+ "... ... ... ... ... ... ... ... ... ...\n",
+ "23809 id_fffb1ceed com_droga 24 D2 ... 0.0631 0.9171 0.5258 0.4680\n",
+ "23810 id_fffb70c0c com_droga 24 D2 ... -0.2084 -0.1224 -0.2715 0.3689\n",
+ "23811 id_fffc1c3f4 com_controle 48 D2 ... 0.2256 0.7592 0.6656 0.3808\n",
+ "23812 id_fffcb9e7c com_droga 24 D1 ... 0.1732 0.7015 -0.6290 0.0740\n",
+ "23813 id_ffffdd77b com_droga 72 D1 ... -3.5770 -0.4775 -2.1500 -4.2520\n",
+ "\n",
+ "[23814 rows x 877 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 2
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "zGB4Kq8JvmZ5"
+ },
+ "source": [
+ "The dataset we have used so far has a column named ```droga```, but with Vanessa's help we learned that this is not the best name for it.\n",
+ "So we use the ```rename``` function from the Pandas library, with a dictionary mapping the old name to the new one, to rename this column to ```composto```.\n",
+ "Note that we pass ```inplace = True```, which makes the change happen in place, so the dataframe itself is updated.\n",
+ "If this parameter is omitted, the default is ```inplace = False```: the function returns a modified copy of the object, which you need to assign to a variable if you want to keep it."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "id": "CFse2h0Vcrp4"
+ },
+ "source": [
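+ "# Dictionary mapping the old column name to the new one\n",
+ "mapa = {'droga': 'composto'}\n",
+ "# Rename the column in place (no copy is returned)\n",
+ "dados.rename(columns=mapa, inplace=True)"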
+ ],
+ "execution_count": 3,
+ "outputs": []
+ },
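The difference between the two `inplace` settings can be sketched on a small made-up frame. This is only an illustration: the `toy` DataFrame and its values are assumptions and are not part of the course dataset.

```python
import pandas as pd

# Toy frame standing in for `dados`; the values are made up for illustration.
toy = pd.DataFrame({"id": ["a", "b"], "droga": ["x1", "x2"]})

# Without inplace, rename returns a new DataFrame and leaves `toy` unchanged.
renamed = toy.rename(columns={"droga": "composto"})
columns_before = list(toy.columns)
print(columns_before)          # still the original names
print(list(renamed.columns))   # the copy carries the new name

# With inplace=True, `toy` itself is modified and the call returns None.
result = toy.rename(columns={"droga": "composto"}, inplace=True)
print(result, list(toy.columns))
```

Assigning the returned copy (`toy = toy.rename(...)`) is an equally valid style and avoids relying on the `None` return value.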
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "lq5EDpC4w_rS"
+ },
+ "source": [
+ "Here we use the ```head``` function to display the first 5 rows of the dataset, so we can check that the column was renamed as expected."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 243
+ },
+ "id": "VX73K_tLcoQg",
+ "outputId": "1081b0e7-21a3-4d7f-e0b3-5c0ea0744798"
+ },
+ "source": [
+ "dados.head()"
+ ],
+ "execution_count": 4,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/html": [
+ "<div>\n",
+ "<table>\n",
+ "<thead>\n",
+ "<tr><th></th><th>id</th><th>tratamento</th><th>tempo</th><th>dose</th><th>composto</th><th>g-0</th><th>...</th><th>c-99</th></tr>\n",
+ "</thead>\n",
+ "<tbody>\n",
+ "<tr><th>0</th><td>id_000644bb2</td><td>com_droga</td><td>24</td><td>D1</td><td>b68db1d53</td><td>1.0620</td><td>...</td><td>0.4176</td></tr>\n",
+ "<tr><th>1</th><td>id_000779bfc</td><td>com_droga</td><td>72</td><td>D1</td><td>df89a8e5a</td><td>0.0743</td><td>...</td><td>0.7371</td></tr>\n",
+ "<tr><th>2</th><td>id_000a6266a</td><td>com_droga</td><td>48</td><td>D1</td><td>18bb41b2c</td><td>0.6280</td><td>...</td><td>0.6931</td></tr>\n",
+ "<tr><th>3</th><td>id_0015fd391</td><td>com_droga</td><td>48</td><td>D1</td><td>8c7f86626</td><td>-0.5138</td><td>...</td><td>-0.8154</td></tr>\n",
+ "<tr><th>4</th><td>id_001626bd3</td><td>com_droga</td><td>72</td><td>D2</td><td>7cbed3131</td><td>-0.3254</td><td>...</td><td>0.7125</td></tr>\n",
+ "</tbody>\n",
+ "</table>\n",
+ "<p>5 rows × 877 columns</p>\n",
+ "</div>"
+ ],
+ "text/plain": [
+ " id tratamento tempo dose ... c-96 c-97 c-98 c-99\n",
+ "0 id_000644bb2 com_droga 24 D1 ... -0.3981 0.2139 0.3801 0.4176\n",
+ "1 id_000779bfc com_droga 72 D1 ... 0.1522 0.1241 0.6077 0.7371\n",
+ "2 id_000a6266a com_droga 48 D1 ... -0.6417 -0.2187 -1.4080 0.6931\n",
+ "3 id_0015fd391 com_droga 48 D1 ... -1.6210 -0.8784 -0.3876 -0.8154\n",
+ "4 id_001626bd3 com_droga 72 D2 ... 0.1094 0.2885 -0.3786 0.7125\n",
+ "\n",
+ "[5 rows x 877 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 4
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "i7mgnrykiMNb"
+ },
+ "source": [
+ "We want to improve the visualization of our histogram of compounds and, since there are more than 3,000 different compounds in our dataset, we decided to plot only the 5 compounds that appear most often.\n",
+ "To do that, we use ```value_counts``` (a Pandas function that counts the occurrences of each distinct value and sorts them from most to least frequent) and, since we only want the 5 most frequent elements, we also take ```index[0:5]```. This last part keeps only the index labels in the half-open interval [0, 5), that is, the names of the 5 most frequent compounds."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "id": "D5hK5mcVej_G"
+ },
+ "source": [
+ "cod_compostos = dados['composto'].value_counts().index[0:5]"
+ ],
+ "execution_count": 5,
+ "outputs": []
+ },
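How `value_counts` and the index slice work together can be sketched on a tiny made-up Series. The labels `c1`, `c2`, `c3` are hypothetical and only stand in for the real compound codes.

```python
import pandas as pd

# Hypothetical compound labels; "c1" appears 3 times, "c2" twice, "c3" once.
compounds = pd.Series(["c1", "c2", "c1", "c3", "c1", "c2"])

# value_counts returns a Series indexed by label, sorted by descending count.
counts = compounds.value_counts()

# Slicing the index keeps only the labels, here the 2 most frequent ones.
top2 = counts.index[0:2]

print(counts.to_dict())
print(list(top2))
```

The slice `[0:2]` follows the usual Python convention: position 0 is included and position 2 is excluded.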
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "hrOGPnbP4cE_"
+ },
+ "source": [
+ "In the cell above, we created the variable ```cod_compostos``` and assigned it the result of that expression.\n",
+ "Now we evaluate the new variable to check the result."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/"
+ },
+ "id": "LgrRw3eSfLYu",
+ "outputId": "73d9d558-2ab6-4f58-8700-89b663a708be"
+ },
+ "source": [
+ "cod_compostos"
+ ],
+ "execution_count": 6,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ "Index(['cacb2b860', '87d714366', '9f80f3f77', '8b87a7a83', '5628cb3ee'], dtype='object')"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 6
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "1JANbNGx4wQO"
+ },
+ "source": [
+ "There are several ways to filter a dataset; we chose the ```query``` function from Pandas. As a curiosity, its syntax resembles SQL (a query language for databases).\n",
+ "Its structure is quite simple: we take the dataframe, call the function, and pass as a parameter the condition used to filter our dataset.\n",
+ "\n",
+ "At this point in the project, we want to filter our data, keeping only the rows whose compound is in our list ```cod_compostos``` (the 5 most tested compounds in the experiment), and we use the ```query``` method to do it.\n",
+ "\n",
+ "The function takes a string containing the selection logic. What we want is for ```query``` to return every row containing one of the 5 most used compounds, so the string we need is ```composto in @cod_compostos```.\n",
+ "\n",
+ "We use ```composto``` because that is the column to check in the dataframe, and ```cod_compostos``` because it holds the top 5 compounds. The detail here is that the ```@``` is required to tell ```query``` that ```cod_compostos``` is a variable defined outside the query string."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 419
+ },
+ "id": "_GlSwTMGfpFs",
+ "outputId": "9c502a32-7341-4969-fc7e-4ac31600ac06"
+ },
+ "source": [
+ "dados.query('composto in @cod_compostos')"
+ ],
+ "execution_count": 7,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/html": [
+ "<div>\n",
+ "<table>\n",
+ "<thead>\n",
+ "<tr><th></th><th>id</th><th>tratamento</th><th>tempo</th><th>dose</th><th>composto</th><th>g-0</th><th>...</th><th>c-99</th></tr>\n",
+ "</thead>\n",
+ "<tbody>\n",
+ "<tr><th>7</th><td>id_0020d0484</td><td>com_droga</td><td>48</td><td>D1</td><td>8b87a7a83</td><td>0.2711</td><td>...</td><td>0.0260</td></tr>\n",
+ "<tr><th>16</th><td>id_002fb9c19</td><td>com_droga</td><td>48</td><td>D1</td><td>87d714366</td><td>8.7380</td><td>...</td><td>-6.7840</td></tr>\n",
+ "<tr><th>25</th><td>id_0054388ec</td><td>com_controle</td><td>48</td><td>D1</td><td>cacb2b860</td><td>-0.6696</td><td>...</td><td>1.0050</td></tr>\n",
+ "<tr><th>38</th><td>id_0079af0fb</td><td>com_controle</td><td>24</td><td>D1</td><td>cacb2b860</td><td>-0.1636</td><td>...</td><td>-1.4820</td></tr>\n",
+ "<tr><th>40</th><td>id_007bfbb91</td><td>com_controle</td><td>24</td><td>D2</td><td>cacb2b860</td><td>-1.3200</td><td>...</td><td>0.9180</td></tr>\n",
+ "<tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td></tr>\n",
+ "<tr><th>23793</th><td>id_ffd26f361</td><td>com_controle</td><td>48</td><td>D2</td><td>cacb2b860</td><td>0.6008</td><td>...</td><td>0.7082</td></tr>\n",
+ "<tr><th>23802</th><td>id_fff3976bd</td><td>com_droga</td><td>24</td><td>D1</td><td>87d714366</td><td>3.2890</td><td>...</td><td>-3.7000</td></tr>\n",
+ "<tr><th>23805</th><td>id_fff6df1c5</td><td>com_droga</td><td>48</td><td>D2</td><td>5628cb3ee</td><td>1.7380</td><td>...</td><td>-0.8302</td></tr>\n",
+ "<tr><th>23811</th><td>id_fffc1c3f4</td><td>com_controle</td><td>48</td><td>D2</td><td>cacb2b860</td><td>0.3942</td><td>...</td><td>0.3808</td></tr>\n",
+ "<tr><th>23812</th><td>id_fffcb9e7c</td><td>com_droga</td><td>24</td><td>D1</td><td>8b87a7a83</td><td>0.6660</td><td>...</td><td>0.0740</td></tr>\n",
+ "</tbody>\n",
+ "</table>\n",
+ "<p>3235 rows × 877 columns</p>\n",
+ "</div>"
+ ],
+ "text/plain": [
+ " id tratamento tempo dose ... c-96 c-97 c-98 c-99\n",
+ "7 id_0020d0484 com_droga 48 D1 ... 0.6633 0.4562 -0.9622 0.0260\n",
+ "16 id_002fb9c19 com_droga 48 D1 ... -9.9840 -9.9840 -9.9840 -6.7840\n",
+ "25 id_0054388ec com_controle 48 D1 ... 0.8837 0.5534 0.8976 1.0050\n",
+ "38 id_0079af0fb com_controle 24 D1 ... 0.2187 0.0737 0.6498 -1.4820\n",
+ "40 id_007bfbb91 com_controle 24 D2 ... -0.5751 0.3362 0.8543 0.9180\n",
+ "... ... ... ... ... ... ... ... ... ...\n",
+ "23793 id_ffd26f361 com_controle 48 D2 ... 0.8813 0.7757 -0.5272 0.7082\n",
+ "23802 id_fff3976bd com_droga 24 D1 ... -8.4480 -4.4550 -5.7930 -3.7000\n",
+ "23805 id_fff6df1c5 com_droga 48 D2 ... 0.1516 0.4503 -0.6312 -0.8302\n",
+ "23811 id_fffc1c3f4 com_controle 48 D2 ... 0.2256 0.7592 0.6656 0.3808\n",
+ "23812 id_fffcb9e7c com_droga 24 D1 ... 0.1732 0.7015 -0.6290 0.0740\n",
+ "\n",
+ "[3235 rows x 877 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 7
+ }
+ ]
+ },
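As a rough check of what the `query` string does, the sketch below compares it with an explicit boolean mask built with `isin`. The `toy` frame and the `top` list are made-up stand-ins for `dados` and `cod_compostos`.

```python
import pandas as pd

# Made-up data standing in for `dados`.
toy = pd.DataFrame({
    "composto": ["c1", "c2", "c3", "c1"],
    "tempo": [24, 48, 72, 24],
})

# Stand-in for `cod_compostos`; any list-like of labels works here.
top = ["c1", "c3"]

# The @ prefix tells query to read `top` from the surrounding Python scope.
by_query = toy.query("composto in @top")

# The same filter written as a boolean mask.
by_mask = toy[toy["composto"].isin(top)]

print(by_query.equals(by_mask))
print(list(by_query.index))
```

Both forms keep the original row labels, which is why the filtered output in the notebook still shows indices such as 7, 16 and 25 instead of a fresh 0-based numbering.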
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "p9FQo1EuFF8M"
+ },
+ "source": [
+ "Now that we have seen the filter work, returning a dataset with 3,235 rows, we can pass the result of ```query``` as the ```data``` parameter of ```countplot```, our bar chart.\n",
+ "```countplot``` is a ready-made chart from the ```Seaborn``` library, so we need its standard import (```import seaborn as sns```). In addition, here in Google Colaboratory, to see the chart with the library's default styling we need to run ```sns.set()```.\n",
+ "To refine the presentation of the chart, we can also use some features of the ```Matplotlib``` library (importing it first with ```import matplotlib.pyplot as plt```).\n",
+ "We set the size of the chart with the ```figure``` function and its parameter ```figsize=(x, y)```, and the title with ```set_title('Title')```.\n",
+ "As mentioned in class, we usually store the chart in a variable ```ax``` and then set the remaining options on it (for example, ```ax.set_title('Title')```).\n",
+ "Finally, to display the bar chart, we call ```plt.show()```."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 518
+ },
+ "id": "IePUr98kdgED",
+ "outputId": "c238d326-41d3-46f8-b33f-ac65320aff00"
+ },
+ "source": [
+ "import seaborn as sns\n",
+ "import matplotlib.pyplot as plt\n",
+ "sns.set()\n",
+ "plt.figure(figsize=(10, 8))\n",
+ "ax = sns.countplot(x = 'composto', data=dados.query('composto in @cod_compostos'))\n",
+ "ax.set_title('Top 5 compostos')\n",
+ "plt.show()"
+ ],
+ "execution_count": 8,
+ "outputs": [
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "bA_RUClHM6I8"
+ },
+ "source": [
+ "So far we have analyzed time, dose, compounds and related columns. However, we have not yet looked at the gene expression data (the G columns) or the cell viability data (the C columns). Can we draw a bar chart for these data as well?\n",
+ "Recall that our dataset contains more than 3,000 compounds. But how many distinct values appear in the ```g-0``` column? \n",
+ "To answer this question, we use the Pandas ```unique()``` method, which returns the distinct values present in the column. By default the result is an array of those values, but what we want is its size, so we wrap the call in ```len``` to count how many distinct values there are."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/"
+ },
+ "id": "DkQ2wH9Gj-w7",
+ "outputId": "2abfe103-7199-4e96-f9eb-9d229b6cf32b"
+ },
+ "source": [
+ "len(dados['g-0'].unique())"
+ ],
+ "execution_count": 9,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ "14367"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 9
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "wM4yn28tRflX"
+ },
+ "source": [
+ "Since the ```g-0``` column holds 14,367 distinct values, it is not feasible to draw the same chart we used before. \n",
+ "We therefore need a new strategy to visualize these data, and here we will use a histogram.\n",
+ "The first step is to find the minimum (```min()```) and maximum (```max()```) values, so that we understand the numeric range we are working with."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/"
+ },
+ "id": "mCC5DpK1kmiX",
+ "outputId": "388669ea-0b94-4b43-d562-bc23107088a9"
+ },
+ "source": [
+ "dados['g-0'].min()"
+ ],
+ "execution_count": 10,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ "-5.513"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 10
+ }
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/"
+ },
+ "id": "qzH4Jl0OksuL",
+ "outputId": "840787d0-9a72-4ea4-d863-f16c54549112"
+ },
+ "source": [
+ "dados['g-0'].max()"
+ ],
+ "execution_count": 11,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ "10.0"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 11
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "-Tp7M_wWUCU8"
+ },
+ "source": [
+ "Now that we know our range goes from about -5.5 to 10.0, we can move on to the histogram. The plotting function we will use comes from Pandas (```dataframe['variable'].hist()```).\n",
+ "As soon as we run it, we notice that the visualization is still not good: the default bars of the histogram cover very wide intervals, which makes the data hard to interpret.\n",
+ "So we add a parameter to the call, ```bins``` (the number of intervals), to split the range more finely and, as a result, see the data better.\n",
+ "With the bins set to 100, we can see that the shape comes very close to a well-known curve: the normal curve."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 286
+ },
+ "id": "RnjRPpubk1z0",
+ "outputId": "60f94bad-cd61-4b06-aeef-c79d245a49b0"
+ },
+ "source": [
+ "dados['g-0'].hist(bins = 100)"
+ ],
+ "execution_count": 12,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 12
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "ILTZmvd2fczV"
+ },
+ "source": [
+ "Here we draw the same histogram for another gene, ```g-19```, so that we can compare the two charts and make some observations about them.\n",
+ "We notice, for example, that the two charts follow the same overall curve shape, but there is a shift to the right."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 286
+ },
+ "id": "1_aedkvToYlB",
+ "outputId": "1fb80d8d-3a09-4433-e220-fe35cf995a41"
+ },
+ "source": [
+ "dados['g-19'].hist(bins = 100)"
+ ],
+ "execution_count": 13,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 13
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "i8HZHcicEJ0F"
+ },
+ "source": [
+ "Since we concluded that plotting a chart for every gene is not feasible, let us look at a brief statistical summary of them instead. \n",
+ "For this we use the Pandas ```describe``` method, which computes and reports several statistics that help in understanding the data (count, mean, standard deviation, minimum, the 25%, 50% and 75% quartiles, and maximum).\n"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 331
+ },
+ "id": "Kv-2zPLBprgD",
+ "outputId": "9968a9fc-9780-4ded-fc89-22f1caad14b0"
+ },
+ "source": [
+ "dados.describe()"
+ ],
+ "execution_count": 14,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ " tempo g-0 ... c-98 c-99\n",
+ "count 23814.000000 23814.000000 ... 23814.000000 23814.000000\n",
+ "mean 48.020156 0.248366 ... -0.470252 -0.301505\n",
+ "std 19.402807 1.393399 ... 1.834828 1.407918\n",
+ "min 24.000000 -5.513000 ... -10.000000 -10.000000\n",
+ "25% 24.000000 -0.473075 ... -0.592600 -0.562900\n",
+ "50% 48.000000 -0.008850 ... 0.014000 -0.019500\n",
+ "75% 72.000000 0.525700 ... 0.461275 0.438650\n",
+ "max 72.000000 10.000000 ... 3.111000 3.805000\n",
+ "\n",
+ "[8 rows x 873 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 14
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "G4mCKUkUFYhb"
+ },
+ "source": [
+ "At this point we select from the dataset only the variables we want to analyze (for example, ```g-0``` and ```g-1```) by passing a list of column names.\n",
+ "However, although this works well for selecting a few columns, there are 772 genes (```g-0``` to ```g-771```), and writing them out one by one would be very laborious, so we will do it another way."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 399
+ },
+ "id": "OAkjbZdkqPlR",
+ "outputId": "1f7cfb30-4c6a-4921-c7e3-b37767d89f96"
+ },
+ "source": [
+ "dados[['g-0', 'g-1']]"
+ ],
+ "execution_count": 15,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ " g-0 g-1\n",
+ "0 1.0620 0.5577\n",
+ "1 0.0743 0.4087\n",
+ "2 0.6280 0.5817\n",
+ "3 -0.5138 -0.2491\n",
+ "4 -0.3254 -0.4009\n",
+ "... ... ...\n",
+ "23809 0.1394 -0.0636\n",
+ "23810 -1.3260 0.3478\n",
+ "23811 0.3942 0.3756\n",
+ "23812 0.6660 0.2324\n",
+ "23813 -0.8598 1.0240\n",
+ "\n",
+ "[23814 rows x 2 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 15
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "0LHi_9AUGFmS"
+ },
+ "source": [
+ "A more direct strategy, which does not require writing out the list gene by gene, is to use the Pandas ```loc[]``` indexer.\n",
+ "As the first argument we pass ```:```; the colon makes ```loc[]``` return all rows, which is useful when we do not know how many rows the DataFrame has. As the second argument we pass the columns we are interested in. In this case we want every row of the columns from ```g-0``` through ```g-771```; note that a label slice in ```loc[]``` includes both endpoints.\n",
+ "Finally, we can apply the function we are interested in, ```describe()```, to the data selected by this filter."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 331
+ },
+ "id": "dQX0KOhUqm66",
+ "outputId": "9966282b-ee3b-4852-a094-227c3cce5262"
+ },
+ "source": [
+ "dados.loc[:,'g-0':'g-771'].describe()"
+ ],
+ "execution_count": 16,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ " g-0 g-1 ... g-770 g-771\n",
+ "count 23814.000000 23814.000000 ... 23814.000000 23814.000000\n",
+ "mean 0.248366 -0.095684 ... -0.219210 0.101524\n",
+ "std 1.393399 0.812363 ... 1.326193 1.417674\n",
+ "min -5.513000 -5.737000 ... -10.000000 -10.000000\n",
+ "25% -0.473075 -0.562200 ... -0.554400 -0.523800\n",
+ "50% -0.008850 -0.046600 ... 0.028700 -0.006500\n",
+ "75% 0.525700 0.403075 ... 0.496400 0.536950\n",
+ "max 10.000000 5.039000 ... 10.000000 10.000000\n",
+ "\n",
+ "[8 rows x 772 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 16
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "8mp610qzHQGY"
+ },
+ "source": [
+ "Although describe gathers the statistics we are interested in, the resulting dataframe is hard to analyze. To make it easier to understand, we will plot histograms that help us visualize the statistics across all the selected columns.\n",
+ "Earlier, working with the original dataframe, we plotted the histogram of a single column. Now our data of interest is the ```describe()``` output built from ```loc[]```, and from this point of view we no longer want a histogram column by column (genes); we want it built from the rows (statistics). For that reason we transpose the dataframe, turning rows into columns and vice versa.\n",
+ "To do this, we append ```.T``` to the previous code that produced the ```describe()``` output.\n",
+ "That is, we keep all the code up to ```describe()``` and add ```.T``` at the end. Running that line alone returns the same dataframe, transposed. Since our goal here is to produce histograms, we then select the target statistic as a column (```.T['statistic']```) and, finally, call ```.hist(bins = number of bins)``` to draw the histogram.\n",
+ "\n"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 286
+ },
+ "id": "ij9QVxGArZsd",
+ "outputId": "d0594cfd-08a0-4035-da4a-69360e4b263b"
+ },
+ "source": [
+ "dados.loc[:,'g-0':'g-771'].describe().T['mean'].hist(bins=30)"
+ ],
+ "execution_count": 17,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 17
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "g8FvuJapKhco"
+ },
+ "source": [
+ "It is worth replicating the analysis we developed for the ```genes (g)``` on the ```cell types (c)```. \n",
+ "So we copy the line of code that produces the histograms and change the column selection ```loc[:,'g-0':'g-771'] -> loc[:,'c-0':'c-99']``` and the number of bins ```hist(bins=30) -> hist(bins=50)```."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 286
+ },
+ "id": "hu2jZ2snuJ09",
+ "outputId": "bc773912-7bf2-49de-e099-0b406b6ed129"
+ },
+ "source": [
+ "dados.loc[:,'c-0':'c-99'].describe().T['mean'].hist(bins=50)"
+ ],
+ "execution_count": 20,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 20
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXQAAAD7CAYAAAB68m/qAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAATq0lEQVR4nO3de2xUZf7H8c9Ma1sK05SWaR0BRY0gugpJ2XRj1j+2VEFo6YLBapVNugvGGLwkXmBZVi41sOOFSFYJEKMxuwQ3JAppUesFAglegq6sIihut2hjC4UpKy1gCzNn/9gf/em2Mz0z58yFh/frL/r0zPN8v3NOPzk5nT54LMuyBAC44HnTXQAAwB0EOgAYgkAHAEMQ6ABgCAIdAAxBoAOAIQh0ADBEdroLOHHilCKR5H0Uvrh4hEKhnqTNnyr0kVnoI7OY0IfdHrxej0aOHD7o99Ie6JGIldRAP7+GCegjs9BHZjGhD6c98MgFAAxBoAOAIQh0ADAEgQ4AhiDQAcAQBDoAGIJABwBDpP1z6Eg9X8Ew5eUOPPU/9J5T98kzaagIgBsI9ItQXm62qh/ZNmC88dkadaehHgDu4JELABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCGINABwBAEOgAYgkAHAEPYCvRgMKiKigpNmDBBhw4dkiSdOHFCCxYs0LRp01RdXa2FCxeqq6srqcUCAKKzFehTp07Vpk2bNHr06P4xj8ej+fPnq7m5WY2NjRo7dqyeeeaZpBUKAIjNVqBPmTJFgUDgJ2OFhYUqLy/v/3ry5Mlqb293tzoAgG2uPEOPRCLavHmzKioq3JgOAJAAV7bPbWhoUH5+vu655564X1tcPMKNEmLy+31JXyMVUtGHKWukAn1kFhP6cNqD40APBoP65ptvtH79enm98d/wh0I9ikQsp2VE5ff7dOzYhb/Lt5t9xLpokv1ecT4yC31kDrs9eL2eqDfCjgJ9zZo12r9/vzZu3KicnBwnUwEAHLIV6E8++aTefvttHT9+XPX19SosLNRzzz2nDRs2aNy4cbrzzjslSWPGjNELL7yQ1IIBAIOzFehLly7V0qVLB4x/9dVXrhcEAEgMfykKAIYg0AHAEAQ6ABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCGINABwBAEOgAYwpX90JGZfAXDlJfLKQYuFvy0GywvN1vVj2wbMN74bE0aqgGQbDxyAQBDEOgAYAgCHQAMQaADgCEIdAAwBIEOAIYg0AHAEAQ6ABiCQAcAQxDoAGCIIQM9GAyqoqJCEyZM0KFDh/rHW1tbVVtbq2nTpqm2tlaHDx9OZp0AgCEMGehTp07Vpk2bNHr06J+ML1u2THV1dWpublZdXZ2eeOKJpBUJABjakIE+ZcoUBQKBn4yFQiEdOHBAVVVVkqSqqiodOHBAXV1dyakSADCkhJ6hd3R0qLS0VFlZWZKkrKwslZSUqKOjw9XiAAD2pX373OLiEUlfw+/3JX2NVEhFH6askQr0kVlM6MNpDwkFeiAQ0NGjRxUOh5WVlaVwOKzOzs4Bj2bsCIV6FIlYiZRhi9/v07Fj3UmbP1US6SORiyPZ79XFfD4yEX1kDrs9eL2eqDfCCT1yKS4u1sSJE9XU1CRJampq0sSJE1VUVJTIdAAAFwx5h/7kk0/q7bff1vHjx1VfX6/CwkJt375dy5cv1+LFi7Vu3ToVFBQoGAymol4AQBRDBvrSpUu1dOnSAeNXX321tmzZkpSiAADx4y9FAcAQBDoAGIJABwBDEOgAYAgCHQAMQaADgCEIdAAwBIEOAIYg0AHAEAQ6ABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCGINABwBAEOgAYgkAHAEMQ6ABgCAIdAAxBoAOAIQh0ADBEttMJdu7cqbVr18qyLFmWpYULF+rWW291ozYAQBwcBbplWXr88ce1adMm
jR8/Xl9++aXuuusuVVZWyuvl5h8AUslx6nq9XnV3d0uSuru7VVJSQpgDQBo4ukP3eDx67rnndP/99ys/P1+nTp3Sxo0b3aoNABAHR4F+7tw5bdiwQevWrVNZWZk++eQTPfzww9q+fbuGDx9ua47i4hFOSrDF7/clfY1USEUfpqyRCvSRWUzow2kPjgL94MGD6uzsVFlZmSSprKxMw4YNU0tLi2688UZbc4RCPYpELCdlxOT3+3TsWHfS5k+VRPpI5OJI9nt1MZ+PTEQfmcNuD16vJ+qNsKOH3ZdeeqmOHDmif/3rX5KklpYWhUIhXX755U6mBQAkwNEdut/v1/Lly/XQQw/J4/FIklatWqXCwkJXigMA2Of4c+izZs3SrFmz3KgFAOAAny8EAEMQ6ABgCAIdAAxBoAOAIQh0ADAEgQ4AhiDQAcAQBDoAGIJABwBDEOgAYAgCHQAMQaADgCEIdAAwBIEOAIYg0AHAEAQ6ABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCGINABwBDZTifo7e3VqlWr9MEHHyg3N1eTJ09WQ0ODG7UBAOLgONCffvpp5ebmqrm5WR6PR8ePH3ejLgBAnBwF+qlTp7R161bt2rVLHo9HkjRq1ChXCgMAxMfRM/S2tjYVFhbq+eef15w5czRv3jx9/PHHbtUGAIiDozv0cDistrY2XXfddVq0aJH+8Y9/6L777tM777yjESNG2JqjuNjecU74/b6kr+GmvrNh5VySNWC8oDBfuYOM954NDzqeiFS8Vxfa+YiGPjKLCX047cFRoAcCAWVnZ6uqqkqSNGnSJI0cOVKtra264YYbbM0RCvUoErGclBGT3+/TsWPdSZs/Gfx+n6of2TZgvPHZmrjH45Xs9+pCPB+DoY/MYkIfdnvwej1Rb4QdPXIpKipSeXm59uzZI0lqbW1VKBTSFVdc4WRaAEACHH/KZcWKFVqyZImCwaCys7P11FNPqaCgwI3aAABxcBzoY8eO1V/+8hc3agEAOMBfigKAIQh0ADAEgQ4AhiDQAcAQBDoAGIJABwBDEOgAYAgCHQAMQaADgCEIdAAwBIEOAIYg0AHAEAQ6ABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCGINABwBAEOgAYgkAHAEMQ6ABgCNcC/fnnn9eECRN06NAht6YEAMTBlUD/4osvtG/fPo0ePdqN6QAACXAc6H19fVq5cqWWL1/uQjkAgEQ5DvS1a9dq1qxZGjNmjBv1AAASlO3kxZ9++qn279+vRx99NOE5iotHOCnBFr/fl/Q1EtF3NqycS7LSXUa/vrPhQd+r3rNh5Q5SZ7TxofoabI1or0nXeDQ/Pv7HfWTauYxHpv58xMuEPpz24CjQ9+7dq5aWFk2dOlWSdOTIEf3ud7/T6tWr9ctf/tLWHKFQjyIRy0kZMfn9Ph071p20+Z3w+32qfmTbgPHGZ2vSUI2Uc0lW1HriHY/2nkc7H7Hei3QcH41b82SKTP75iIcJfdjtwev1RL0RdhTo9957r+69997+rysqKrR+/XqNHz/eybQAgATwOXQAMISjO/T/tWPHDjenAwDEgTt0ADAEgQ4AhiDQAcAQBDoAGIJABwBDEOgAYAgCHQAMQaADgCEIdAAwBIEOAIYg0AHAEK7u5ZJKvoJhyssdWP4PvefUffJMGirCeVH3Ve8LSzJj32ogE12wgZ6Xmx11X+oLe1fkC1+8+6qf/x4AZ3jkAgCGINABwBAEOgAYgkAHAEMQ6ABgCAIdAAxBoAOAIQh0ADAEgQ4AhiDQAcAQjv70/8SJE3r88cf17bffKicnR1dccYVWrlypoqIit+oDANjk6A7d4/Fo/vz5am5uVmNjo8aOHatnnnnGrdoAAHFwFOiFhYUqLy/v/3ry5Mlqb293XBQAIH6u7bYYiUS0efNmVVRUxPW64uIRbpUgafCtW/1+n/rOhpVzSZara/3vutHmT/bapot3u91kHz+YqFsG
nw0rd5Bzn0nXhCnbGZvQh9MeXAv0hoYG5efn65577onrdaFQjyIRK+71ojUea+vWY8eSt7Gu3++LuTXsYGubcAGmQrzvnRvHR5PIdZeO69Euv9+XEXU4ZUIfdnvwej1Rb4RdCfRgMKhvvvlG69evl9fLB2cAIB0cB/qaNWu0f/9+bdy4UTk5OW7UBABIgKNA//rrr7VhwwaNGzdOd955pyRpzJgxeuGFF1wpDgBgn6NAv+aaa/TVV1+5VQsAwAEeeAOAIQh0ADAEgQ4AhiDQAcAQBDoAGIJABwBDEOgAYAgCHQAMQaADgCEIdAAwhGvb516ofAXDlJc78G34ofecuk+esX08kiPaPuPJPr63L6zcnOTunx+tzmjXXrzivbbhvlSfg4s+mfJys6PuVz3YzsSxjof7Yu0znuzjk3meo617fg03dvaO99qG+1J9DnjkAgCGINABwBAEOgAYgkAHAEMQ6ABgCAIdAAxBoAOAIQh0ADAEgQ4AhiDQAcAQBDoAGMJxoLe2tqq2tlbTpk1TbW2tDh8+7EJZAIB4OQ70ZcuWqa6uTs3Nzaqrq9MTTzzhRl0AgDg52m0xFArpwIEDevnllyVJVVVVamhoUFdXl4qKimzN4fV6El6/ZOSwuMajreXW8dHG3ZzL1PFMrCmd74WTnws7a/x4frfWSrdM7SOefLHTQ6xjPJZlWfZL+6n9+/dr0aJF2r59e//YjBkz9PTTT+v6669PdFoAQAL4pSgAGMJRoAcCAR09elThcFiSFA6H1dnZqUAg4EpxAAD7HAV6cXGxJk6cqKamJklSU1OTJk6caPv5OQDAPY6eoUtSS0uLFi9erJMnT6qgoEDBYFBXXXWVW/UBAGxyHOgAgMzAL0UBwBAEOgAYgkAHAEMQ6ABgCOMC/cyZM3r44Yd1yy23aPr06dq5c2fUYw8ePKi7775bM2bM0IwZM7Rr164UVhqb3T4++ugjTZo0STU1NaqpqdHcuXNTXGls8ZwPSert7dXMmTM1Z86cFFU4NLs9HDx4ULNnz1ZNTY1mzpypP/7xj+rr60txtdHZ7ePdd9/VnDlzVFVVpZkzZ+qll15KcaWx2e3j6NGjmjdvnsrKyjLqerKzoWE4HNaKFStUWVmpW265RVu2bLE3uWWYP//5z9Yf/vAHy7Isq7W11brpppusnp6eAcedOnXKqqiosD799FPLsizr7NmzVldXV0prjcVuHx9++KE1e/bsVJdnm90+zlu9erX1+9//PqN6stvDmTNnrN7eXsuyLCscDlsLFy60XnnllZTWGovdPvbt22cdOXLEsizLOnnypFVZWWnt3bs3pbXGYrePkydPWnv37rV27tyZUdfTvHnzrK1bt1qWZVlbt2615s2bN+CY119/3frtb39rhcNhKxQKWTfffLPV1tY25NzG3aG/+eabqq2tlSSNGzdOP/vZz7R79+4BxzU1NamsrEyTJ0+WJGVnZ2vkyJEprTUWu31kunj6+Pjjj3X48GHV1NSkssQh2e0hLy9POTk5kqRz587phx9+kNebOT9idvuYNGmSSktLJUk+n09XX321vvvuu5TWGovdPnw+n6ZMmaJhw6JvgpZq5zc0rKqqkvTfDQ0PHDigrq6unxz3xhtvaO7cufJ6vSoqKlJlZaXeeuutIefPnKvNJe3t7Ro9enT/14FAQEeOHBlw3D//+U9lZ2drwYIFqqmp0ZIlS/T999+nstSY7PYhSYcPH9bs2bM1d+5cvf7666kq0Ra7fZw+fVqrVq3SihUrUlmeLfGci6NHj6qmpkbl5eUaPny47rjjjlSVOaR4+jivpaVF+/bt0y9+8Ytkl2dbIn1kio6ODpWWliorK0uSlJWVpZKSEnV0dAw47rLLLuv/2m6PjrbPTYfZs2ervb190O+9//77tueJRCL68MMP9eqrr2rUqFFavXq1/vSnP2n16tVulRqTW31cf/312rVrl3w+n9ra2lRfX6/S0lLddNNNbpUak1t9PPXU
U6qrq1NpaWnK/5MUt3qQpNLSUm3btk2nT5/WY489pnfeeUczZ850o8whudmHJHV2dur+++/XsmXL+u/YU8HtPi4mF1ygD3UHetlll+m7777r30+mo6ND5eXlA44LBAIqLy9XSUmJJKm6ulpLlixxv+Ao3OpjxIgR/f8eO3asKisr9fe//z1lge5WH5988ol2796tdevWqbe3V99//72qq6vV2NiYlLp/zK0efiw/P18zZsxQY2NjygLdzT5CoZDq6+s1f/583Xbbba7XGksyzkem+PGGhllZWVE3NAwEAmpvb9eNN94oaeAdezTGPXKZPn26/va3v0n676OIzz//XDfffPOA42677TZ99tln6unpkSTt3r1bEyZMSGmtsdjto7OzU9b/7d7w73//W3v27NG1116b0lpjsdtHY2OjduzYoR07dmjNmjUaP358SsLcDrs9tLW19X+qpa+vT++9957Gjx+f0lpjsdvHiRMnVF9fr7vvvjvjPjUl2e8jE9nd0HD69OnasmWLIpGIurq69O6772ratGlDzm/cXi6nT5/W4sWLdfDgQXm9Xj322GOqrKyUJK1du1YlJSW66667JElbt27Viy++KI/HozFjxqihoUGjRo1KZ/n97Pbx17/+VZs3b1Z2drbC4bB+/etfa/78+Wmu/v/Fcz7O++ijjxQMBvXaa6+lo+QB7Pawbdu2/uspEono5z//uRYtWqS8vLw0d/BfdvsIBoPatGmTrrzyyv7X/uY3v9Htt9+ertJ/wm4f4XBYv/rVr9TX16eenh4VFRVp7ty5euCBB9Jaf7QNDRcsWKAHH3xQN9xwg8LhsFauXKk9e/ZIkhYsWND/i+BYjAt0ALhYGffIBQAuVgQ6ABiCQAcAQxDoAGAIAh0ADEGgA4AhCHQAMASBDgCG+A+dnAYA/AActgAAAABJRU5ErkJggg==\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "kUCr17j6xpu5"
+ },
+ "source": [
+ "Another very useful kind of plot is the boxplot. \n",
+ "To draw it, we use the ```boxplot``` function from Seaborn, passing ```x```, where ```x = column to be plotted on this axis```, and the dataset, ```data = dataset```.\n",
+ "The boxplot shows a box in the middle where we can identify the median (the line inside the box, the point with half of the data on either side), the region that holds the central half of the data (the box itself, which runs from the first quartile (25%) to the third quartile (75%)), the maximum and minimum excluding outliers (the lines extending from the sides of the box), and the outliers (points beyond those lines, representing unusually high or low values).\n",
+ "The boxplot is an important data visualization tool because a single plot lets us read off several statistical measures."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 303
+ },
+ "id": "R9LHaY0yv29K",
+ "outputId": "3527f845-0c42-440b-d3f0-f49a7b942ffe"
+ },
+ "source": [
+ "sns.boxplot(x='g-0' , data=dados)"
+ ],
+ "execution_count": 21,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 21
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "iVBORw0KGgoAAAANSUhEUgAAAV8AAAEMCAYAAABjr7XYAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAATBUlEQVR4nO3da1TU9b7H8c/MgOF4CUQlFAXdmtrKoqXdJRVLfQCY2y7kSvdZWZiJl26rstyV1kke7Ky2tqxd5s60dZ6kS/Os8oKaml1XyzTd7jIBzwYVCQyhQmb+54HNyAAKcvuO8X494TL//2++AvPmP/8ZB5fjOI4AAG3KbT0AALRHxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMBBxIRuXllbI7z/7tODY2M4qKTnV4kO1lHCej9maJpxnk8J7PmZrmqbO5na7FBPT6ZyXX1B8/X4nJL6Bz4WzcJ6P2ZomnGeTwns+Zmua1piN0w4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgIEL+jNCuLitXv2ujhzJP+82J0+WyeNxq3Pnrg2u16dPoiZPntpS4wHtCvFtR44cydfB73+QJyr6nNv4fi2TJHmiqs67VmA7AE1DfNsZT1S0vIljznl5Zf4WSTrvNjW3A9A0nPMFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8TWwa9cnys3NtR7jopSbm6tduz6xHgNotgjrAdqjnTu3KzLSo6FDr7Ue5aKzadMmnT7t080332I9CtAsHPkCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGIho7SsoKyvVsmV/1+TJf9Hq1f/UjBmzdfJkmRYtWqC4uHjNnfu4HMfRsmV/14wZs+U4jpYsWSxJmjXrEV16abQkqaAgT4sWLVBsbHdFRUVpypRpWr36n5o8eapWr35XM2bM1qWXRqusrFRLliyWz1etqKhL9OCDc4Jr1J6p5j5nZpyqd99dLkmaOPEOLVmyWI7jqEePnoqIiJDjSC6XSz7faZ04cUJPPfWsHMfRokUL5PV20k8/lUhySXIa9bVZvDhHDz/8RIt9rduDffv2SZLuu2+y8SRoDI8nQj5ftTwej3w+X8hlLpdLqaljtWXLxzW2P7tdly5dVF5erujoGHXo0EHHjx+TJEVEnMlWdXV18OOa72dm/kWrVi2X2+2Wz+fTpEmZ+uCD/9H06bO0ceP/yufzyXEcRUZGBDuSnj5Rr7/+imbOnKt169YEe1VefkpFRf+npKT++utfX2jRr43LcZzGlUJSSckp+f1nN+/Ro4uKi8vPu8/Klcu1bdsWxcf3UlFRoUaNGqODBw+osPA/kqTRo2+VJG3btkWjRo2RJG3dujl42ZQp90mSnnnm8eA+ktSrV28VFRWGrDtlyn1auXJ5cP/aa9SeqeY+gRkD1+H1dlJlZcV5/229evWWpJC5LtTy5aubvO+FyslZqB+OnJA3ccw5t6nM3yJJ590msN2APt31xBPzW3TGhhBdNKz+A6DAL4KaAh3p
2NGrysoKeb2d9MsvlcGu1Mzjhd5W3W6XYmM7n/vyC1rtApWVlWrnzu1yHEeFhf+R4zjasWNbSKx27NimHTu2/X7Zdu3Ysa3GZdt18mSZCgry6gQusF7g7c6dn6igIC9kf0naufPMGvXNFNin5owBDYU3MENzwiudOfpF40yf/l/WI+CiUP/xZO3wSmc7Eri9V1ZWhHSlpgULnmnRKVv1tMP69WtCjpSls3cVQj921XjfCbls3boPdPDggQavy+/36803l6q6OvSuTWCNwNFvzZkC+9SesS3t3btHOTkL2+S6Cgry5fd5WmQtf/WvKijIb7PZJen06ao2uy6gtry8H1t0vVY98t29e1e9v23qcmq9Pfv53bt3Nero0uer/n270DUc58wa9c0U2KdxMwJAy2nVI98bb7xZn3yyrRFxC5yjqX2uxqUbb7w55BzxuXg8EYqLi1NhYWHIGi7XmTXqmymwz7Fjx0wD3FbnTQPnfFuCOyJKfdv4nC/ne/FH0qpHvunpE+V2u0I+F3iksubHERGeGu9HhFyWkfFnZWXNbPC63G63srJmBteqvUZ9MwX2qT1jWxo69Gqz677YREZ2sB4B7VhSUv8WXa9V4xsdHaMRI0bK5XKpV6/ecrlcSkkZFXyWgCSlpIxSSsqo3y8bqZSUUTUuG6lLL41W375JIftICq4XeDtixC3q2zcpZH9JGjFiZMhTzWrOFNin5owBXm+nBv99vXr1rjPXheKpZo33xhsrrEfARaH+gymPp+4d/UA/Ard3r7dTSFdqaumnmrX6f7JIT5+ogQMHKSsrWwMHDgoeyUZFRSkxsZ8yMv4c3Cbwfv/+A9S//4CQI9bAPr17J+hPfxoQXC8ra2Zw38D19e8/QImJSRo0aFDIGrVnqrlPYK3Adc+YMUuXXHKJOnTooN69E5SYmKS+fZOUmNhPCQkJioqK0vTp2cG5unWL/X31xh9Fc9SLP7pA8Dyeug/0ulwujRkzrtb2Z7fr0qWLpDMHTD17xgU/X9895Jrv33vvfXK5XMG1Jk3KlMvl0gMPPPR7G/qpb9+kkI7MmDFbHTt21EMPzQ7pVXx8gqSWP+qV2uB5vpbCdb6cnIWKjPTokUfmtfn1XuzP83355f/W6dO+Nr/exgrXnzmJ2ZqqqbOZPs8XAFA/4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoAB4gsABogvABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYIL4AYID4AoCBCOsB2qMRI0aqa9eO1mNclG677Tb9/PMv1mMAzUZ8Ddx88y3q0aOLiovLrUe56KSmpvJ1wx8Cpx0AwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAQIT1AGhbvl/LVJm/5byXSzrvNme3696SowHtCvFtR/r0SWxwm5MnI+TxuNW5c9cGtuzeqPUA1I/4tiOTJ09t1HY9enRRcXF5K08DtG+c8wUAA8QXAAwQXwAwQHwBwADxBQADxBcADBBfADBAfAHAAPEFAAPEFwAMEF8AMEB8AcAA8QUAA8QXAAwQXwAwQHwBwADxBQADxBcADFzQnxFyu12N+lw4Cef5mK1pwnk2KbznY7amacpsDe3jchzHaepAAICm4bQDABggvgBggPgCgAHiCwAGiC8AGCC+AGCA+AKAAeILAAaILwAYaJH4rly5UuPHj1d6
eromTJjQEku2uM8//1xDhgzRe++9Zz1K0PPPP6/x48crIyNDmZmZ2rt3r+k8hw8f1t13361x48bp7rvvVl5enuk8NZWWluqBBx7QuHHjlJ6eruzsbP3000/WY9WxZMkSDRo0SP/+97+tRwn67bff9Oyzz2rs2LFKT0/X/PnzrUcK2rp1q26//XZNmDBBGRkZ2rhxo9ksOTk5Sk1NrfP9a7XbhdNMH3/8sTN58mSnvLzccRzHKS4ubu6SLa68vNy54447nKysLGflypXW4wTl5uY6VVVVwffHjBljOs+UKVOctWvXOo7jOGvXrnWmTJliOk9NpaWlzmeffRb8eNGiRc5TTz1lOFFd+/btc6ZNm+aMHj3aOXjwoPU4QQsXLnRefPFFx+/3O44TPrdRv9/vDB8+PPi1OnDggJOcnOz4fD6Teb788kunsLCwzvevtW4XzT7yXb58ubKzs9W5c2dJUvfu3Zv9C6GlLVq0SNOmTVNMTIz1KCFGjx6tyMhISVJycrKOHj0qv99vMktJSYn279+vtLQ0SVJaWpr2798fNkeX0dHRuv7664MfJycnq7Cw0HCiUFVVVVqwYIGee+4561FCVFRUaO3atZozZ45crjMv9BJOt1G3263y8nJJUnl5uXr27Cm32+Zs6PDhwxUfHx/yuda8XVzQq5rV59ChQ9qzZ49effVVVVVVKTMzU3fddVezB2sp27dvV3l5ucaPH69t27ZZj3NOq1at0qhRo8x+8IqKihQXFyePxyNJ8ng86tmzp4qKitStWzeTmc7F7/fr/fffV2pqqvUoQa+++qoyMjKUkJBgPUqII0eOKDo6WkuWLNHnn3+uTp06ac6cORo+fLj1aHK5XHrllVf00EMPyev1qqKiQm+++ab1WCFa83bRYHwnTpx4ziOMTz/9VD6fT0VFRVq9erVKS0t1zz33qF+/frr22mubNVhjnW++jz76SH/729/0zjvvtMkstTX0tQt8Qzds2KD169dr1apVbTneRWvhwoXyer269957rUeRJH3zzTfat2+fHnvsMetR6vD5fDpy5IiuuOIKPfHEE9qzZ48efPBBbdq0KXhv1Up1dbXeeOMNvf766xo2bJi+/vprzZ07Vxs2bFCnTp1MZ2sLDcZ3zZo15728V69eSktLk9vtVmxsrG666SZ9++23bRbf88331Vdfqbi4WHfeeaekMw/abN26VWVlZcrOzjadLWDTpk1avHixVqxYYXp3MD4+XseOHZPP55PH45HP59Px48fr3A2zlpOTo/z8fC1btszsXkJtX375pQ4dOqQxY8ZIko4ePapp06bppZde0ogRI0xni4+PV0RERPBu89VXX62YmBgdPnxYQ4cONZ3twIEDOn78uIYNGyZJGjZsmDp27KhDhw7pqquuMp0toDVvF83+6U1LS9OOHTskSZWVlfr66681ePDgZg/WEoYPH67du3crNzdXubm5GjdunGbNmtUm4W2MrVu36qWXXtLbb79tfnc1NjZWQ4YM0YcffihJ+vDDDzVkyJCwOuXw8ssva9++fVq6dKk6dOhgPU5QVlaWdu7cGfw5u+yyy/T222+bh1eSunXrpuuvv167du2SdOaR+5KSEiUmJhpPJl122WU6evSofvzxR0lnTmGWlJSob9++xpOd1Zq3i2a/mPqvv/6q+fPna//+/ZKkCRMmKCsrq9mDtYYnn3xSV155ZdjcXb3hhhsUGRkZ8o1csWKF2QODhw4d0pNPPqmff/5ZXbt2VU5Ojvr3728yS23ff/+90tLSlJSUpKioKElSQkKCli5dajxZXampqVq2bJkuv/xy61EknTnvO2/ePJWVlSkiIkJz587VyJEjrceSJK1bt07/+Mc/gg8Gzp49W7feeqvJLC+88II2btyoEydOKCYmRtHR0dqwYUOr3S74SxYAYCA8TpoBQDtDfAHAAPEFAAPEFwAMEF8AMEB8AcAA8cUfSllZmWbOnKnk5GSNHj1a69evtx4JqFez
X1gHCCcLFixQZGSkdu3apQMHDmj69OkaPHiwBg4caD0aEIL/ZIGw99133+npp59Wfn6+UlJS5Ha7lZiYqIcffjhku8rKSl133XVav369+vXrJ0l6/PHHFRcXF5YveoP2jdMOCGtVVVXKzs7WxIkT9cUXXygtLU2bN2+ud9u8vDx5PJ5geCVp8ODB+uGHH9pqXKDROO2AsLZnzx5VV1dr6tSpcrlcGjt27DlfjauysrLOyyR26dJFFRUVbTEqcEE48kVYO378uOLi4oIvvCIp+HJ+999/v6655hpdc801Wrdunbxer06dOhWy/6lTp9rFa8Pi4sORL8Jajx49dOzYMTmOEwxwUVGR+vTpo7feeitk28rKSvl8PuXl5SkpKUmS9K9//UsDBgxo67GBBnHki7CWnJwsj8ej9957T9XV1dq8efM5/8qz1+vVbbfdptdeey342tJbtmwJ27+ojfaNZzsg7O3du1fPPPOMCgoKlJKSIr/fryFDhmjmzJl1ti0rK9O8efP06aefKjo6Wo8++qjS09MNpgbOj/jionPnnXcqMzNTkyZNsh4FaDJOOyDsffHFFyouLlZ1dbXWrFmjgwcPKiUlxXosoFl4wA1h7/Dhw5o7d65++eUXJSQk6LXXXlPPnj2txwKahdMOAGCA0w4AYID4AoAB4gsABogvABggvgBggPgCgIH/B0WEIsg40bSaAAAAAElFTkSuQmCC\n",
+ "text/plain": [
+ "
"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ }
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "-4kvTdIE3d0W"
+ },
+ "source": [
+ "Besides defining the data for the x axis, we can also define the data for the other axis by passing a value to the ```y``` parameter (```y = variable to be plotted on this axis```).\n",
+ "As we can see, the boxplot for ```tratamento = com_droga``` shows many outliers, which raises an interesting discussion: from a biological point of view, investigating these points is important, but depending on the field we work in, such points may call for different handling. \n",
+ "That said, a data scientist needs not only to understand and manipulate the dataset but also to know the domain being studied."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 521
+ },
+ "id": "h87JI4a_yNa1",
+ "outputId": "029511d6-7c98-4af5-be72-2020ca41c860"
+ },
+ "source": [
+ "plt.figure(figsize=(10,8))\n",
+ "sns.boxplot(y='g-0', x='tratamento' , data=dados)"
+ ],
+ "execution_count": 22,
+ "outputs": [
+ {
+ "output_type": "execute_result",
+ "data": {
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 22
+ },
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "
"
+ ],
+ "text/plain": [
+ " tempo g-0 ... c-98 c-99\n",
+ "count 23814.000000 23814.000000 ... 23814.000000 23814.000000\n",
+ "mean 48.020156 0.248366 ... -0.470252 -0.301505\n",
+ "std 19.402807 1.393399 ... 1.834828 1.407918\n",
+ "min 24.000000 -5.513000 ... -10.000000 -10.000000\n",
+ "25% 24.000000 -0.473075 ... -0.592600 -0.562900\n",
+ "50% 48.000000 -0.008850 ... 0.014000 -0.019500\n",
+ "75% 72.000000 0.525700 ... 0.461275 0.438650\n",
+ "max 72.000000 10.000000 ... 3.111000 3.805000\n",
+ "\n",
+ "[8 rows x 873 columns]"
+ ]
+ },
+ "metadata": {
+ "tags": []
+ },
+ "execution_count": 31
+ }
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "2ZabuXjzi5Cn"
+ },
+ "source": [
+ "The ``` .describe() ``` method provided by Pandas reports the following summary statistics for each numeric column: **count** (number of non-null values), **mean**, **standard deviation**, **minimum**, **maximum**, and the **quartiles** (25%, 50%, and 75%), where the 50% quartile is the **median**. In other words, it covers **measures of central tendency**, **quantile-based measures of position**, and **measures of dispersion**.\n",
+ "***\n",
+ "\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "uAE67jFKrf__"
+ },
+ "source": [
+ "### **Measures of Central Tendency:**\n",
+ "***"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "BRzOShqOpQDF"
+ },
+ "source": [
+ "#### **Mean**\n",
+ "https://pt.wikipedia.org/wiki/M%C3%A9dia#Defini%C3%A7%C3%A3o_formal\n",
+ "\n",
+ "It is denoted by $\\mu$ when it refers to the population and by $\\bar{X}$ when it refers to the sample.\n",
+ "\n",
+ "# $$\\mu = \\frac 1n\\sum_{i=1}^{n}X_i$$\n",
+ "\n",
+ "where \n",
+ "\n",
+ "$n$ = number of observations (records)\n",
+ "\n",
+ "$X_i$ = value of the i-th observation (record)\n",
+ "\n",
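+ "As a small worked example (the five values below are hypothetical, chosen only for illustration and not taken from the dataset), for the observations $2, 4, 4, 5, 10$ we have $n = 5$ and\n",
+ "\n",
+ "$$\\mu = \\frac{2 + 4 + 4 + 5 + 10}{5} = \\frac{25}{5} = 5$$\n",
+ "\n",
+ "The single large value ($10$) pulls the mean up: three of the five observations lie below it.\n",
+ "\n",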
+ "***\n",
+ "\n",
+ "#### **Median**\n",
+ "https://pt.wikipedia.org/wiki/Mediana_(estat%C3%ADstica)\n",
+ "\n",
+ "To obtain the median of a dataset, proceed as follows:\n",
+ "1. Sort the dataset;\n",
+ "2. Identify the number of observations (records) in the dataset ($n$);\n",
+ "3. Identify the median element:\n",
+ "\n",
+ "> When $n$ is odd, the position of the median element is given by:\n",
+ "\n",
+ "\n",
+ "# $$Elemento_{Md} = \\frac{n+1}2$$\n",
+ "\n",
+ "> When $n$ is even, the position of the median element is given by:\n",
+ "\n",
+ "\n",
+ "# $$Elemento_{Md} = \\frac{n}2$$\n",
+ "\n",
+ "4. Obtain the median:\n",
+ "\n",
+ "> When $n$ is odd:\n",
+ "\n",
+ "\n",
+ "# $$Md = X_{Elemento_{Md}}$$\n",
+ "\n",
+ "> When $n$ is even:\n",
+ "\n",
+ "\n",
+ "# $$Md = \\frac{X_{Elemento_{Md}} + X_{Elemento_{Md}+1}}2$$\n"
+ ]
+ },
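+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The median steps above can be illustrated with two small worked examples (the values are hypothetical, chosen only for illustration and not taken from the dataset).\n",
+ "\n",
+ "For the sorted observations $2, 4, 4, 5, 10$, $n = 5$ is odd:\n",
+ "\n",
+ "$$Elemento_{Md} = \\frac{5+1}2 = 3 \\qquad Md = X_3 = 4$$\n",
+ "\n",
+ "For the sorted observations $1, 3, 5, 7$, $n = 4$ is even:\n",
+ "\n",
+ "$$Elemento_{Md} = \\frac 42 = 2 \\qquad Md = \\frac{X_2 + X_3}2 = \\frac{3 + 5}2 = 4$$"
+ ]
+ },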
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "GQ63XNLbpPRF"
+ },
+ "source": [
+ "### **Quantiles (Measures of Position):**\n",
+ "***"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "d9qcPQ29rqjA"
+ },
+ "source": [
+ "There is a family of measures of position that are conceptually similar to the median, although they are not measures of central tendency. The median divides the distribution into two parts with an equal number of elements. Likewise, quartiles divide the distribution into four parts with an equal number of elements each, deciles into ten parts, and percentiles into one hundred equal parts."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "diyw7oPcr_Et"
+ },
+ "source": [
+ "#### **Quartiles**\n",
+ "\n",
+ "https://pt.wikipedia.org/wiki/Quartil\n",
+ "\n",
+ "For an ordered sample:\n",
+ "\n",
+ "* first quartile (denoted Q1/4) = lower quartile = the value at 25% of the ordered sample = 25th percentile\n",
+ "* second quartile (denoted Q2/4) = median = the value below which 50% of the ordered sample lies = 50th percentile, or 5th decile\n",
+ "* third quartile (denoted Q3/4) = upper quartile = the value above which the highest 25% of the values lie = the value at 75% of the ordered sample = 75th percentile\n",
+ "\n",
+ "The difference between the upper and lower quartiles is called the interquartile range.\n",
+ "\n"
+ ]
+ },
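+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As a worked example, take the ```g-0``` column of the ```.describe()``` output shown earlier (25% = $-0.473075$, 75% = $0.525700$). The interquartile range is\n",
+ "\n",
+ "$$IQR = Q_{3/4} - Q_{1/4} = 0.525700 - (-0.473075) = 0.998775$$\n",
+ "\n",
+ "Assuming the default seaborn setting (```whis=1.5```), a boxplot of this pooled column would draw as outliers the points outside the interval\n",
+ "\n",
+ "$$[Q_{1/4} - 1.5\\,IQR,\\; Q_{3/4} + 1.5\\,IQR] \\approx [-1.971,\\; 2.024]$$\n",
+ "\n",
+ "The boxplots in this notebook are split by ```tratamento```, so each box uses the quartiles of its own group and its limits will differ from these pooled values."
+ ]
+ },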
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "_QLiWQJStmax"
+ },
+ "source": [
+ "### **Measures of Dispersion**\n",
+ "***"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "jQKTc_octuj2"
+ },
+ "source": [
+ "Although measures of position provide a very useful summary of the data, they may not be enough to characterize distinct datasets, especially when the observations of a given distribution are widely dispersed."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "pLVa5Jcht_km"
+ },
+ "source": [
+ "#### **Standard Deviation**\n",
+ "\n",
+ "https://pt.wikipedia.org/wiki/Desvio_padr%C3%A3o\n",
+ "\n",
+ "In probability, the standard deviation, or population standard deviation (commonly denoted by the Greek letter sigma, $\\sigma$), is a measure of dispersion around the population mean of a random variable.\n",
+ "\n",
+ "The term also has a specific meaning in statistics, where it is called the sample standard deviation (commonly denoted by the Latin letter $s$) and measures the dispersion of the data around the sample mean. A low standard deviation indicates that the data points tend to be close to the mean or expected value. A high standard deviation indicates that the data points are spread over a wide range of values.\n",
+ "\n",
+ "The population or sample standard deviation is the square root of the corresponding population or sample variance, so it is a non-negative measure of dispersion expressed in the same unit as the data."
+ ]
+ },
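+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Written out with the same symbols used for the mean, the two definitions above are\n",
+ "\n",
+ "$$\\sigma = \\sqrt{\\frac 1n\\sum_{i=1}^{n}(X_i - \\mu)^2} \\qquad s = \\sqrt{\\frac{1}{n-1}\\sum_{i=1}^{n}(X_i - \\bar{X})^2}$$\n",
+ "\n",
+ "As a small worked example (hypothetical values, not taken from the dataset), the observations $2, 4, 4, 5, 10$ have mean $5$, so the squared deviations are $9, 1, 1, 0, 25$, which sum to $36$:\n",
+ "\n",
+ "$$\\sigma = \\sqrt{\\frac{36}{5}} \\approx 2.683 \\qquad s = \\sqrt{\\frac{36}{4}} = 3$$\n",
+ "\n",
+ "Pandas computes the sample version by default (```ddof=1```), so the ```std``` row of ```.describe()``` corresponds to $s$."
+ ]
+ },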
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "xL9MuXiw6Q2J"
+ },
+ "source": [
+ "## **Challenge 05: Reflect on manipulating the size of visualizations**"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "g15E_26wzOS8"
+ },
+ "source": [
+ "### 'Why did the distortion occur?', 'Was the chart really distorted?', 'What does a distorted chart actually mean?'\n",
+ "***"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "5ZPZLjU848qI"
+ },
+ "source": [
+ "The 'distortion' mentioned is mainly caused by the wide range of the data, especially the values far from the measures of central tendency (outliers). In other words, a dataset with such a wide range needs a large plotting area to be visualized well.\n",
+ "\n",
+ "That, however, does not necessarily count as a distortion. As Cole Nussbaumer Knaflic, author of the book 'Storytelling with Data', points out, a distorted chart is one that suggests an interpretation different from what the data actually indicate, leading the reader to an error or a mistaken interpretation, as in the example below, in which the axis scale was changed to give the viewer a wrong impression.\n",
+ ""
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "id": "rzPl_cdA6Qg8"
+ },
+ "source": [
+ "## **Challenge 06: Run further analyses with the boxplot and even with the histogram**"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "metadata": {
+ "colab": {
+ "base_uri": "https://localhost:8080/",
+ "height": 469
+ },
+ "id": "bzikiRUv65vD",
+ "outputId": "4cea895d-dce4-4fff-8069-940a2cb30bb6"
+ },
+ "source": [
+ "# Create the figure with two side-by-side axes:\n",
+ "fig, axs = plt.subplots(figsize=(26, 7), ncols=2)\n",
+ "\n",
+ "# Boxplot for the full dataset:\n",
+ "ax1 = sns.boxplot(y='g-0', x='tratamento', data=dados, ax=axs[0])\n",
+ "ax1.set_title('Behavior of \"g-0\" by treatment (full dataset)', fontsize=18)\n",
+ "\n",
+ "# Boxplot for the top five compounds:\n",
+ "ax2 = sns.boxplot(y='g-0', x='tratamento', data=top_compostos, ax=axs[1])\n",
+ "ax2.set_title('Behavior of \"g-0\" by treatment (top 5 compounds)', fontsize=18)\n",
+ "\n",
+ "plt.show()"
+ ],
+ "execution_count": 37,
+ "outputs": [
+ {
+ "output_type": "display_data",
+ "data": {
+ "image/png": "\n",
+ "text/plain": [
+ "