CVM web data extraction from a list of companies
site for consulting companies information: https://cvmweb.cvm.gov.br/SWB/Sistemas/SCW/CPublica/CiaAb/FormBuscaCiaAb.aspx?TipoConsult=c
file containing the companies to gather information: Empresas_listadas_B3-CORRETO.xlsx
After installing the required packages (following requirements.txt file) in an isolated environment, execute:
- cvm_crawler_main.py to get companies files per year (folder companies_data)
- cvm_crawler_get_formulario.py to get companies pdfs (folder companies_data)
- cvm_craeler_email.py to get companies e-mails (file tabela_empresa_nomes.csv)
p.s. the crawler is set to run Google Chrome web browser behind the scenes. So it is necessary to have Google Chrome locally installed.