Dataset Package
For the phases of the data, see "Data Acquisition and Preparation".
Documentation from Code Strings
drugslm.sources.anvisa
ANVISA Data Source Configuration.
This module acts as the configuration entry point for the ANVISA data source subpackage. It defines global constants, file paths, and remote URLs used across the scraping and processing modules.
Scope
This module is responsible for:
- Centralizing URL definitions for ANVISA portals and open datasets.
- Defining the directory structure for raw data storage within the ANVISA scope.
- Exporting constants for use in sibling modules (e.g., categories scraper).
ANVISA_DIR = RAW / 'anvisa'
module-attribute
Path: Root directory for storing raw data scraped specifically from the ANVISA query portal.
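As a minimal sketch of how such a path constant composes with pathlib: the value of RAW below is a placeholder assumption (the real one is defined elsewhere in the package and is not shown here), and ensure_dir is a hypothetical helper, not part of the documented module.

```python
from pathlib import Path

# Assumption: RAW is a project-level Path defined elsewhere in the package.
# The value here is a placeholder for illustration only.
RAW = Path("data") / "raw"

# Same expression as in the module: the "/" operator joins path segments.
ANVISA_DIR = RAW / "anvisa"


def ensure_dir(path: Path) -> Path:
    """Hypothetical helper: create the directory (and parents) if missing."""
    path.mkdir(parents=True, exist_ok=True)
    return path
```

A scraper could call ensure_dir(ANVISA_DIR) once before writing any files, so that the constant itself stays a pure path definition with no side effects at import time.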
ANVISA_URL = 'https://consultas.anvisa.gov.br/#/bulario/q/'
module-attribute
str: Base URL for the ANVISA 'Bulário Eletrônico' (Electronic Leaflet) query interface.
DADOS_ABERTOS = EXTERNAL / 'anvisa' / 'dados_abertos.csv'
module-attribute
Path: Local file path where the Open Data CSV is stored after download.
DADOS_ABERTOS_URL = 'https://dados.anvisa.gov.br/dados/DADOS_ABERTOS_MEDICAMENTOS.csv'
module-attribute
str: Direct URL to the ANVISA Open Data CSV file containing registered drugs.
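The two Open Data constants pair a remote source with a local destination. The sketch below shows one way a sibling module might use them, using only the standard library. The EXTERNAL value is a placeholder assumption, and fetch_open_data is a hypothetical helper; the package's actual download routine is not documented here and may differ.

```python
import shutil
from pathlib import Path
from urllib.request import urlopen

# Assumption: EXTERNAL is a project-level Path defined elsewhere in the package.
# The value here is a placeholder for illustration only.
EXTERNAL = Path("data") / "external"

DADOS_ABERTOS = EXTERNAL / "anvisa" / "dados_abertos.csv"
DADOS_ABERTOS_URL = "https://dados.anvisa.gov.br/dados/DADOS_ABERTOS_MEDICAMENTOS.csv"


def fetch_open_data(
    url: str = DADOS_ABERTOS_URL,
    dest: Path = DADOS_ABERTOS,
    timeout: float = 60.0,
) -> Path:
    """Hypothetical helper: stream the file at `url` to `dest` and return `dest`."""
    # Make sure the target directory exists before opening the output file.
    dest.parent.mkdir(parents=True, exist_ok=True)
    # Copy in chunks so the whole CSV is never held in memory at once.
    with urlopen(url, timeout=timeout) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest
```

Calling fetch_open_data() with no arguments would download to DADOS_ABERTOS; passing explicit url and dest values keeps the helper easy to exercise against a local file in tests.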