tuberculosis_percentage by shvngisingh · Pull Request #1939 · datacommonsorg/data

shvngisingh · 2026-03-30T09:32:39Z

Tuberculosis: Percentage of people diagnosed with a new episode of pulmonary TB whose disease was bacteriologically confirmed
data source: https://data.who.int/indicators/i/1891124/449F55C

google-cla · 2026-03-30T09:32:50Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

gemini-code-assist

Code Review

This pull request introduces a new data import for the WHO Tuberculosis Percentage dataset, including the necessary download script, configuration files, and test data. The review identified several issues regarding file path conventions, incorrect script references in the documentation, and a need for more robust error handling in the download script. These changes are necessary to ensure the import automation functions correctly.

statvar_imports/tuberculosis_percentage/download_who_tuberculosis.py

gemini-code-assist · 2026-03-30T09:36:41Z

statvar_imports/tuberculosis_percentage/README.md

+
+**Download input file**
+ ```bash
+ python3 statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.py


The command to download the input file refers to a non-existent script tuberculosisPercentage_input.py. It should point to the download_who_tuberculosis.py script.

Suggested change

python3 statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.py

python3 statvar_imports/tuberculosis_percentage/download_who_tuberculosis.py

gemini-code-assist · 2026-03-30T09:36:41Z

statvar_imports/tuberculosis_percentage/README.md

+**For Main data run**
+```bash
+python3 tools/statvar_importer/stat_var_processor.py \
+  --input_data=statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.csv \


The --input_data path for the main data run is incorrect. Based on standard project structure and where the download script should save the file, the input file will be located in the source_files subdirectory. Please update the path accordingly.

Suggested change

--input_data=statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.csv \

--input_data=statvar_imports/tuberculosis_percentage/source_files/tuberculosisPercentage_input.csv \

gemini-code-assist · 2026-03-30T09:36:41Z

statvar_imports/tuberculosis_percentage/download_who_tuberculosis.py

+    if api_response.status_code != 200:
+        print(f"Failed to fetch API data. HTTP {api_response.status_code}")
+        return


The error checking for the API request can be improved. Using response.ok is a more idiomatic way to check for successful requests with the requests library, and including response.reason can provide more context on failure.

Suggested change

if api_response.status_code != 200:

print(f"Failed to fetch API data. HTTP {api_response.status_code}")

return

if not api_response.ok:

print(f"Failed to fetch API data. HTTP {api_response.status_code}: {api_response.reason}")

return

gemini-code-assist · 2026-03-30T09:36:41Z

statvar_imports/tuberculosis_percentage/download_who_tuberculosis.py

+
+    # We only pull the 'country' (for matching) and 'iso3' columns
+    geo_columns = ['country', 'iso3']
+    master_df = pd.read_csv(master_url, usecols=geo_columns).drop_duplicates()


The call to pd.read_csv with a URL can fail due to network issues or if the remote server is unavailable. It's good practice to wrap this in a try...except block to handle potential errors gracefully.

Suggested change

master_df = pd.read_csv(master_url, usecols=geo_columns).drop_duplicates()

try:

master_df = pd.read_csv(master_url, usecols=geo_columns).drop_duplicates()

except Exception as e:

print(f"Failed to fetch country iso3 codes from WHO master database: {e}")

return

gemini-code-assist bot reviewed Mar 30, 2026

View reviewed changes

shvngisingh added 2 commits March 31, 2026 07:01

tuberculosis_percentage

034f892

input_file name change

46594cf

shvngisingh force-pushed the tuberculosis_percentage branch from 345605a to 46594cf Compare March 31, 2026 07:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tuberculosis_percentage#1939

tuberculosis_percentage#1939
shvngisingh wants to merge 2 commits intodatacommonsorg:masterfrom
shvngisingh:tuberculosis_percentage

shvngisingh commented Mar 30, 2026 •

edited

Loading

Uh oh!

google-cla bot commented Mar 30, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

gemini-code-assist bot Mar 30, 2026

Uh oh!

gemini-code-assist bot Mar 30, 2026

Uh oh!

gemini-code-assist bot Mar 30, 2026

Uh oh!

gemini-code-assist bot Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	python3 statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.py
	python3 statvar_imports/tuberculosis_percentage/download_who_tuberculosis.py

	--input_data=statvar_imports/tuberculosis_percentage/tuberculosisPercentage_input.csv \
	--input_data=statvar_imports/tuberculosis_percentage/source_files/tuberculosisPercentage_input.csv \

Conversation

shvngisingh commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-cla bot commented Mar 30, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

gemini-code-assist bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

shvngisingh commented Mar 30, 2026 •

edited

Loading