Data & Analytics Tools
Open-source tools for data analysis, poverty measurement, and accessing international development data — available on SSC/RePEc, CRAN, and PyPI.
Featured Data Access Tools
| Tool | Description | Availability |
|---|---|---|
| wbopendata | Access 29,000+ indicators from 51 World Bank databases directly from Stata, covering 296 countries from 1960 to present | SSC |
| unicefData | Trilingual library for downloading UNICEF child welfare indicators via SDMX API with cross-language test parity | R, Python, Stata |
| unicefstats-mcp | MCP server providing AI assistants access to 790+ UNICEF child-focused indicators across 200+ countries via SDMX API | PyPI |
| datalibweb | Stata frontend for the World Bank microdata API, enabling access to global, regional, and country microdata catalogs | GitHub |
Stata Modules on SSC
Poverty, Inequality & Welfare
| Module | Description | Availability |
|---|---|---|
| ainequal | Compute inequality and concentration indices with analytical standard errors | SSC |
| apoverty | FGT and other poverty measures with standard errors and hypothesis tests | SSC |
| mpovline | Poverty measures at multiple poverty lines simultaneously | SSC |
| isopoverty | Graph iso-poverty curves showing growth-redistribution tradeoffs | SSC |
| alorenz | Graph Lorenz and concentration curves with confidence intervals | SSC |
| hoi | Human Opportunity Index for measuring inequality of opportunity (Barros et al.) | SSC |
| mol | Measure of effective literacy using the Basu-Foster framework | SSC |
| groupdata | Poverty and inequality estimation from grouped or tabulated data | GitHub |
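
For readers unfamiliar with the FGT (Foster-Greer-Thorbecke) family that modules such as apoverty and mpovline report, the following is a minimal Python sketch of the textbook formula. It is not the implementation used by any module listed here; the function name `fgt` and its arguments are hypothetical and chosen for illustration only, and it omits the standard errors and tests the Stata modules provide.

```python
def fgt(incomes, z, alpha=0, weights=None):
    """Textbook FGT poverty measure (illustrative sketch, hypothetical name).

    incomes : list of welfare values (income or consumption)
    z       : poverty line, in the same units as incomes
    alpha   : 0 = headcount ratio, 1 = poverty gap, 2 = squared poverty gap
    weights : optional sampling weights (defaults to equal weights)
    """
    if weights is None:
        weights = [1.0] * len(incomes)
    total_weight = sum(weights)
    total = 0.0
    for y, w in zip(incomes, weights):
        if y < z:
            # Normalized shortfall from the poverty line, raised to alpha
            total += w * ((z - y) / z) ** alpha
    return total / total_weight


incomes = [50, 80, 120, 200]
for a in (0, 1, 2):
    print(a, fgt(incomes, z=100, alpha=a))
```

Evaluating the same function at several values of `z` gives the kind of multi-line comparison that mpovline is described as producing.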
Decomposition Methods
| Module | Description | Availability |
|---|---|---|
| adecomp | Shapley decomposition of changes in poverty and inequality indicators | SSC |
| drdecomp | Datt-Ravallion decomposition of poverty changes into growth and redistribution | SSC |
| skdecomp | Shapley value decomposition of changes in the income distribution | SSC |
| dfl | DiNardo-Fortin-Lemieux counterfactual density decomposition | SSC |
| changemean | Decompose poverty changes into growth and distributional effects | SSC |
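
As a rough illustration of the growth-redistribution idea behind a Datt-Ravallion decomposition, the sketch below works directly on unit-record data and uses the first period as the reference. It assumes unweighted microdata and the headcount ratio as the poverty measure; the function names are hypothetical, and it does not reproduce the options or output of drdecomp or any other module listed here.

```python
def headcount(incomes, z):
    """Share of observations strictly below the poverty line z."""
    return sum(1 for y in incomes if y < z) / len(incomes)


def mean(values):
    return sum(values) / len(values)


def datt_ravallion(y1, y2, z, measure=headcount):
    """Illustrative growth/redistribution split with period 1 as reference."""
    mu1, mu2 = mean(y1), mean(y2)
    p_start = measure(y1, z)
    p_end = measure(y2, z)
    # Growth: give period 1's distribution the mean of period 2
    p_growth = measure([y * mu2 / mu1 for y in y1], z)
    # Redistribution: give period 2's distribution the mean of period 1
    p_redist = measure([y * mu1 / mu2 for y in y2], z)
    growth = p_growth - p_start
    redistribution = p_redist - p_start
    total = p_end - p_start
    return {
        "total": total,
        "growth": growth,
        "redistribution": redistribution,
        "residual": total - growth - redistribution,
    }


# Period 2 is period 1 scaled by 1.5, so the whole change is growth
result = datt_ravallion([40, 60, 100, 200], [60, 90, 150, 300], z=80)
print(result)
```

A Shapley-style variant averages the components computed with each period as the reference, which removes the residual term.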
Small Area Estimation
| Module | Description | Availability |
|---|---|---|
| sae | Unit-level small area estimation for poverty mapping using the ELL methodology | SSC |
| fhsae | Fay-Herriot area-level EBLUP small area estimation methods | SSC |
Econometrics & Estimation
| Module | Description | Availability |
|---|---|---|
| grqreg | Graph quantile regression coefficients across the distribution | SSC |
| factortest | Bartlett and Kaiser-Meyer-Olkin tests for factor analysis suitability | SSC |
| crtest | Cramer-Ridder test for pooling states in multinomial logit models | SSC |
| turnbull | Turnbull nonparametric estimator for willingness-to-pay from contingent valuation | SSC |
| spike | Spike model for zero willingness-to-pay in contingent valuation surveys | SSC |
Data Management & Utilities
| Module | Description | Availability |
|---|---|---|
| wbopendata | Access 29,000+ World Bank indicators from 51 databases via Stata | SSC |
| unicefdata | Download UNICEF child welfare indicators via SDMX API | SSC |
| groupfunction | Fast replacement for collapse supporting multiple aggregation functions | SSC |
| outtable | Export Stata matrix to LaTeX table with formatting options | SSC |
| yaml | Read, write, and manipulate YAML configuration files for reproducible workflows | SSC |
View all 24 modules on RePEc →
R Packages on CRAN
| Package | Description | Availability |
|---|---|---|
| unicefData | Download UNICEF child welfare indicators via SDMX API with consistent R interface | CRAN |
Python Packages on PyPI
| Package | Description | Availability |
|---|---|---|
| unicefdata | Download UNICEF child welfare indicators via SDMX API with query status codes, year filtering, and MRV support (v2.4) | PyPI |
| unicefstats-mcp | MCP server providing AI assistants access to 790+ UNICEF child-focused indicators across 200+ countries (EQA: 0.984) | PyPI |
| wb-api-repo | World Bank API data extraction scripts in Python | GitHub |
Other Projects: Reproducible & Scalable Analytics
Institutional projects I have led or co-developed for reproducible data access, education measurement, and scalable analytics.
Education & Learning Analytics
| Project | Description |
|---|---|
| LearningPoverty | Learning Poverty indicator: combines schooling and learning data to measure the share of children unable to read by age 10 |
| GLAD | Global Learning Assessment Database: harmonized learning assessment datasets at student and country level |
| EduAnalyticsToolkit | EduAnalytics team toolkit for data management, documentation, and analytics |
Utilities
| Project | Description |
|---|---|
| package | Stata module to create GitHub dissemination packages |
| useful_tweaks | Useful tweaks to user-written ado files |
View all repositories on GitHub →
All tools are open source and available for academic and research use.