Raw Reports and Studies Ingest Inventory

This audit governs the next ingest wave: raw/reports first, then raw/studies. The generated CSV is tracked at data/evidence/raw_ingest_inventory.csv; the JSON summary is tracked at data/evidence/raw_ingest_summary.json.

Counts

| Bucket  | PDF files |
|---------|-----------|
| Reports | 51        |
| Studies | 331       |
| Total   | 382       |

Source-Page Status

| Status              | Count |
|---------------------|-------|
| deferred_historical | 1     |
| matched_by_raw_path | 56    |
| matched_by_sha256   | 2     |
| no_source_page      | 318   |
| rejected_unusable   | 2     |
| related_artifact    | 2     |
| superseded_artifact | 1     |
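
The two match statuses suggest a lookup by recorded raw path with a fallback to a content hash. The following is a minimal sketch of that idea, not the audit's actual tooling: the function names, the `known_paths` and `known_hashes` mappings, and the path-before-hash precedence are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def match_status(raw_path, file_hash, known_paths, known_hashes):
    """Classify one PDF (hypothetical helper).

    known_paths:  raw path -> source page (assumed to be recorded on source pages)
    known_hashes: SHA-256 hex digest -> source page (assumed likewise)
    Returns a (status, source_page_or_None) tuple.
    """
    # Assumed precedence: an exact path match wins over a hash match.
    if raw_path in known_paths:
        return "matched_by_raw_path", known_paths[raw_path]
    # A hash match catches files that were renamed or moved after ingest.
    if file_hash in known_hashes:
        return "matched_by_sha256", known_hashes[file_hash]
    return "no_source_page", None
```

The remaining statuses (deferred_historical, rejected_unusable, related_artifact, superseded_artifact) reflect editorial judgment about file content and are not something a lookup like this could assign.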

Priority Buckets

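| Priority                        | Count |
|---------------------------------|-------|
| P0-report-regulatory-toxicology | 30    |
| P1-report-context               | 17    |
| P1-study-product-relevant       | 36    |
| P2-study-background             | 295   |
| P4-historical-draft             | 1     |
| P8-superseded                   | 1     |
| P9-filename-content-misfire     | 1     |
| P9-out-of-scope                 | 1     |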
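
The three breakdowns above should all describe the same set of PDFs, so their totals can be cross-checked mechanically. The sketch below hard-codes the counts from this inventory rather than reading the CSV or JSON, because the column and key names of those files are not stated here; the helper names and the "-study-" label convention are assumptions.

```python
# Counts copied from the tables in this inventory.
BUCKET_COUNTS = {"Reports": 51, "Studies": 331}

STATUS_COUNTS = {
    "deferred_historical": 1,
    "matched_by_raw_path": 56,
    "matched_by_sha256": 2,
    "no_source_page": 318,
    "rejected_unusable": 2,
    "related_artifact": 2,
    "superseded_artifact": 1,
}

PRIORITY_COUNTS = {
    "P0-report-regulatory-toxicology": 30,
    "P1-report-context": 17,
    "P1-study-product-relevant": 36,
    "P2-study-background": 295,
    "P4-historical-draft": 1,
    "P8-superseded": 1,
    "P9-filename-content-misfire": 1,
    "P9-out-of-scope": 1,
}


def total(counts):
    """Sum all counts in one breakdown."""
    return sum(counts.values())


def study_total(priority_counts):
    """Sum buckets whose label contains '-study-' (assumed naming convention)."""
    return sum(n for label, n in priority_counts.items() if "-study-" in label)


expected = total(BUCKET_COUNTS)
for name, counts in (("status", STATUS_COUNTS), ("priority", PRIORITY_COUNTS)):
    verdict = "OK" if total(counts) == expected else "MISMATCH"
    print(f"{name}: {total(counts)} of {expected} {verdict}")
```

With the figures as listed, both breakdowns sum to 382, and the two study-labelled priority buckets sum to 331, matching the Studies row.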

Reports Needing Source Pages First

All report PDFs currently have a source-page match.

Reports Deliberately Not Promoted as Independent Source Pages

  • raw/reports/2512.24601v2.pdf — rejected_unusable; do not ingest into Heavy Metal Index; file is an AI/LLM paper, not a heavy-metal food, toxicology, regulatory, ingredient, or standards source
  • raw/reports/ATSDR-2023-0004-0004_content.pdf — superseded_artifact; do not create a separate public source page unless historical draft provenance is needed; final ATSDR nickel profile is represented by wiki/sources/atsdr-nickel-toxprofile-2024.md
  • raw/reports/Biochem_Mercury_Ralston_2008.pdf — rejected_unusable; do not ingest as mercury evidence; file content is Mitsuhashi et al. 2008 on pyrogallol/polyphenol apoptosis, not the intended Ralston mercury source
  • raw/reports/EPA_IRIS_Cadmium_ToxicologicalReview.pdf — deferred_historical; retain as historical EPA cadmium reassessment draft; do not use as operative EPA IRIS value because it is marked external review draft/do not quote and was not finalized
  • raw/reports/EPA_IRIS_InorganicArsenic_Summary.pdf — related_artifact; treat as companion IRIS summary artifact for the full 2025 inorganic arsenic toxicological review; do not create duplicate source weight
  • raw/reports/EPA_IRIS_InorganicMercury_ToxReview.pdf — related_artifact; treat as mislabeled companion artifact for EPA IRIS methylmercury summary; do not create an inorganic-mercury source page from this file

Product-Relevant Study Queue

These study PDFs should be reviewed before lower-priority mechanistic/background papers because they are more likely to alter product, ingredient, or HMTc standards-development pages.
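
As an illustration of the review order implied by the priority buckets, the sketch below sorts inventory rows by the number in the bucket label. It assumes each row is a dict with a `priority` key; that key name, like the helper names, is hypothetical and may not match the CSV's actual columns.

```python
import re


def priority_rank(bucket: str) -> int:
    """Extract the numeric rank from a label such as 'P1-study-product-relevant'."""
    match = re.match(r"P(\d+)-", bucket)
    if match is None:
        raise ValueError(f"unrecognized priority bucket: {bucket!r}")
    return int(match.group(1))


def review_order(rows):
    """Return rows with lower-numbered priority buckets first.

    sorted() is stable, so rows keep their original relative order
    within the same rank.
    """
    return sorted(rows, key=lambda row: priority_rank(row["priority"]))
```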

Source-Page Gaps

  • raw/studies/FM_10447601_Characterisation_of_a_low_methane_emission_rice_cultivar_sui.pdf
  • raw/studies/FM_10954997_Lead_Cadmium_and_Arsenic_in_Raw_Milk_Produced_in_the_Vicinit.pdf
  • raw/studies/FM_11074271_Assessment_of_macro_trace_and_toxic_element_intake_from_rice.pdf
  • raw/studies/FM_11111292_Dietary_intake_of_methylmercury_by_0-5_years_children_using.pdf
  • raw/studies/FM_11876284_Assessing_trace_elements_in_soils_and_rice_insights_from_the.pdf
  • raw/studies/FM_12286494_Analysis_of_heavy_metal_content_in_protein_powders_available.pdf
  • raw/studies/FM_12507951_Hematological_Parameters_and_Mercury_Exposure_in_Children_Li.pdf
  • raw/studies/FM_12583816_Risk_assessment_of_heavy_metals_in_north_of_Iran_Sari_rice_a.pdf
  • raw/studies/FM_3765370_Health_risk_assessment_of_heavy_metals_in_rice_to_the_popula.pdf
  • raw/studies/FM_3958402_Tungsten_distribution_in_soil_and_rice_in_the_vicinity_of_th.pdf
  • raw/studies/FM_5331031_Arsenic_Transport_in_Rice_and_Biological_Solutions_to_Reduce.pdf
  • raw/studies/FM_5394917_Genetic_diversity_of_arsenic_accumulation_in_rice_and_QTL_an.pdf
  • raw/studies/FM_5425496_Urinary_Arsenic_Speciation_in_Children_and_Pregnant_Women_fr.pdf
  • raw/studies/FM_5940663_Infants_dietary_arsenic_exposure_during_transition_to_solid.pdf
  • raw/studies/FM_6505394_High_arsenic_in_rice_is_associated_with_elevated_genotoxic_e.pdf
  • raw/studies/FM_6825132_Rice_production_threatened_by_coupled_stresses_of_climate_an.pdf
  • raw/studies/FM_7121289_Environmental_Toxicology_Children_at_Risk.pdf
  • raw/studies/FM_7359620_Heavy_Metals_and_PAHs_in_Meat_Milk_and_Seafood_From_Augusta.pdf
  • raw/studies/FM_8082930_Maternal_methylmercury_exposure_through_rice_ingestion_and_c.pdf
  • raw/studies/FM_8164619_Levels_of_heavy_metal_cadmium_in_rice_Oryza_sativa_L_produce.pdf
  • raw/studies/FM_8207007_Developmental_toxicity_of_cadmium_in_infants_and_children_a.pdf
  • raw/studies/FM_8342936_Risk-Benefit_Assessment_of_Consumption_of_Rice_for_Adult_Men.pdf
  • raw/studies/FM_8891379_Measurement_of_Polycyclic_Aromatic_Hydrocarbons_in_Baby_Food.pdf
  • raw/studies/FM_8924265_Infant_infections_respiratory_symptoms_and_allergy_in_relati.pdf
  • raw/studies/FM_9271943_A_Narrative_Review_of_Toxic_Heavy_Metal_Content_of_Infant_an.pdf
  • raw/studies/FM_9439980_Essential_and_Non-essential_Trace_Elements_in_Milks_and_Plan.pdf
  • raw/studies/FM_9941557_Wheat_Selenium-binding_protein_TaSBP-A_enhances_cadmium_tole.pdf
  • raw/studies/lead-in-infant-formula-1992.pdf

Matched, Still Needing Completeness Audit

  • raw/studies/Content and Dietary Exposure Assessment of Toxic Elements in Infant Formulas from the Chinese Market.pdf — matched_by_raw_path; wiki/sources/chung2021-china-infant-formula-toxic-elements.md
  • raw/studies/FM_10375490_Arsenic_in_brown_rice_do_the_benefits_outweigh_the_risks.pdf — matched_by_raw_path; wiki/sources/su-2023-arsenic-brown-rice.md
  • raw/studies/FM_10883532_Chemical_characterization_of_baby_food_consumed_in_Italy.pdf — matched_by_raw_path; wiki/sources/meli2024-chemical-characterization-baby-food-italy.md
  • raw/studies/FM_11614607_Engineering_rice_Nramp5_modifies_cadmium_and_manganese_uptak.pdf — matched_by_raw_path; wiki/sources/inoue-2024-rice-nramp5-cadmium.md
  • raw/studies/FM_12378713_Arsenic_speciation_using_HPLC-ICP-MS_in_white_and_brown_rice.pdf — matched_by_raw_path; wiki/sources/navaretnam-2025-rice-as-speciation.md
  • raw/studies/FS102048 Infants metal survey FSIS pdf version.pdf — matched_by_raw_path; wiki/sources/fsa2016-infant-food-formula-metals-survey.md
  • raw/studies/Infants and young children s dietary exposures to lead and cadmium FDA total diet study 2018 2020.pdf — matched_by_raw_path; wiki/sources/spungen2024-fda-tds-infant-lead-cadmium.md
  • raw/studies/Toxic Metals and Metalloids in Infant Formulas Marketed in Brazil, and Child Health Risks According to the Target Hazard Quotients and Target Cancer Risk.pdf — matched_by_raw_path; wiki/sources/almeida2022-brazil-infant-formula-toxic-metals.md

Operating Rule

Do not treat a row as publicly ingested merely because it has a source page. A completed ingest requires source metadata, evidence-fitness classification, routeable value/claim extraction, page updates where warranted, and verification that ingredient-only findings live on ingredient pages rather than product pages.
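
The rule can be expressed as a checklist in which a source page is necessary but not sufficient. The sketch below is one possible encoding, not an existing project schema: the `IngestRow` class and its field names are hypothetical, chosen to mirror the five requirements listed above.

```python
from dataclasses import dataclass

# The five requirements named in the operating rule, as hypothetical field names.
REQUIRED_STEPS = (
    "has_source_metadata",
    "has_evidence_fitness_classification",
    "has_routeable_extraction",
    "page_updates_done_where_warranted",
    "ingredient_findings_on_ingredient_pages",
)


@dataclass
class IngestRow:
    raw_path: str
    has_source_page: bool = False
    has_source_metadata: bool = False
    has_evidence_fitness_classification: bool = False
    has_routeable_extraction: bool = False
    page_updates_done_where_warranted: bool = False
    ingredient_findings_on_ingredient_pages: bool = False


def missing_steps(row: IngestRow) -> list[str]:
    """List the required steps that are not yet complete for this row."""
    return [step for step in REQUIRED_STEPS if not getattr(row, step)]


def is_publicly_ingested(row: IngestRow) -> bool:
    """A source page alone is not enough; every required step must be done."""
    return row.has_source_page and not missing_steps(row)
```

A row with `has_source_page=True` and nothing else set would return False from `is_publicly_ingested`, with `missing_steps` naming all five outstanding items.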