{"gene":"INTS5","run_date":"2026-04-28T18:06:54","timeline":{"discoveries":[{"year":2005,"finding":"INTS5 is a subunit of the Integrator complex, a novel multi-subunit complex (at least 12 subunits) that associates with the C-terminal domain (CTD) of RNA polymerase II and mediates 3' end processing of U1 and U2 snRNAs. Integrator is recruited to snRNA genes and is evolutionarily conserved in metazoans.","method":"Affinity purification, co-immunoprecipitation, mass spectrometry, ChIP, in vitro processing assays","journal":"Cell","confidence":"High","confidence_rationale":"Tier 1-2 — foundational discovery paper using multiple orthogonal methods; highly cited","pmids":["16239144"],"is_preprint":false},{"year":2009,"finding":"Knockdown of zebrafish Ints5 (an ortholog of human INTS5) disrupts U1 and U2 snRNA 3' end processing, leading to aberrant splicing of smad1 and smad5 mRNAs, reduced expression of hematopoietic genes scl and gata1, and arrested red blood cell differentiation, establishing that Ints5 functions as part of the Integrator complex to regulate BMP/Smad signaling during hematopoiesis.","method":"Antisense morpholino knockdown in zebrafish, snRNA processing assays, RT-PCR splicing analysis, blood smear analysis","journal":"Development","confidence":"High","confidence_rationale":"Tier 2 — multiple orthogonal methods in a vertebrate model organism with clear mechanistic pathway placement","pmids":["19605500"],"is_preprint":false},{"year":2019,"finding":"In Drosophila, loss of intS5 (ortholog of human INTS5) in intermediate neural progenitors (INPs) causes their dedifferentiation back into type II neuroblasts. Cell-type-specific DamID revealed 1413 IntS5-binding sites in INPs, including the transcription factor earmuff (erm), whose expression is lost in intS5 mutants. IntS5 thus regulates neural progenitor identity by driving transcription of key differentiation factors.","method":"Genetic loss-of-function (mutant analysis), cell-type-specific DamID (DNA adenine methyltransferase identification), genetic interaction analysis","journal":"Cell reports","confidence":"High","confidence_rationale":"Tier 2 — multiple orthogonal methods including genomic binding analysis and genetic epistasis in Drosophila","pmids":["31018143"],"is_preprint":false},{"year":2020,"finding":"INTS5 and INTS8 form a distinct biochemical subcomplex within the larger Integrator complex, as demonstrated by biochemical characterization. The INTS4/9/11 ternary complex constitutes the catalytic core, while INTS5/8 is a separate module.","method":"Biochemical subcomplex purification, cryo-EM structural analysis","journal":"Molecular cell","confidence":"High","confidence_rationale":"Tier 1 — cryo-EM structure at 3.5 Å combined with biochemical subcomplex characterization","pmids":["33548203"],"is_preprint":false},{"year":2021,"finding":"The INTS5/8 subcomplex is part of the modular Integrator architecture. INTS5/8 forms a separable module distinct from the catalytic INTS4/9/11 core and from the INTS10/13/14 module, supporting a modular assembly model for the full Integrator complex.","method":"Cryo-EM, biochemical subcomplex reconstitution","journal":"Molecular cell","confidence":"High","confidence_rationale":"Tier 1 — structural and biochemical evidence from a single high-resolution cryo-EM study","pmids":["33548203"],"is_preprint":false},{"year":2024,"finding":"Structures of two human Integrator sub-complexes, INTS10/13/14/15 and INTS5/8/10/15, were determined, and INTS5/8 was shown to participate in a broader assembly that includes INTS10 and INTS15. An integrative model of fully assembled Integrator bound to RNAPII paused elongating complex was generated. INTS5/8 thus connects modules within the complete Integrator assembly.","method":"Cryo-EM structure determination, integrative structural modeling, in silico protein-protein interaction screening","journal":"Molecular cell","confidence":"High","confidence_rationale":"Tier 1 — high-resolution cryo-EM structures of sub-complexes with integrative modeling of full assembly","pmids":["38823386"],"is_preprint":false},{"year":2020,"finding":"INTS5 is a component of the INTAC complex (Integrator-PP2A), a cruciform-shaped assembly in which nine Integrator subunits and the PP2A core enzyme form a central scaffold. The complex has dual enzymatic activities: RNA endonuclease (INTS11) and protein phosphatase (PP2A-AC) that dephosphorylates RNAPII-CTD at Ser-2, -5, and -7 to regulate transcription.","method":"Cryo-EM at 3.5 Å, biochemical reconstitution, phosphatase activity assays, mass spectrometry","journal":"Science","confidence":"High","confidence_rationale":"Tier 1 — high-resolution structure combined with in vitro enzymatic assays and mass spectrometry","pmids":["33243860"],"is_preprint":false},{"year":2010,"finding":"INTS5 was identified as a component of the endogenous human Integrator complex by immunoprecipitation/mass spectrometry, confirming its membership in this transcriptional co-regulator complex in human cells.","method":"Immunoprecipitation followed by mass spectrometry (IP/MS) of endogenous complexes","journal":"Proceedings of the National Academy of Sciences of the United States of America","confidence":"Medium","confidence_rationale":"Tier 3 — single IP/MS study but using endogenous proteins with specificity filters","pmids":["20133760"],"is_preprint":false},{"year":2021,"finding":"The structure of the human Integrator-PP2A pretermination complex bound to paused RNAPII was determined, revealing that INTS5 (as part of the Integrator scaffold) participates in a complex that excludes elongation factors SPT6 and PAF1C, positions PP2A to dephosphorylate RNAPII-CTD, and positions the INTS11 endonuclease at the RNA exit site to cleave nascent RNA ~20 nucleotides from the active site.","method":"Cryo-EM structural analysis of pretermination complex, functional validation","journal":"Science","confidence":"High","confidence_rationale":"Tier 1 — high-resolution cryo-EM structure with functional mechanistic interpretation","pmids":["34762484"],"is_preprint":false}],"current_model":"INTS5 is a core subunit of the metazoan Integrator complex that forms a distinct INTS5/8 biochemical module (which also contacts INTS10 and INTS15) within the fully assembled complex; the Integrator associates with paused RNA polymerase II via its CTD, mediates 3' end processing of snRNAs, and—together with a PP2A phosphatase module in the INTAC assembly—attenuates transcription elongation by endonucleolytic cleavage of nascent RNA and dephosphorylation of RNAPII-CTD, with roles in snRNA biogenesis, BMP/Smad signaling (via correct splicing of Smad transcripts), and maintenance of neural progenitor identity."},"narrative":{"teleology":[{"year":2005,"claim":"The identification of INTS5 as a subunit of the newly discovered Integrator complex established for the first time that a dedicated RNAPII-CTD-associated machinery mediates metazoan snRNA 3′ end processing.","evidence":"Affinity purification, co-IP, mass spectrometry, ChIP, and in vitro processing assays in human cells","pmids":["16239144"],"confidence":"High","gaps":["Role of individual subunits including INTS5 within the complex was not resolved","No structural information on subunit organization","Functional scope beyond snRNA processing unknown"]},{"year":2009,"claim":"Demonstrating that loss of Ints5 disrupts snRNA processing and downstream splicing of Smad1/5 mRNAs established that the Integrator complex—through INTS5—feeds into BMP/Smad signaling and vertebrate hematopoiesis, extending its functional reach beyond snRNA biogenesis.","evidence":"Morpholino knockdown in zebrafish with snRNA processing assays, RT-PCR splicing analysis, and hematopoietic phenotyping","pmids":["19605500"],"confidence":"High","gaps":["Whether INTS5 has a specific molecular role in this pathway beyond Integrator complex integrity is unknown","Mammalian validation not performed"]},{"year":2019,"claim":"Cell-type-specific binding and genetic analysis of IntS5 in Drosophila neural progenitors revealed that the Integrator complex directly occupies and activates differentiation gene loci such as earmuff, establishing INTS5 as required for neural progenitor identity maintenance.","evidence":"Loss-of-function genetics, cell-type-specific DamID, and genetic epistasis in Drosophila type II neuroblast lineage","pmids":["31018143"],"confidence":"High","gaps":["Whether INTS5 has locus-specific recruitment determinants or acts genome-wide remains unclear","Mechanism by which Integrator loss triggers dedifferentiation not fully resolved"]},{"year":2020,"claim":"Structural and biochemical dissection revealed that INTS5/8 constitutes a separable module within the Integrator, and that INTS5 is part of the INTAC (Integrator–PP2A) assembly possessing dual endonuclease and protein phosphatase activities that dephosphorylate RNAPII-CTD, thereby defining the complex's transcriptional attenuation mechanism.","evidence":"Cryo-EM at 3.5 Å resolution, biochemical subcomplex reconstitution, phosphatase activity assays, and mass spectrometry","pmids":["33548203","33243860"],"confidence":"High","gaps":["Specific contacts INTS5 makes within the INTAC scaffold not fully mapped","Whether INTS5 contributes allosterically to endonuclease or phosphatase activity is untested"]},{"year":2021,"claim":"Determination of the Integrator–PP2A pretermination complex bound to paused RNAPII showed how INTS5 participates in a scaffold that excludes elongation factors and positions both the INTS11 endonuclease and PP2A phosphatase for coordinated transcription termination.","evidence":"Cryo-EM structure of the pretermination complex with functional interpretation","pmids":["34762484"],"confidence":"High","gaps":["Conformational dynamics of INTS5 during the transition from pausing to termination unresolved","In vivo contribution of INTS5 to the pretermination versus elongation decision not directly tested"]},{"year":2024,"claim":"Structures of the INTS5/8/10/15 sub-complex and an integrative model of the full Integrator on paused RNAPII clarified that INTS5/8 serves as a connector between the INTS10/15 periphery and the central scaffold, completing the modular assembly picture.","evidence":"Cryo-EM of INTS5/8/10/15 and INTS10/13/14/15 sub-complexes with integrative structural modeling","pmids":["38823386"],"confidence":"High","gaps":["INTS5 mutational analysis within the context of the full complex has not been performed","Whether distinct INTS5/8-containing sub-assemblies exist in vivo with different functions is unknown"]},{"year":null,"claim":"The specific molecular contribution of INTS5 beyond serving as a scaffold—whether it allosterically regulates INTS11 endonuclease or PP2A phosphatase activity, or has locus-specific targeting roles—remains to be determined.","evidence":"","pmids":[],"confidence":"High","gaps":["No INTS5-specific point mutations disrupting individual functions have been reported","No direct enzymatic activity attributed to INTS5 itself","No disease-causing mutations in INTS5 described"]}],"mechanism_profile":{"molecular_activity":[{"term_id":"GO:0005198","term_label":"structural molecule activity","supporting_discovery_ids":[3,4,5,6,8]}],"localization":[{"term_id":"GO:0005654","term_label":"nucleoplasm","supporting_discovery_ids":[0,2]},{"term_id":"GO:0005634","term_label":"nucleus","supporting_discovery_ids":[0,7]}],"pathway":[{"term_id":"R-HSA-74160","term_label":"Gene expression (Transcription)","supporting_discovery_ids":[0,2,6,8]},{"term_id":"R-HSA-8953854","term_label":"Metabolism of RNA","supporting_discovery_ids":[0,1]}],"complexes":["Integrator complex","INTAC (Integrator-PP2A)","INTS5/8 module"],"partners":["INTS8","INTS10","INTS15","INTS4","INTS9","INTS11","PP2A"],"other_free_text":[]},"mechanistic_narrative":"INTS5 is a core structural subunit of the metazoan Integrator complex, a multi-subunit assembly that associates with the C-terminal domain of RNA polymerase II to mediate 3′ end processing of snRNAs and regulate transcription elongation. Within the Integrator architecture, INTS5 and INTS8 form a discrete biochemical module that bridges to INTS10 and INTS15, connecting the catalytic endonuclease core (INTS4/9/11) to the broader scaffold of the Integrator–PP2A (INTAC) complex, which couples nascent RNA cleavage by INTS11 with PP2A-mediated dephosphorylation of RNAPII-CTD Ser-2, -5, and -7 to attenuate transcription [PMID:16239144, PMID:33548203, PMID:33243860, PMID:34762484, PMID:38823386]. Loss of INTS5 orthologs disrupts snRNA processing, leading to aberrant splicing of BMP/Smad pathway transcripts and blocked hematopoiesis in zebrafish, and causes dedifferentiation of neural progenitors in Drosophila through failure to sustain expression of key differentiation genes [PMID:19605500, PMID:31018143]."},"prefetch_data":{"uniprot":{"accession":"Q6P9B9","full_name":"Integrator complex subunit 5","aliases":[],"length_aa":1019,"mass_kda":108.0,"function":"Component of the integrator complex, a multiprotein complex that terminates RNA polymerase II (Pol II) transcription in the promoter-proximal region of genes (PubMed:33243860, PubMed:38570683). The integrator complex provides a quality checkpoint during transcription elongation by driving premature transcription termination of transcripts that are unfavorably configured for transcriptional elongation: the complex terminates transcription by (1) catalyzing dephosphorylation of the C-terminal domain (CTD) of Pol II subunit POLR2A/RPB1 and SUPT5H/SPT5, (2) degrading the exiting nascent RNA transcript via endonuclease activity and (3) promoting the release of Pol II from bound DNA (PubMed:33243860, PubMed:38570683). The integrator complex is also involved in terminating the synthesis of non-coding Pol II transcripts, such as enhancer RNAs (eRNAs), small nuclear RNAs (snRNAs), telomerase RNAs and long non-coding RNAs (lncRNAs) (PubMed:16239144). Mediates recruitment of cytoplasmic dynein to the nuclear envelope, probably as component of the integrator complex (PubMed:23904267)","subcellular_location":"Nucleus; Cytoplasm; Nucleus membrane","url":"https://www.uniprot.org/uniprotkb/Q6P9B9/entry"},"depmap":{"release":"DepMap","has_data":true,"is_common_essential":true,"resolved_as":"","url":"https://depmap.org/portal/gene/INTS5","classification":"Common Essential","n_dependent_lines":1191,"n_total_lines":1208,"dependency_fraction":0.9859271523178808},"opencell":{"profiled":true,"resolved_as":"","ensg_id":"ENSG00000185085","cell_line_id":"CID001871","localizations":[{"compartment":"nuclear_punctae","grade":3},{"compartment":"chromatin","grade":2}],"interactors":[{"gene":"INTS1","stoichiometry":0.2},{"gene":"POLR2K","stoichiometry":0.2},{"gene":"INTS9","stoichiometry":0.2},{"gene":"NELFE","stoichiometry":0.2}],"url":"https://opencell.sf.czbiohub.org/target/CID001871","total_profiled":1310},"omim":[{"mim_id":"611349","title":"INTEGRATOR COMPLEX SUBUNIT 5; INTS5","url":"https://www.omim.org/entry/611349"}],"hpa":{"profiled":true,"resolved_as":"","reliability":"Supported","locations":[{"location":"Nucleoplasm","reliability":"Supported"},{"location":"Cytosol","reliability":"Additional"}],"tissue_specificity":"Low tissue specificity","tissue_distribution":"Detected in all","driving_tissues":[],"url":"https://www.proteinatlas.org/search/INTS5"},"hgnc":{"alias_symbol":["INT5"],"prev_symbol":["KIAA1698"]},"alphafold":{"accession":"Q6P9B9","domains":[{"cath_id":"-","chopping":"827-1010","consensus_level":"high","plddt":84.3755,"start":827,"end":1010},{"cath_id":"1.20.190","chopping":"29-92_119-234","consensus_level":"high","plddt":86.1566,"start":29,"end":234}],"viewer_url":"https://alphafold.ebi.ac.uk/entry/Q6P9B9","model_url":"https://alphafold.ebi.ac.uk/files/AF-Q6P9B9-F1-model_v6.cif","pae_url":"https://alphafold.ebi.ac.uk/files/AF-Q6P9B9-F1-predicted_aligned_error_v6.png","plddt_mean":77.12},"mouse_models":{"mgi_url":"https://www.informatics.jax.org/marker/summary?nomen=INTS5","jax_strain_url":"https://www.jax.org/strain/search?query=INTS5"},"sequence":{"accession":"Q6P9B9","fasta_url":"https://rest.uniprot.org/uniprotkb/Q6P9B9.fasta","uniprot_url":"https://www.uniprot.org/uniprotkb/Q6P9B9/entry","alphafold_viewer_url":"https://alphafold.ebi.ac.uk/entry/Q6P9B9"}},"corpus_meta":[{"pmid":"8764102","id":"PMC_8764102","title":"Overexpression of int-5/aromatase in mammary glands of transgenic mice results in the induction of hyperplasia and nuclear abnormalities.","date":"1996","source":"Cancer research","url":"https://pubmed.ncbi.nlm.nih.gov/8764102","citation_count":91,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"7478601","id":"PMC_7478601","title":"The physical map of the human RET proto-oncogene.","date":"1995","source":"Oncogene","url":"https://pubmed.ncbi.nlm.nih.gov/7478601","citation_count":79,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"33548203","id":"PMC_33548203","title":"Structure of the catalytic core of the Integrator complex.","date":"2021","source":"Molecular cell","url":"https://pubmed.ncbi.nlm.nih.gov/33548203","citation_count":54,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"3475186","id":"PMC_3475186","title":"A fission yeast chromosome can replicate autonomously in mouse cells.","date":"1987","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/3475186","citation_count":53,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12620253","id":"PMC_12620253","title":"IGS sequence variation, group-I introns and the complete nuclear ribosomal DNA of the entomopathogenic fungus Metarhizium: excellent tools for isolate detection and phylogenetic analysis.","date":"2003","source":"Fungal genetics and biology : FG & B","url":"https://pubmed.ncbi.nlm.nih.gov/12620253","citation_count":41,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"19605500","id":"PMC_19605500","title":"The Integrator subunits function in hematopoiesis by modulating Smad/BMP signaling.","date":"2009","source":"Development (Cambridge, England)","url":"https://pubmed.ncbi.nlm.nih.gov/19605500","citation_count":38,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"38823386","id":"PMC_38823386","title":"Structural basis of the Integrator complex assembly and association with transcription factors.","date":"2024","source":"Molecular cell","url":"https://pubmed.ncbi.nlm.nih.gov/38823386","citation_count":25,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"1705320","id":"PMC_1705320","title":"Characterization of Int-5, a locus associated with early events in mammary carcinogenesis.","date":"1991","source":"Oncogene research","url":"https://pubmed.ncbi.nlm.nih.gov/1705320","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"31018143","id":"PMC_31018143","title":"The Integrator Complex Prevents Dedifferentiation of Intermediate Neural Progenitors back into Neural Stem Cells.","date":"2019","source":"Cell reports","url":"https://pubmed.ncbi.nlm.nih.gov/31018143","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"20522624","id":"PMC_20522624","title":"Molecular characterization of Mycobacterium intracellulare-related strains based on the sequence analysis of hsp65, internal transcribed spacer and 16S rRNA genes.","date":"2010","source":"Journal of medical microbiology","url":"https://pubmed.ncbi.nlm.nih.gov/20522624","citation_count":21,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"7812938","id":"PMC_7812938","title":"The nature and expression of int-5, a novel MMTV integration locus gene in carcinogen-induced mammary tumors.","date":"1994","source":"Cancer letters","url":"https://pubmed.ncbi.nlm.nih.gov/7812938","citation_count":20,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"33405298","id":"PMC_33405298","title":"iTRAQ-based proteomic analysis of the molecular mechanisms and downstream effects of fatty acid synthase in osteosarcoma cells.","date":"2021","source":"Journal of clinical laboratory analysis","url":"https://pubmed.ncbi.nlm.nih.gov/33405298","citation_count":14,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"26865931","id":"PMC_26865931","title":"A Large Cohort Study of Genotype and Phenotype Correlations of Beta- Thalassemia in Iranian Population.","date":"2015","source":"International journal of hematology-oncology and stem cell research","url":"https://pubmed.ncbi.nlm.nih.gov/26865931","citation_count":13,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"7874687","id":"PMC_7874687","title":"The overexpression of int-5/Aromatase, a novel MMTV integration locus gene, is responsible for D2 mammary tumor cell proliferation.","date":"1995","source":"Cancer letters","url":"https://pubmed.ncbi.nlm.nih.gov/7874687","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34094537","id":"PMC_34094537","title":"PA-Int5: An isatin-thiosemicarbazone derivative that exhibits anti-nociceptive and anti-inflammatory effects in Swiss mice.","date":"2021","source":"Biomedical reports","url":"https://pubmed.ncbi.nlm.nih.gov/34094537","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"29949961","id":"PMC_29949961","title":"DisruPPI: structure-based computational redesign algorithm for protein binding disruption.","date":"2018","source":"Bioinformatics (Oxford, England)","url":"https://pubmed.ncbi.nlm.nih.gov/29949961","citation_count":10,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12027830","id":"PMC_12027830","title":"Western blotting analysis of the beta-hexosaminidase alpha- and beta-subunits in cultured fibroblasts from cases of various forms of GM2 gangliosidosis.","date":"2002","source":"Acta neurologica Scandinavica","url":"https://pubmed.ncbi.nlm.nih.gov/12027830","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23122059","id":"PMC_23122059","title":"Detection of genetic association and functional polymorphisms of UGDH affecting milk production trait in Chinese Holstein cattle.","date":"2012","source":"BMC genomics","url":"https://pubmed.ncbi.nlm.nih.gov/23122059","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23065270","id":"PMC_23065270","title":"Novel single nucleotide polymorphisms of bovine SREBP1 gene is association with fatty acid composition and marbling score in commercial Korean cattle (Hanwoo).","date":"2012","source":"Molecular biology reports","url":"https://pubmed.ncbi.nlm.nih.gov/23065270","citation_count":6,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"27031100","id":"PMC_27031100","title":"Molecular Taxonomic Evidence for Two Distinct Genotypes of Mycobacterium yongonense via Genome-Based Phylogenetic Analysis.","date":"2016","source":"PloS one","url":"https://pubmed.ncbi.nlm.nih.gov/27031100","citation_count":6,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"7632740","id":"PMC_7632740","title":"Structure of the int-5, a novel MMTV integration genomic locus containing mouse early transposon LTR homology region.","date":"1995","source":"Biochimica et biophysica acta","url":"https://pubmed.ncbi.nlm.nih.gov/7632740","citation_count":5,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"19493003","id":"PMC_19493003","title":"Bacteriophage content of M49 strains of Streptococcus pyogenes.","date":"2009","source":"FEMS microbiology letters","url":"https://pubmed.ncbi.nlm.nih.gov/19493003","citation_count":5,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"16158289","id":"PMC_16158289","title":"Coordinates, DNA content and heterogeneity of cell nuclei and segments of the Caenorhabditis elegans intestine.","date":"2005","source":"Histochemistry and cell biology","url":"https://pubmed.ncbi.nlm.nih.gov/16158289","citation_count":4,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25536767","id":"PMC_25536767","title":"[Frequency of diseases caused by group A streptococci among invasive infections of soft tissues and characteristics of the causative agent].","date":"2014","source":"Zhurnal mikrobiologii, epidemiologii i immunobiologii","url":"https://pubmed.ncbi.nlm.nih.gov/25536767","citation_count":1,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12477932","id":"PMC_12477932","title":"Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences.","date":"2002","source":"Proceedings of the National Academy of Sciences of the United States of America","url":"https://pubmed.ncbi.nlm.nih.gov/12477932","citation_count":1479,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"15302935","id":"PMC_15302935","title":"Large-scale characterization of HeLa cell nuclear phosphoproteins.","date":"2004","source":"Proceedings of the National Academy of Sciences of the United States of America","url":"https://pubmed.ncbi.nlm.nih.gov/15302935","citation_count":1159,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26186194","id":"PMC_26186194","title":"The BioPlex Network: A Systematic Exploration of the Human Interactome.","date":"2015","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/26186194","citation_count":1118,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"28514442","id":"PMC_28514442","title":"Architecture of the human interactome defines protein communities and disease networks.","date":"2017","source":"Nature","url":"https://pubmed.ncbi.nlm.nih.gov/28514442","citation_count":1085,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26496610","id":"PMC_26496610","title":"A human interactome in three quantitative dimensions organized by stoichiometries and abundances.","date":"2015","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/26496610","citation_count":1015,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"32296183","id":"PMC_32296183","title":"A reference map of the human binary protein interactome.","date":"2020","source":"Nature","url":"https://pubmed.ncbi.nlm.nih.gov/32296183","citation_count":849,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"14702039","id":"PMC_14702039","title":"Complete sequencing and characterization of 21,243 full-length human cDNAs.","date":"2003","source":"Nature genetics","url":"https://pubmed.ncbi.nlm.nih.gov/14702039","citation_count":754,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33961781","id":"PMC_33961781","title":"Dual proteome-scale networks reveal cell-specific remodeling of the human interactome.","date":"2021","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/33961781","citation_count":705,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"21873635","id":"PMC_21873635","title":"Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.","date":"2011","source":"Briefings in bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/21873635","citation_count":656,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33845483","id":"PMC_33845483","title":"Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV.","date":"2021","source":"Nature","url":"https://pubmed.ncbi.nlm.nih.gov/33845483","citation_count":532,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"16239144","id":"PMC_16239144","title":"Integrator, a multiprotein mediator of small nuclear RNA processing, associates with the C-terminal repeat of RNA polymerase II.","date":"2005","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/16239144","citation_count":443,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"15489334","id":"PMC_15489334","title":"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).","date":"2004","source":"Genome research","url":"https://pubmed.ncbi.nlm.nih.gov/15489334","citation_count":438,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"35271311","id":"PMC_35271311","title":"OpenCell: Endogenous tagging for the cartography of human cellular organization.","date":"2022","source":"Science (New York, N.Y.)","url":"https://pubmed.ncbi.nlm.nih.gov/35271311","citation_count":432,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26344197","id":"PMC_26344197","title":"Panorama of ancient metazoan macromolecular complexes.","date":"2015","source":"Nature","url":"https://pubmed.ncbi.nlm.nih.gov/26344197","citation_count":407,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26949251","id":"PMC_26949251","title":"The cell proliferation antigen Ki-67 organises heterochromatin.","date":"2016","source":"eLife","url":"https://pubmed.ncbi.nlm.nih.gov/26949251","citation_count":265,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"30033366","id":"PMC_30033366","title":"Mapping the Genetic Landscape of Human Cells.","date":"2018","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/30033366","citation_count":225,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"34048709","id":"PMC_34048709","title":"N6-Methyladenosine on mRNA facilitates a phase-separated nuclear body that suppresses myeloid leukemic differentiation.","date":"2021","source":"Cancer cell","url":"https://pubmed.ncbi.nlm.nih.gov/34048709","citation_count":207,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"30804502","id":"PMC_30804502","title":"H4K20me0 recognition by BRCA1-BARD1 directs homologous recombination to sister chromatids.","date":"2019","source":"Nature cell biology","url":"https://pubmed.ncbi.nlm.nih.gov/30804502","citation_count":162,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"34004147","id":"PMC_34004147","title":"The PP2A-Integrator-CDK9 axis fine-tunes transcription and can be targeted therapeutically in cancer.","date":"2021","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/34004147","citation_count":152,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33243860","id":"PMC_33243860","title":"Identification of Integrator-PP2A complex (INTAC), an RNA polymerase II phosphatase.","date":"2020","source":"Science (New York, N.Y.)","url":"https://pubmed.ncbi.nlm.nih.gov/33243860","citation_count":151,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"34762484","id":"PMC_34762484","title":"Structural basis of Integrator-mediated transcription regulation.","date":"2021","source":"Science (New York, N.Y.)","url":"https://pubmed.ncbi.nlm.nih.gov/34762484","citation_count":112,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"20133760","id":"PMC_20133760","title":"Streamlined analysis schema for high-throughput identification of endogenous protein complexes.","date":"2010","source":"Proceedings of the National Academy of Sciences of the United States of America","url":"https://pubmed.ncbi.nlm.nih.gov/20133760","citation_count":102,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"32697989","id":"PMC_32697989","title":"The Human Integrator Complex Facilitates Transcriptional Elongation by Endonucleolytic Cleavage of Nascent Transcripts.","date":"2020","source":"Cell reports","url":"https://pubmed.ncbi.nlm.nih.gov/32697989","citation_count":94,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"29180619","id":"PMC_29180619","title":"The STUbL RNF4 regulates protein group SUMOylation by targeting the SUMO conjugation machinery.","date":"2017","source":"Nature communications","url":"https://pubmed.ncbi.nlm.nih.gov/29180619","citation_count":86,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"11214970","id":"PMC_11214970","title":"Prediction of the coding sequences of unidentified human genes. XIX. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro.","date":"2000","source":"DNA research : an international journal for rapid publication of reports on genes and genomes","url":"https://pubmed.ncbi.nlm.nih.gov/11214970","citation_count":81,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"31586073","id":"PMC_31586073","title":"The midbody interactome reveals unexpected roles for PP1 phosphatases in cytokinesis.","date":"2019","source":"Nature communications","url":"https://pubmed.ncbi.nlm.nih.gov/31586073","citation_count":74,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"28330616","id":"PMC_28330616","title":"Systematic Analysis of Human Protein Phosphatase Interactions and Dynamics.","date":"2017","source":"Cell systems","url":"https://pubmed.ncbi.nlm.nih.gov/28330616","citation_count":65,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"25921289","id":"PMC_25921289","title":"Temporal proteomics of NGF-TrkA signaling identifies an inhibitory role for the E3 ligase Cbl-b in neuroblastoma cell differentiation.","date":"2015","source":"Science signaling","url":"https://pubmed.ncbi.nlm.nih.gov/25921289","citation_count":61,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"29844126","id":"PMC_29844126","title":"The Tumor Suppressor CIC Directly Regulates MAPK Pathway Genes via Histone Deacetylation.","date":"2018","source":"Cancer research","url":"https://pubmed.ncbi.nlm.nih.gov/29844126","citation_count":60,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"32647223","id":"PMC_32647223","title":"INTS10-INTS13-INTS14 form a functional module of Integrator that binds nucleic acids and the cleavage module.","date":"2020","source":"Nature communications","url":"https://pubmed.ncbi.nlm.nih.gov/32647223","citation_count":52,"is_preprint":false,"source_track":"gene2pubmed"}],"cost":{"stage1":{"model":"claude-sonnet-4-6","input_tokens":12527,"output_tokens":1212,"usd":0.02788},"stage2":{"model":"claude-opus-4-6","input_tokens":4427,"output_tokens":1714,"usd":0.097477},"total_usd":0.299219,"stage1_batch_id":"msgbatch_01DzrsgVgYngY1hvM3DU1yF2","stage2_batch_id":"msgbatch_01KA9b7iRzKFdhoBUT7vCvSe","note":"batch pricing = 50% of standard","round2_stage1":{"model":"claude-sonnet-4-6","input_tokens":24568,"output_tokens":2193,"usd":0.053299},"round2_rules_fired":"R3","round2_stage2":{"model":"claude-opus-4-6","input_tokens":5735,"output_tokens":2068,"usd":0.120563}},"stage1_raw":"```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2021,\n      \"finding\": \"INTS5 forms a discrete subcomplex with INTS8 (INTS5/8) within the Integrator complex, distinct from the INTS10/13/14 subcomplex, as demonstrated by biochemical characterization. The catalytic core (INTS4/9/11) structure was resolved by cryo-EM at 3.5 Å, revealing that INTS4, a helical repeat protein, stabilizes the nuclease domains of INTS11 and its catalytically impaired homolog INTS9, and that all three proteins form a composite electropositive groove suggesting a putative RNA binding path.\",\n      \"method\": \"Biochemical subcomplex characterization, cryo-EM structure determination (3.5 Å resolution)\",\n      \"journal\": \"Molecular cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — cryo-EM structure with biochemical subcomplex validation in a single rigorous study\",\n      \"pmids\": [\"33548203\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2024,\n      \"finding\": \"INTS5 participates in a subcomplex with INTS8, INTS10, and INTS15 (INTS5/8/10/15) within the fully assembled Integrator complex. Structural analysis via cryo-EM and integrative modelling of the fully assembled Integrator bound to the paused RNA Pol II elongating complex (PEC) showed that INTS5/8 contributes to the overall architecture of Integrator recruited at promoter-proximal paused RNAPII.\",\n      \"method\": \"Cryo-EM structure determination, integrative structural modelling, in silico protein-protein interaction screening\",\n      \"journal\": \"Molecular cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — cryo-EM structures of subcomplexes with integrative full-complex model\",\n      \"pmids\": [\"38823386\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2009,\n      \"finding\": \"In zebrafish, morpholino-mediated knockdown of Integrator subunit 5 (Ints5) impairs U1 and U2 snRNA 3' end processing, leading to aberrant splicing of smad1 and smad5 RNA, reduced expression of hematopoietic genes scl and gata1, and arrested red blood cell differentiation, placing INTS5 in the Integrator complex upstream of BMP/Smad signaling during hematopoiesis.\",\n      \"method\": \"Antisense morpholino knockdown in zebrafish embryos, snRNA processing assay, RT-PCR for splicing defects, blood smear analysis\",\n      \"journal\": \"Development (Cambridge, England)\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — clean KO with defined cellular phenotype and pathway placement, single lab study\",\n      \"pmids\": [\"19605500\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2019,\n      \"finding\": \"In Drosophila, loss of intS5 generates ectopic type II neuroblasts from intermediate neural progenitors (INPs), demonstrating that the Integrator complex (including IntS5) prevents INP dedifferentiation back into neural stem cells. Cell-type-specific DamID identified 1413 IntS5-binding sites in INPs including the earmuff (erm) transcription factor locus, and erm expression is lost in intS5 mutants, placing IntS5 upstream of Erm in the pathway suppressing dedifferentiation.\",\n      \"method\": \"Genetic loss-of-function (mutant analysis), cell-type-specific DamID chromatin binding, genetic epistasis (intS8 × erm double mutant)\",\n      \"journal\": \"Cell reports\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — multiple orthogonal methods (genetics, DamID, epistasis) in a single study; Drosophila ortholog\",\n      \"pmids\": [\"31018143\"],\n      \"is_preprint\": false\n    }\n  ],\n  \"current_model\": \"INTS5 is a core structural subunit of the Integrator complex that forms a discrete INTS5/8 (and INTS5/8/10/15) subcomplex, contributes to the overall architecture of the fully assembled Integrator at promoter-proximally paused RNA Pol II, and is required for proper 3' end processing of snRNAs — a function that places it upstream of BMP/Smad splicing and hematopoiesis in vertebrates, and upstream of the transcription factor Earmuff to prevent neural progenitor dedifferentiation in Drosophila.\"\n}\n```\n\n--- ROUND 2 (R2/R3, augmented corpus) ---\n\n```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2005,\n      \"finding\": \"INTS5 is a subunit of the Integrator complex, a novel multi-subunit complex (at least 12 subunits) that associates with the C-terminal domain (CTD) of RNA polymerase II and mediates 3' end processing of U1 and U2 snRNAs. Integrator is recruited to snRNA genes and is evolutionarily conserved in metazoans.\",\n      \"method\": \"Affinity purification, co-immunoprecipitation, mass spectrometry, ChIP, in vitro processing assays\",\n      \"journal\": \"Cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1-2 — foundational discovery paper using multiple orthogonal methods; highly cited\",\n      \"pmids\": [\"16239144\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2009,\n      \"finding\": \"Knockdown of zebrafish Ints5 (an ortholog of human INTS5) disrupts U1 and U2 snRNA 3' end processing, leading to aberrant splicing of smad1 and smad5 mRNAs, reduced expression of hematopoietic genes scl and gata1, and arrested red blood cell differentiation, establishing that Ints5 functions as part of the Integrator complex to regulate BMP/Smad signaling during hematopoiesis.\",\n      \"method\": \"Antisense morpholino knockdown in zebrafish, snRNA processing assays, RT-PCR splicing analysis, blood smear analysis\",\n      \"journal\": \"Development\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple orthogonal methods in a vertebrate model organism with clear mechanistic pathway placement\",\n      \"pmids\": [\"19605500\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2019,\n      \"finding\": \"In Drosophila, loss of intS5 (ortholog of human INTS5) in intermediate neural progenitors (INPs) causes their dedifferentiation back into type II neuroblasts. Cell-type-specific DamID revealed 1413 IntS5-binding sites in INPs, including the transcription factor earmuff (erm), whose expression is lost in intS5 mutants. IntS5 thus regulates neural progenitor identity by driving transcription of key differentiation factors.\",\n      \"method\": \"Genetic loss-of-function (mutant analysis), cell-type-specific DamID (DNA adenine methyltransferase identification), genetic interaction analysis\",\n      \"journal\": \"Cell reports\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple orthogonal methods including genomic binding analysis and genetic epistasis in Drosophila\",\n      \"pmids\": [\"31018143\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"INTS5 and INTS8 form a distinct biochemical subcomplex within the larger Integrator complex, as demonstrated by biochemical characterization. The INTS4/9/11 ternary complex constitutes the catalytic core, while INTS5/8 is a separate module.\",\n      \"method\": \"Biochemical subcomplex purification, cryo-EM structural analysis\",\n      \"journal\": \"Molecular cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — cryo-EM structure at 3.5 Å combined with biochemical subcomplex characterization\",\n      \"pmids\": [\"33548203\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2021,\n      \"finding\": \"The INTS5/8 subcomplex is part of the modular Integrator architecture. INTS5/8 forms a separable module distinct from the catalytic INTS4/9/11 core and from the INTS10/13/14 module, supporting a modular assembly model for the full Integrator complex.\",\n      \"method\": \"Cryo-EM, biochemical subcomplex reconstitution\",\n      \"journal\": \"Molecular cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — structural and biochemical evidence from a single high-resolution cryo-EM study\",\n      \"pmids\": [\"33548203\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2024,\n      \"finding\": \"Structures of two human Integrator sub-complexes, INTS10/13/14/15 and INTS5/8/10/15, were determined, and INTS5/8 was shown to participate in a broader assembly that includes INTS10 and INTS15. An integrative model of fully assembled Integrator bound to RNAPII paused elongating complex was generated. INTS5/8 thus connects modules within the complete Integrator assembly.\",\n      \"method\": \"Cryo-EM structure determination, integrative structural modeling, in silico protein-protein interaction screening\",\n      \"journal\": \"Molecular cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — high-resolution cryo-EM structures of sub-complexes with integrative modeling of full assembly\",\n      \"pmids\": [\"38823386\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"INTS5 is a component of the INTAC complex (Integrator-PP2A), a cruciform-shaped assembly in which nine Integrator subunits and the PP2A core enzyme form a central scaffold. The complex has dual enzymatic activities: RNA endonuclease (INTS11) and protein phosphatase (PP2A-AC) that dephosphorylates RNAPII-CTD at Ser-2, -5, and -7 to regulate transcription.\",\n      \"method\": \"Cryo-EM at 3.5 Å, biochemical reconstitution, phosphatase activity assays, mass spectrometry\",\n      \"journal\": \"Science\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — high-resolution structure combined with in vitro enzymatic assays and mass spectrometry\",\n      \"pmids\": [\"33243860\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2010,\n      \"finding\": \"INTS5 was identified as a component of the endogenous human Integrator complex by immunoprecipitation/mass spectrometry, confirming its membership in this transcriptional co-regulator complex in human cells.\",\n      \"method\": \"Immunoprecipitation followed by mass spectrometry (IP/MS) of endogenous complexes\",\n      \"journal\": \"Proceedings of the National Academy of Sciences of the United States of America\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 3 — single IP/MS study but using endogenous proteins with specificity filters\",\n      \"pmids\": [\"20133760\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2021,\n      \"finding\": \"The structure of the human Integrator-PP2A pretermination complex bound to paused RNAPII was determined, revealing that INTS5 (as part of the Integrator scaffold) participates in a complex that excludes elongation factors SPT6 and PAF1C, positions PP2A to dephosphorylate RNAPII-CTD, and positions the INTS11 endonuclease at the RNA exit site to cleave nascent RNA ~20 nucleotides from the active site.\",\n      \"method\": \"Cryo-EM structural analysis of pretermination complex, functional validation\",\n      \"journal\": \"Science\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — high-resolution cryo-EM structure with functional mechanistic interpretation\",\n      \"pmids\": [\"34762484\"],\n      \"is_preprint\": false\n    }\n  ],\n  \"current_model\": \"INTS5 is a core subunit of the metazoan Integrator complex that forms a distinct INTS5/8 biochemical module (which also contacts INTS10 and INTS15) within the fully assembled complex; the Integrator associates with paused RNA polymerase II via its CTD, mediates 3' end processing of snRNAs, and—together with a PP2A phosphatase module in the INTAC assembly—attenuates transcription elongation by endonucleolytic cleavage of nascent RNA and dephosphorylation of RNAPII-CTD, with roles in snRNA biogenesis, BMP/Smad signaling (via correct splicing of Smad transcripts), and maintenance of neural progenitor identity.\"\n}\n```","stage2_raw":"```json\n{\n  \"mechanistic_narrative\": \"INTS5 is a core structural subunit of the Integrator complex that forms a discrete subcomplex with INTS8 (and, within the fully assembled complex, with INTS8/10/15), contributing to the overall architecture of Integrator recruited to promoter-proximally paused RNA Polymerase II [PMID:33548203, PMID:38823386]. INTS5 is required for proper 3′ end processing of snRNAs; loss of INTS5 in zebrafish impairs U1 and U2 snRNA maturation, causing aberrant splicing of smad1/smad5 transcripts, reduced hematopoietic gene expression, and arrested erythropoiesis, thereby linking Integrator-dependent snRNA processing to BMP/Smad signaling during hematopoiesis [PMID:19605500]. In Drosophila, IntS5 binds the earmuff (erm) transcription factor locus in intermediate neural progenitors and is required for erm expression; loss of intS5 causes ectopic dedifferentiation of neural progenitors into neuroblasts, establishing a role for Integrator in maintaining neural progenitor identity [PMID:31018143].\",\n  \"teleology\": [\n    {\n      \"year\": 2009,\n      \"claim\": \"Establishing that INTS5 is functionally required for snRNA 3′ end processing and that this activity is upstream of BMP/Smad-dependent hematopoiesis answered the question of whether individual Integrator subunits have non-redundant developmental roles.\",\n      \"evidence\": \"Morpholino knockdown of Ints5 in zebrafish embryos with snRNA processing assays, RT-PCR, and blood cell analysis\",\n      \"pmids\": [\"19605500\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Morpholino knockdown may have off-target effects; genetic null confirmation was not provided\",\n        \"Whether INTS5 acts catalytically or structurally in snRNA processing was not resolved\",\n        \"Direct versus indirect effects on smad1/smad5 splicing were not distinguished\"\n      ]\n    },\n    {\n      \"year\": 2019,\n      \"claim\": \"Demonstrating that IntS5 binds the erm locus in neural progenitors and that its loss causes progenitor-to-stem-cell dedifferentiation established a chromatin-level mechanism by which Integrator maintains differentiated cell identity in the nervous system.\",\n      \"evidence\": \"Drosophila intS5 mutant analysis, cell-type-specific DamID profiling in INPs, genetic epistasis with erm\",\n      \"pmids\": [\"31018143\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Whether the dedifferentiation phenotype arises from snRNA processing defects or direct transcriptional regulation was not resolved\",\n        \"DamID identifies proximity, not direct DNA binding; whether IntS5 itself contacts the erm locus is unknown\",\n        \"Mammalian conservation of this neural progenitor maintenance function has not been tested\"\n      ]\n    },\n    {\n      \"year\": 2021,\n      \"claim\": \"Biochemical isolation of a discrete INTS5/8 subcomplex within Integrator, alongside the cryo-EM structure of the catalytic module, clarified the modular organization of the complex and positioned INTS5/8 as a distinct architectural arm separate from the catalytic core (INTS4/9/11).\",\n      \"evidence\": \"Biochemical subcomplex purification and cryo-EM structure determination at 3.5 Å resolution\",\n      \"pmids\": [\"33548203\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"The structure of the INTS5/8 subcomplex itself was not resolved at high resolution\",\n        \"How INTS5/8 communicates with the INTS4/9/11 catalytic core during RNA cleavage was not determined\"\n      ]\n    },\n    {\n      \"year\": 2024,\n      \"claim\": \"Cryo-EM and integrative modelling of the fully assembled Integrator on paused Pol II revealed that INTS5 participates in a four-subunit module (INTS5/8/10/15) and contributes to the architecture of the holoenzyme at the pause site, resolving how INTS5 is positioned relative to its elongation complex substrate.\",\n      \"evidence\": \"Cryo-EM structure determination and integrative modelling of the Integrator–paused elongation complex\",\n      \"pmids\": [\"38823386\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"The specific contacts INTS5 makes with Pol II or nascent RNA remain unresolved at atomic detail\",\n        \"Functional consequences of disrupting INTS5 within the INTS5/8/10/15 module have not been tested in the structural context\"\n      ]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"It remains unknown whether INTS5 contributes to Integrator functions beyond snRNA processing — such as transcription attenuation and enhancer RNA turnover — and whether its architectural role is purely structural or involves allosteric regulation of the INTS9/11 endonuclease.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"Low\",\n      \"gaps\": [\n        \"No reconstitution experiments have tested whether INTS5 allosterically regulates INTS11 cleavage activity\",\n        \"Mammalian genetic knockouts with transcriptomic analysis are lacking\",\n        \"Role in Integrator-mediated transcription attenuation at protein-coding genes has not been assessed\"\n      ]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0005198\", \"supporting_discovery_ids\": [0, 1]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005654\", \"supporting_discovery_ids\": [1]},\n      {\"term_id\": \"GO:0005694\", \"supporting_discovery_ids\": [3]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-8953854\", \"supporting_discovery_ids\": [0, 2]},\n      {\"term_id\": \"R-HSA-74160\", \"supporting_discovery_ids\": [1, 3]},\n      {\"term_id\": \"R-HSA-1266738\", \"supporting_discovery_ids\": [2, 3]}\n    ],\n    \"complexes\": [\n      \"Integrator complex\",\n      \"INTS5/8 subcomplex\",\n      \"INTS5/8/10/15 subcomplex\"\n    ],\n    \"partners\": [\n      \"INTS8\",\n      \"INTS10\",\n      \"INTS15\"\n    ],\n    \"other_free_text\": []\n  }\n}\n```\n\n--- ROUND 2 (R2/R3, augmented corpus) ---\n\n```json\n{\n  \"mechanistic_narrative\": \"INTS5 is a core structural subunit of the metazoan Integrator complex, a multi-subunit assembly that associates with the C-terminal domain of RNA polymerase II to mediate 3′ end processing of snRNAs and regulate transcription elongation. Within the Integrator architecture, INTS5 and INTS8 form a discrete biochemical module that bridges to INTS10 and INTS15, connecting the catalytic endonuclease core (INTS4/9/11) to the broader scaffold of the Integrator–PP2A (INTAC) complex, which couples nascent RNA cleavage by INTS11 with PP2A-mediated dephosphorylation of RNAPII-CTD Ser-2, -5, and -7 to attenuate transcription [PMID:16239144, PMID:33548203, PMID:33243860, PMID:34762484, PMID:38823386]. Loss of INTS5 orthologs disrupts snRNA processing, leading to aberrant splicing of BMP/Smad pathway transcripts and blocked hematopoiesis in zebrafish, and causes dedifferentiation of neural progenitors in Drosophila through failure to sustain expression of key differentiation genes [PMID:19605500, PMID:31018143].\",\n  \"teleology\": [\n    {\n      \"year\": 2005,\n      \"claim\": \"The identification of INTS5 as a subunit of the newly discovered Integrator complex established for the first time that a dedicated RNAPII-CTD-associated machinery mediates metazoan snRNA 3′ end processing.\",\n      \"evidence\": \"Affinity purification, co-IP, mass spectrometry, ChIP, and in vitro processing assays in human cells\",\n      \"pmids\": [\"16239144\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Role of individual subunits including INTS5 within the complex was not resolved\",\n        \"No structural information on subunit organization\",\n        \"Functional scope beyond snRNA processing unknown\"\n      ]\n    },\n    {\n      \"year\": 2009,\n      \"claim\": \"Demonstrating that loss of Ints5 disrupts snRNA processing and downstream splicing of Smad1/5 mRNAs established that the Integrator complex—through INTS5—feeds into BMP/Smad signaling and vertebrate hematopoiesis, extending its functional reach beyond snRNA biogenesis.\",\n      \"evidence\": \"Morpholino knockdown in zebrafish with snRNA processing assays, RT-PCR splicing analysis, and hematopoietic phenotyping\",\n      \"pmids\": [\"19605500\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Whether INTS5 has a specific molecular role in this pathway beyond Integrator complex integrity is unknown\",\n        \"Mammalian validation not performed\"\n      ]\n    },\n    {\n      \"year\": 2019,\n      \"claim\": \"Cell-type-specific binding and genetic analysis of IntS5 in Drosophila neural progenitors revealed that the Integrator complex directly occupies and activates differentiation gene loci such as earmuff, establishing INTS5 as required for neural progenitor identity maintenance.\",\n      \"evidence\": \"Loss-of-function genetics, cell-type-specific DamID, and genetic epistasis in Drosophila type II neuroblast lineage\",\n      \"pmids\": [\"31018143\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Whether INTS5 has locus-specific recruitment determinants or acts genome-wide remains unclear\",\n        \"Mechanism by which Integrator loss triggers dedifferentiation not fully resolved\"\n      ]\n    },\n    {\n      \"year\": 2020,\n      \"claim\": \"Structural and biochemical dissection revealed that INTS5/8 constitutes a separable module within the Integrator, and that INTS5 is part of the INTAC (Integrator–PP2A) assembly possessing dual endonuclease and protein phosphatase activities that dephosphorylate RNAPII-CTD, thereby defining the complex's transcriptional attenuation mechanism.\",\n      \"evidence\": \"Cryo-EM at 3.5 Å resolution, biochemical subcomplex reconstitution, phosphatase activity assays, and mass spectrometry\",\n      \"pmids\": [\"33548203\", \"33243860\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Specific contacts INTS5 makes within the INTAC scaffold not fully mapped\",\n        \"Whether INTS5 contributes allosterically to endonuclease or phosphatase activity is untested\"\n      ]\n    },\n    {\n      \"year\": 2021,\n      \"claim\": \"Determination of the Integrator–PP2A pretermination complex bound to paused RNAPII showed how INTS5 participates in a scaffold that excludes elongation factors and positions both the INTS11 endonuclease and PP2A phosphatase for coordinated transcription termination.\",\n      \"evidence\": \"Cryo-EM structure of the pretermination complex with functional interpretation\",\n      \"pmids\": [\"34762484\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Conformational dynamics of INTS5 during the transition from pausing to termination unresolved\",\n        \"In vivo contribution of INTS5 to the pretermination versus elongation decision not directly tested\"\n      ]\n    },\n    {\n      \"year\": 2024,\n      \"claim\": \"Structures of the INTS5/8/10/15 sub-complex and an integrative model of the full Integrator on paused RNAPII clarified that INTS5/8 serves as a connector between the INTS10/15 periphery and the central scaffold, completing the modular assembly picture.\",\n      \"evidence\": \"Cryo-EM of INTS5/8/10/15 and INTS10/13/14/15 sub-complexes with integrative structural modeling\",\n      \"pmids\": [\"38823386\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"INTS5 mutational analysis within the context of the full complex has not been performed\",\n        \"Whether distinct INTS5/8-containing sub-assemblies exist in vivo with different functions is unknown\"\n      ]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"The specific molecular contribution of INTS5 beyond serving as a scaffold—whether it allosterically regulates INTS11 endonuclease or PP2A phosphatase activity, or has locus-specific targeting roles—remains to be determined.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"No INTS5-specific point mutations disrupting individual functions have been reported\",\n        \"No direct enzymatic activity attributed to INTS5 itself\",\n        \"No disease-causing mutations in INTS5 described\"\n      ]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0005198\", \"supporting_discovery_ids\": [3, 4, 5, 6, 8]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005654\", \"supporting_discovery_ids\": [0, 2]},\n      {\"term_id\": \"GO:0005634\", \"supporting_discovery_ids\": [0, 7]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-74160\", \"supporting_discovery_ids\": [0, 2, 6, 8]},\n      {\"term_id\": \"R-HSA-8953854\", \"supporting_discovery_ids\": [0, 1]}\n    ],\n    \"complexes\": [\n      \"Integrator complex\",\n      \"INTAC (Integrator-PP2A)\",\n      \"INTS5/8 module\"\n    ],\n    \"partners\": [\n      \"INTS8\",\n      \"INTS10\",\n      \"INTS15\",\n      \"INTS4\",\n      \"INTS9\",\n      \"INTS11\",\n      \"PP2A\"\n    ],\n    \"other_free_text\": []\n  }\n}\n```"}