Affinage

CPSF2

Cleavage and polyadenylation specificity factor subunit 2 · UniProt Q9P2I0

Length
782 aa
Mass
88.5 kDa
Annotated
2026-06-09
12 papers in source corpus 7 papers cited in narrative 7 extracted findings
Cross-family judge vs UniProt: Affinage preferred faithfulness: 4/4 claims corpus-supported (100%)

Mechanistic narrative

Synthesis pass · prose summary of the discoveries below

CPSF2 (CPSF100) is a catalytically inactive metallo-beta-lactamase (MBL)-fold subunit of the cleavage machinery for both histone and polyadenylated pre-mRNA 3'-end processing (PMID:19450530, PMID:18688255). Although its own histidine motif is degenerate and it cannot cleave RNA, conserved residues within its MBL motifs are required to build the active endonuclease together with CPSF73, contributing structurally in a manner analogous to RNase Z and RNase J (PMID:18688255). Structurally, the C-terminal domains of CPSF100 and CPSF73 form a stable heterodimer through extensive inter-protein contacts, and CPSF73's CTD3 binds Symplekin, defining the trimeric core cleavage complex that, together with histone-specific factors, is required for 3'-end processing and associates co-transcriptionally with histone genes (PMID:19450530, PMID:37989222). Beyond canonical 3'-end formation, CPSF2 participates in alternative cleavage of immediate early genes through a serum-stimulation-dependent interaction with the TREX component THOC5, which recruits CPSF2 to 3'-UTR targets including Myc and Smad7 (PMID:25274738), and acts as a direct RNA-binding cofactor influencing genome-wide alternative splicing (PMID:26697379).

Mechanistic history

Synthesis pass · year-by-year structured walk · 7 steps
  1. 2005 Medium

    Established that CPSF2 has a nuclear-restricted localization and can engage processing partners outside the canonical CPSF complex, raising the possibility of substrate-specific 3'-end processing roles.

    Evidence Co-IP, subcellular fractionation, and RNAi with cell-cycle readout in HeLa cells

    PMID:15684398

    Open questions at the time
    • The reported lack of interaction with CPSF73/CPSF160 conflicts with later heterodimer structures and was not reconciled
    • Functional substrate set of the RC-68 complex not defined
    • Single-lab observation without reciprocal validation
  2. 2008 High

    Resolved how a catalytically dead subunit contributes to catalysis by showing CPSF100's MBL motifs are structurally required to assemble the active CPSF73 endonuclease.

    Evidence In vitro MBL active-site mutagenesis of both subunits with reconstituted histone pre-mRNA cleavage assay

    PMID:18688255

    Open questions at the time
    • Does not resolve the atomic geometry of the assembled active site
    • Contribution to polyadenylated substrate cleavage tested less directly
  3. 2009 High

    Defined the CPSF100-CPSF73-Symplekin core as a stable subcomplex required for both histone and polyadenylated pre-mRNA processing and showed it associates co-transcriptionally with target genes.

    Evidence Reciprocal co-IP, RNAi knockdown with processing phenotype, and ChIP in Drosophila

    PMID:19450530

    Open questions at the time
    • Mechanism distinguishing histone versus polyadenylated substrate routing not defined
    • Stoichiometry and assembly order of the trimeric core not established
  4. 2014 Medium

    Identified a stimulus-dependent route for CPSF2 recruitment, linking it to TREX-mediated 3'-end processing of immediate early genes.

    Evidence Co-IP interactome with THOC5 as bait, RNA/chromatin IP, and THOC5 RNAi with transcriptome analysis

    PMID:25274738

    Open questions at the time
    • Whether the THOC5 interaction is direct is not established
    • How serum stimulation triggers complex formation is unknown
    • Single lab
  5. 2014 Low

    Suggested a cellular consequence of CPSF2 loss in cancer, linking depletion to increased invasion and stem cell marker expansion.

    Evidence RNAi knockdown in thyroid cancer cell lines with invasion assay and stem cell marker flow cytometry

    PMID:24654752

    Open questions at the time
    • Phenotypic knockdown without molecular mechanism or pathway placement
    • No connection drawn to 3'-end processing function
    • Single lab, no in vivo validation
  6. 2015 Medium

    Extended CPSF2 function beyond 3'-end cleavage by mapping its direct RNA contacts and showing it influences genome-wide alternative splicing as an RBFOX2 cofactor.

    Evidence iCLIP for direct RNA targets paired with RNA-seq after CPSF2 knockdown

    PMID:26697379

    Open questions at the time
    • Direct physical RBFOX2 interaction not biochemically detailed in abstract
    • Mechanism coupling splicing regulation to 3'-end processing role unknown
    • Single lab
  7. 2023 High

    Provided the molecular architecture of the core cleavage complex by solving the CPSF73-CPSF100 C-terminal heterodimer and defining the CPSF73 surface that binds Symplekin.

    Evidence NMR solution structure of the minimal C-terminal heterodimer from E. cuniculi with biochemical Symplekin binding assays and comparative modeling

    PMID:37989222

    Open questions at the time
    • Structure of the full-length assembled endonuclease on RNA substrate not determined
    • Solved in a microsporidian ortholog rather than the human complex

Open questions

Synthesis pass · forward-looking unresolved questions
  • How CPSF2's distinct activities — core endonuclease assembly, THOC5-dependent immediate early gene processing, and RBFOX2-associated splicing regulation — are coordinated and substrate-selected remains unresolved.
  • No structure of the catalytically active complex engaged with substrate RNA
  • Mechanism selecting histone versus polyadenylated versus alternatively spliced targets unknown
  • Direct versus indirect nature of THOC5 and RBFOX2 associations unsettled

Mechanism profile

Synthesis pass · controlled-vocabulary classification · explore literature graph →
Molecular activity
GO:0005198 structural molecule activity 2 GO:0140098 catalytic activity, acting on RNA 2 GO:0003723 RNA binding 1
Localization
GO:0005634 nucleus 1
Pathway
R-HSA-8953854 Metabolism of RNA 3 R-HSA-74160 Gene expression (Transcription) 2
Complex memberships
CPSF cleavage core (CPSF100-CPSF73-Symplekin)

Evidence

Reading pass · 7 per-paper findings extracted from the source corpus
Year Finding Method Journal Conf PMIDs
2009 CPSF100 (CPSF2), CPSF73, and Symplekin form a stable core subcomplex that interacts with histone-specific processing factors and is required for both histone pre-mRNA 3'-end processing and polyadenylated pre-mRNA processing. Chromatin immunoprecipitation showed Symplekin and CPSF73 (but not CstF50) co-transcriptionally associate with histone genes, and depletion of CPSF160 or CstF64 downregulates Symplekin without affecting histone pre-mRNA processing. Co-immunoprecipitation, RNAi knockdown, chromatin immunoprecipitation (ChIP), in vivo processing assays in Drosophila Molecular cell High 19450530
2008 Conserved residues within the metallo-beta-lactamase (MBL) motifs of both CPSF73 and CPSF100 are required to assemble the active endonuclease that cleaves histone pre-mRNAs. CPSF100, though catalytically inactive itself (due to substitutions in the histidine motif), contributes structurally to the active endonuclease, analogous to RNase Z and RNase J homodimers. In vitro point mutagenesis of conserved MBL residues in both proteins, in vitro histone pre-mRNA cleavage assay EMBO reports High 18688255
2005 CPSF100 (CPSF2) is exclusively nuclear, does not interact with CPSF73 or CPSF160, and forms a distinct complex with RC-68 (a CPSF73 homolog) that is independent of the canonical CPSF complex, suggesting a role in 3'-end processing of a subset of pre-mRNAs distinct from bulk polyadenylation. Co-immunoprecipitation, subcellular fractionation/localization, RNAi knockdown with cell-cycle phenotype readout in HeLa cells Molecular and cellular biology Medium 15684398
2014 CPSF100 (CPSF2) forms a serum-stimulation-dependent complex with THOC5 (a TREX complex member), and THOC5 is required for recruitment of CPSF100 to the 3'-UTR of immediate early gene targets (including Myc and Smad7), controlling their 3'-end processing and alternative cleavage. Co-immunoprecipitation (interactome analysis using THOC5 as bait), chromatin/RNA immunoprecipitation, RNAi knockdown of THOC5 with transcriptome analysis Nucleic acids research Medium 25274738
2023 The C-terminal domains (CTD1 and CTD2) of CPSF73 and CPSF100 form a stable heterodimer with extensive inter-protein contacts; CTD2 of both proteins resembles TATA-box binding protein (TBP) domains. The CTD3 domain of CPSF73 (also a TBP-fold domain, connected by a flexible linker) is required for binding Symplekin, defining the molecular architecture of the trimeric core cleavage complex. NMR solution structure determination of minimal CPSF73–CPSF100 C-terminal heterodimer from E. cuniculi, biochemical binding assays for Symplekin interaction, comparative structural modeling Open biology High 37989222
2014 Knockdown of CPSF2 in thyroid cancer cells increased cellular invasion 1.8- to 3.2-fold and expanded cancer stem cell markers (CD44 and CD133 expression), establishing a functional role for CPSF2 in suppressing invasiveness. RNAi knockdown in thyroid cancer cell lines, invasion assay, flow cytometry/immunostaining for stem cell markers The Journal of clinical endocrinology and metabolism Low 24654752
2015 Genome-wide iCLIP identified direct RNA-binding targets of CPSF2, and CPSF2 knockdown altered alternative splicing events genome-wide, indicating CPSF2 acts as a direct RNA-binding cofactor of RBFOX2 in regulating alternative splicing. Individual-nucleotide resolution UV cross-linking and immunoprecipitation (iCLIP), paired-end RNA-seq after CPSF2 RNAi knockdown Genomics data Medium 26697379

Source papers

Stage 0 corpus · 12 papers · ranked by NIH iCite citations
Year Title Journal Citations PMID
2009 A core complex of CPSF73, CPSF100, and Symplekin may form two different cleavage factors for processing of poly(A) and histone mRNAs. Molecular cell 107 19450530
2005 A CPSF-73 homologue is required for cell cycle progression but not cell growth and interacts with a protein having features of CPSF-100. Molecular and cellular biology 85 15684398
2008 Conserved motifs in both CPSF73 and CPSF100 are required to assemble the active endonuclease for histone mRNA 3'-end maturation. EMBO reports 66 18688255
2014 THOC5 controls 3'end-processing of immediate early genes via interaction with polyadenylation specific factor 100 (CPSF100). Nucleic acids research 33 25274738
2006 A serendipitous discovery that in situ proteolysis is essential for the crystallization of yeast CPSF-100 (Ydh1p). Acta crystallographica. Section F, Structural biology and crystallization communications 22 17012808
2014 Loss of CPSF2 expression is associated with increased thyroid cancer cellular invasion and cancer stem cell population, and more aggressive disease. The Journal of clinical endocrinology and metabolism 18 24654752
2021 Serum anti-DIDO1, anti-CPSF2, and anti-FOXJ2 antibodies as predictive risk markers for acute ischemic stroke. BMC medicine 17 34103026
2015 Negative Expression of CPSF2 Predicts a Poorer Clinical Outcome in Patients with Papillary Thyroid Carcinoma. Thyroid : official journal of the American Thyroid Association 15 26148673
2015 Global analysis of CPSF2-mediated alternative splicing: Integration of global iCLIP and transcriptome profiling data. Genomics data 5 26697379
2024 SIZ1-mediated SUMOylation of CPSF100 promotes plant thermomorphogenesis by controlling alternative polyadenylation. Molecular plant 4 39066483
2023 Molecular details of the CPSF73-CPSF100 C-terminal heterodimer and interaction with Symplekin. Open biology 3 37989222
2023 1H, 15N and 13C resonance assignments of a minimal CPSF73-CPSF100 C-terminal heterodimer. Biomolecular NMR assignments 1 36723825

Missed literature

Know a paper Affinage missed for CPSF2? Flag it for the maintainers and the community.

No submissions yet.