{"gene":"FAM50A","run_date":"2026-06-09T23:54:43","timeline":{"discoveries":[{"year":2020,"finding":"FAM50A missense variants cause Armfield X-linked intellectual disability (XLID) syndrome; fam50a knockout zebrafish show abnormal neurogenesis and craniofacial patterning with augmented spliceosome mRNAs, depletion of neurodevelopmental transcripts, and a preponderance of 3' alternative splicing events, placing FAM50A as a component of the spliceosome C complex involved in mRNA processing during development.","method":"Patient variant identification, zebrafish KO model (RNA-seq, in vivo complementation assays), protein-protein interaction data, transcriptomics from patient-derived cell lines","journal":"Nature communications","confidence":"High","confidence_rationale":"Tier 2 / Strong — multiple orthogonal methods (in vivo KO model, RNA-seq, complementation assays, PPI data, patient cell lines) in a single rigorous study","pmids":["32703943"],"is_preprint":false},{"year":2017,"finding":"FAM50A localizes to the nucleus of ameloblasts and physically interacts with Runx2, synergistically increasing Ambn transactivation and enhancing Runx2 binding affinity to the Ambn promoter; forced expression increases enamel matrix protein gene expression and mineralization, while knockdown reduces these effects.","method":"Fluorescence microscopy (nuclear localization), Co-IP (FAM50A–Runx2 interaction), promoter transactivation assays, overexpression and knockdown in mouse ameloblast cell line (mALCs)","journal":"Journal of cellular physiology","confidence":"Medium","confidence_rationale":"Tier 2 / Moderate — Co-IP, fluorescence localization, and functional reporter assays in a single lab study","pmids":["28574578"],"is_preprint":false},{"year":1999,"finding":"The X-linked XAP5 (FAM50A) gene is ubiquitously expressed, contains 13 exons, and gave rise to an autosomal intronless retroposon XAP-5-like (X5L); XAP5 and X5L show differential expression in testis, consistent with the hypothesis that X5L may compensate for XAP5 silencing during spermatogenesis.","method":"Phylogenetic analysis, expression profiling across tissues, genomic structure determination","journal":"Genomics","confidence":"Low","confidence_rationale":"Tier 4 / Weak — expression-based characterization and genomic analysis without direct functional/mechanistic assay of the protein","pmids":["10534398"],"is_preprint":false},{"year":1997,"finding":"The XAP5 (FAM50A) gene encodes a 339-amino-acid nuclear protein with a predicted nuclear localization signal, contains runs of CCG repeats in the 5' UTR, spans 13 exons over 6.5 kb at Xq28, and exhibits markedly enhanced expression in fetal tissues.","method":"Full-length cDNA isolation, genomic structure mapping, population polymorphism analysis, expression profiling","journal":"Genomics","confidence":"Low","confidence_rationale":"Tier 4 / Weak — purely descriptive genomic/expression characterization without functional mechanistic experiment","pmids":["9339379"],"is_preprint":false},{"year":2025,"finding":"FAM50A is a component of the spliceosome complex C and is required for KSHV-mediated oncogenic transformation; FAM50A knockout alters SHP2 pre-mRNA splicing, promoting a SHP2 isoform with enhanced phosphatase activity, which reduces STAT3 Y705 phosphorylation in KSHV-transformed cells, thereby suppressing STAT3 activation and cell proliferation/transformation.","method":"CRISPR-Cas9 screening and knockout, transcriptomic (RNA-seq) splicing analysis, enzymatic activity assays (SHP2 isoform), phosphorylation assays (STAT3 Y705), tumorigenesis assays","journal":"mBio","confidence":"High","confidence_rationale":"Tier 2 / Strong — CRISPR KO with orthogonal mechanistic follow-up (splicing profiling, isoform enzymatic activity, phosphorylation readout) replicated in preprint and peer-reviewed publication","pmids":["40503897","40166334"],"is_preprint":false},{"year":2025,"finding":"FAM50A forms a complex with C9ORF78 specifically at the S121 residue of C9ORF78, and this complex enhances ASNS transcription to promote L-asparagine biosynthesis, driving breast cancer brain metastasis; genetic suppression of FAM50A or pharmacological inhibition of asparagine synthesis counteracts brain metastasis.","method":"Co-immunoprecipitation (FAM50A–C9ORF78 complex), site-specific mutagenesis (S121), ASNS transcription assays, asparagine biosynthesis assays, in vivo metastasis models, genetic knockdown","journal":"Science advances","confidence":"Medium","confidence_rationale":"Tier 2 / Moderate — Co-IP with site-specific residue identification, transcriptional and metabolic functional readouts, single lab study","pmids":["40531994"],"is_preprint":false},{"year":2023,"finding":"FAM50A knockdown in cancer cells causes DNA damage, induces interferon beta and interleukin-6 expression, and represses proliferation, invasion, and migration; FAM50A encodes a nuclear protein involved in mRNA processing.","method":"FAM50A knockdown with phenotypic readouts (DNA damage assays, cytokine expression, proliferation, invasion, migration assays)","journal":"Medical oncology","confidence":"Low","confidence_rationale":"Tier 3 / Weak — single lab, knockdown with cellular phenotype but limited pathway mechanistic resolution","pmids":["37393403"],"is_preprint":false}],"current_model":"FAM50A (XAP5) is a nuclear protein and component of the spliceosome C complex that regulates 3' alternative splice site selection; disease-causing missense variants impair its splicing function during neurodevelopment (Armfield XLID syndrome), it modulates SHP2 isoform production to sustain STAT3 signaling in KSHV-transformed cells, it forms a complex with C9ORF78 to transcriptionally upregulate asparagine synthetase (ASNS) promoting brain metastasis, and it physically interacts with Runx2 in the nucleus to enhance ameloblast differentiation."},"narrative":{"mechanistic_narrative":"FAM50A (XAP5) is a ubiquitously expressed nuclear protein that functions as a component of the spliceosome C complex and regulates pre-mRNA processing during development [PMID:32703943]. Loss of FAM50A skews splice-site selection toward 3' alternative splicing events, depleting neurodevelopmental transcripts while augmenting spliceosomal mRNAs, and missense variants in FAM50A cause Armfield X-linked intellectual disability syndrome with abnormal neurogenesis and craniofacial patterning [PMID:32703943]. Its splicing activity has downstream consequences in disease contexts: in KSHV-transformed cells FAM50A controls SHP2 pre-mRNA splicing such that its loss favors a high-activity SHP2 isoform, lowering STAT3 Y705 phosphorylation and suppressing transformation and proliferation [PMID:40503897, PMID:40166334]. FAM50A also acts in the nucleus through partner interactions — it forms a complex with C9ORF78 at C9ORF78 residue S121 to enhance ASNS transcription and L-asparagine biosynthesis, driving breast cancer brain metastasis [PMID:40531994], and it physically interacts with Runx2 to enhance Ambn promoter transactivation and enamel matrix gene expression during ameloblast differentiation [PMID:28574578]. Consistent with a role in genome and transcriptome integrity, FAM50A depletion in cancer cells provokes DNA damage and an inflammatory interferon-β/IL-6 response alongside reduced proliferation, invasion, and migration [PMID:37393403].","teleology":[{"year":1997,"claim":"Before any functional assignment, the basic molecular identity of the gene needed defining; this established XAP5/FAM50A as an Xq28-encoded nuclear protein with developmental expression bias.","evidence":"Full-length cDNA isolation, genomic mapping, and expression profiling showing a predicted NLS and enhanced fetal expression","pmids":["9339379"],"confidence":"Low","gaps":["Purely descriptive — no functional or mechanistic assay of the protein","Nuclear localization inferred from predicted NLS, not demonstrated","No molecular activity defined"]},{"year":1999,"claim":"The evolutionary and expression context was extended, raising the question of how the X-linked gene is regulated relative to its retroposon, particularly in germline tissue.","evidence":"Phylogenetic analysis, genomic structure determination, and tissue expression profiling identifying the autosomal intronless retroposon X5L with differential testis expression","pmids":["10534398"],"confidence":"Low","gaps":["Compensation hypothesis untested functionally","No protein-level mechanism","Relationship of X5L to FAM50A function unresolved"]},{"year":2017,"claim":"The first functional partner was identified, addressing whether FAM50A acts on specific transcriptional programs; it showed FAM50A enhances Runx2-driven enamel gene transactivation in ameloblasts.","evidence":"Co-IP, fluorescence microscopy, and promoter transactivation reporter assays with overexpression/knockdown in mouse ameloblast cells","pmids":["28574578"],"confidence":"Medium","gaps":["Single-lab study without reciprocal validation in other systems","Mechanism of Runx2 affinity enhancement unresolved","Connection to spliceosomal role not established"]},{"year":2020,"claim":"The core molecular function and disease basis were established together, answering what process FAM50A serves: it is a spliceosome C complex component whose loss disrupts 3' splice-site selection and developmental transcript pools, and its missense variants cause Armfield XLID.","evidence":"Patient variant identification, zebrafish knockout with RNA-seq and in vivo complementation, PPI data, and patient-cell transcriptomics","pmids":["32703943"],"confidence":"High","gaps":["Precise biochemical step within the C complex not defined","How variants impair splicing at the molecular level not resolved","Target transcript selectivity rules unknown"]},{"year":2023,"claim":"A consequence of FAM50A loss in cancer cells was probed, linking its mRNA-processing role to genome stability and innate immune signaling.","evidence":"Knockdown with DNA damage, cytokine expression, and proliferation/invasion/migration phenotype assays","pmids":["37393403"],"confidence":"Low","gaps":["Limited pathway mechanistic resolution","Causal link between splicing defects and DNA damage not established","Single-lab phenotypic study"]},{"year":2025,"claim":"Two studies connected FAM50A's splicing/nuclear activity to disease drivers, showing isoform-level control of SHP2/STAT3 in viral oncogenesis and a C9ORF78-dependent ASNS transcriptional axis in metastasis.","evidence":"CRISPR-Cas9 knockout with splicing, SHP2 isoform enzymatic and STAT3 phosphorylation readouts (mBio); Co-IP with S121 site-mapping, ASNS transcription/metabolic assays, and in vivo metastasis models (Science Advances)","pmids":["40503897","40166334","40531994"],"confidence":"Medium","gaps":["The two disease mechanisms (SHP2 splicing vs ASNS transcription) not integrated into one biochemical model","Whether ASNS upregulation is splicing-dependent or a separate transcriptional function unclear","C9ORF78 complex stoichiometry and structure undefined"]},{"year":null,"claim":"It remains unresolved how FAM50A's role within the spliceosome C complex mechanistically determines its specificity for 3' alternative splice sites and how that single biochemical activity gives rise to its diverse transcriptional and disease outputs.","evidence":"","pmids":[],"confidence":"High","gaps":["No structural model of FAM50A within the C complex","No defined rule for target transcript/splice-site selection","Unclear whether transcriptional partner functions (Runx2, C9ORF78) are separable from spliceosomal activity"]}],"mechanism_profile":{"molecular_activity":[{"term_id":"GO:0140098","term_label":"catalytic activity, acting on RNA","supporting_discovery_ids":[0,4]},{"term_id":"GO:0140110","term_label":"transcription regulator activity","supporting_discovery_ids":[1,5]}],"localization":[{"term_id":"GO:0005634","term_label":"nucleus","supporting_discovery_ids":[0,1,3]}],"pathway":[{"term_id":"R-HSA-8953854","term_label":"Metabolism of RNA","supporting_discovery_ids":[0,4]}],"complexes":["spliceosome C complex"],"partners":["RUNX2","C9ORF78"],"other_free_text":[]}},"prefetch_data":{"uniprot":{"accession":"Q14320","full_name":"Protein FAM50A","aliases":["Protein HXC-26","Protein XAP-5"],"length_aa":339,"mass_kda":40.2,"function":"Probably involved in the regulation of pre-mRNA splicing","subcellular_location":"Nucleus","url":"https://www.uniprot.org/uniprotkb/Q14320/entry"},"depmap":{"release":"DepMap","has_data":true,"is_common_essential":false,"resolved_as":"","url":"https://depmap.org/portal/gene/FAM50A","classification":"Not Classified","n_dependent_lines":645,"n_total_lines":1208,"dependency_fraction":0.5339403973509934},"opencell":{"profiled":true,"resolved_as":"","ensg_id":"ENSG00000071859","cell_line_id":"CID001776","localizations":[{"compartment":"nucleoplasm","grade":3}],"interactors":[{"gene":"SPG21","stoichiometry":10.0},{"gene":"USP22","stoichiometry":4.0},{"gene":"ARPC2","stoichiometry":0.2},{"gene":"C9ORF78","stoichiometry":0.2},{"gene":"CAPZB","stoichiometry":0.2},{"gene":"EIF5","stoichiometry":0.2},{"gene":"PRPF8","stoichiometry":0.2},{"gene":"EAPP","stoichiometry":0.2},{"gene":"SPATS2","stoichiometry":0.2},{"gene":"EFTUD2","stoichiometry":0.2}],"url":"https://opencell.sf.czbiohub.org/target/CID001776","total_profiled":1310},"omim":[{"mim_id":"614686","title":"FAMILY WITH SEQUENCE SIMILARITY 50, MEMBER B; FAM50B","url":"https://www.omim.org/entry/614686"},{"mim_id":"300453","title":"FAMILY WITH SEQUENCE SIMILARITY 50, MEMBER A; FAM50A","url":"https://www.omim.org/entry/300453"},{"mim_id":"300261","title":"INTELLECTUAL DEVELOPMENTAL DISORDER, X-LINKED, SYNDROMIC, ARMFIELD TYPE; MRXSA","url":"https://www.omim.org/entry/300261"}],"hpa":{"profiled":true,"resolved_as":"","reliability":"Supported","locations":[{"location":"Nucleoplasm","reliability":"Supported"}],"tissue_specificity":"Low tissue specificity","tissue_distribution":"Detected in all","driving_tissues":[],"url":"https://www.proteinatlas.org/search/FAM50A"},"hgnc":{"alias_symbol":["DXS9928E","XAP5","HXC-26","9F"],"prev_symbol":[]},"alphafold":{"accession":"Q14320","domains":[{"cath_id":"3.10.20.90","chopping":"199-282_300-325","consensus_level":"high","plddt":86.9747,"start":199,"end":325}],"viewer_url":"https://alphafold.ebi.ac.uk/entry/Q14320","model_url":"https://alphafold.ebi.ac.uk/files/AF-Q14320-F1-model_v6.cif","pae_url":"https://alphafold.ebi.ac.uk/files/AF-Q14320-F1-predicted_aligned_error_v6.png","plddt_mean":75.88},"mouse_models":{"mgi_url":"https://www.informatics.jax.org/marker/summary?nomen=FAM50A","jax_strain_url":"https://www.jax.org/strain/search?query=FAM50A"},"sequence":{"accession":"Q14320","fasta_url":"https://rest.uniprot.org/uniprotkb/Q14320.fasta","uniprot_url":"https://www.uniprot.org/uniprotkb/Q14320/entry","alphafold_viewer_url":"https://alphafold.ebi.ac.uk/entry/Q14320"}},"corpus_meta":[{"pmid":"32703943","id":"PMC_32703943","title":"Mutations in FAM50A suggest that Armfield XLID syndrome is a spliceosomopathy.","date":"2020","source":"Nature communications","url":"https://pubmed.ncbi.nlm.nih.gov/32703943","citation_count":58,"is_preprint":false},{"pmid":"2433033","id":"PMC_2433033","title":"Glycoproteins distinguishing non-small cell from small cell human lung carcinoma recognized by monoclonal antibody 43-9F.","date":"1987","source":"Cancer research","url":"https://pubmed.ncbi.nlm.nih.gov/2433033","citation_count":49,"is_preprint":false},{"pmid":"2406007","id":"PMC_2406007","title":"Monoclonal antibody 43-9F as a sensitive immunohistochemical marker of carcinoma in situ of human testis.","date":"1990","source":"Cancer","url":"https://pubmed.ncbi.nlm.nih.gov/2406007","citation_count":37,"is_preprint":false},{"pmid":"10534398","id":"PMC_10534398","title":"Human and mouse XAP-5 and XAP-5-like (X5L) genes: identification of an ancient functional retroposon differentially expressed in testis.","date":"1999","source":"Genomics","url":"https://pubmed.ncbi.nlm.nih.gov/10534398","citation_count":25,"is_preprint":false},{"pmid":"7521211","id":"PMC_7521211","title":"Human tumor-associated Le(a)-Le(x) hybrid carbohydrate antigen IV3(Gal beta 1-->3[Fuc alpha 1-->4]GlcNAc)III3FucnLc4 defined by monoclonal antibody 43-9F: enzymatic synthesis, structural characterization, and comparative reactivity with various antibodies.","date":"1994","source":"Biochemistry","url":"https://pubmed.ncbi.nlm.nih.gov/7521211","citation_count":24,"is_preprint":false},{"pmid":"24957674","id":"PMC_24957674","title":"Yeast X-chromosome-associated protein 5 (Xap5) functions with H2A.Z to suppress aberrant transcripts.","date":"2014","source":"EMBO reports","url":"https://pubmed.ncbi.nlm.nih.gov/24957674","citation_count":15,"is_preprint":false},{"pmid":"9339379","id":"PMC_9339379","title":"Differential expression of XAP5, a candidate disease gene.","date":"1997","source":"Genomics","url":"https://pubmed.ncbi.nlm.nih.gov/9339379","citation_count":13,"is_preprint":false},{"pmid":"28574578","id":"PMC_28574578","title":"The Fam50a positively regulates ameloblast differentiation via interacting with Runx2.","date":"2017","source":"Journal of cellular physiology","url":"https://pubmed.ncbi.nlm.nih.gov/28574578","citation_count":11,"is_preprint":false},{"pmid":"32106543","id":"PMC_32106543","title":"The Role of p53-Mediated Signaling in the Therapeutic Response of Colorectal Cancer to 9F, a Spermine-Modified Naphthalene Diimide Derivative.","date":"2020","source":"Cancers","url":"https://pubmed.ncbi.nlm.nih.gov/32106543","citation_count":10,"is_preprint":false},{"pmid":"36834630","id":"PMC_36834630","title":"Proto-Oncogene FAM50A Can Regulate the Immune Microenvironment and Development of Hepatocellular Carcinoma In Vitro and In Vivo.","date":"2023","source":"International journal of molecular sciences","url":"https://pubmed.ncbi.nlm.nih.gov/36834630","citation_count":9,"is_preprint":false},{"pmid":"35065375","id":"PMC_35065375","title":"XAP5 CIRCADIAN TIMEKEEPER specifically modulates 3' splice site recognition and is important for circadian clock regulation partly by alternative splicing of LHY and TIC.","date":"2022","source":"Plant physiology and biochemistry : PPB","url":"https://pubmed.ncbi.nlm.nih.gov/35065375","citation_count":9,"is_preprint":false},{"pmid":"1536722","id":"PMC_1536722","title":"Monoclonal antibody 43-9F: an immunohistochemical marker of embryonal carcinoma of the testis.","date":"1992","source":"APMIS : acta pathologica, microbiologica, et immunologica Scandinavica","url":"https://pubmed.ncbi.nlm.nih.gov/1536722","citation_count":9,"is_preprint":false},{"pmid":"9805244","id":"PMC_9805244","title":"Immunohistochemical expression of monoclonal antibody 43-9F in testicular germ cell tumours.","date":"1998","source":"International journal of andrology","url":"https://pubmed.ncbi.nlm.nih.gov/9805244","citation_count":8,"is_preprint":false},{"pmid":"36974904","id":"PMC_36974904","title":"XAP5 CIRCADIAN TIMEKEEPER regulates RNA splicing and the circadian clock by genetically separable pathways.","date":"2023","source":"Plant physiology","url":"https://pubmed.ncbi.nlm.nih.gov/36974904","citation_count":7,"is_preprint":false},{"pmid":"38205256","id":"PMC_38205256","title":"Regulatory network in heat stress response in parasitoid wasp focusing on Xap5 heat stress regulator.","date":"2023","source":"iScience","url":"https://pubmed.ncbi.nlm.nih.gov/38205256","citation_count":7,"is_preprint":false},{"pmid":"37393403","id":"PMC_37393403","title":"Upregulation of FAM50A promotes cancer development.","date":"2023","source":"Medical oncology (Northwood, London, England)","url":"https://pubmed.ncbi.nlm.nih.gov/37393403","citation_count":6,"is_preprint":false},{"pmid":"9039504","id":"PMC_9039504","title":"Isolation and analysis of a novel gene, HXC-26, adjacent to the rab GDP dissociation inhibitor gene located at human chromosome Xq28 region.","date":"1996","source":"DNA research : an international journal for rapid publication of reports on genes and genomes","url":"https://pubmed.ncbi.nlm.nih.gov/9039504","citation_count":5,"is_preprint":false},{"pmid":"40244117","id":"PMC_40244117","title":"ROS-Responsive Fluorinated Oxalate Nanomedicine for Dual Chemiluminescence/1⁹F MRI Imaging and Targeted Drug Release.","date":"2025","source":"International journal of molecular sciences","url":"https://pubmed.ncbi.nlm.nih.gov/40244117","citation_count":5,"is_preprint":false},{"pmid":"40531994","id":"PMC_40531994","title":"FAM50A drives breast cancer brain metastasis through interaction with C9ORF78 to enhance ʟ-asparagine production.","date":"2025","source":"Science advances","url":"https://pubmed.ncbi.nlm.nih.gov/40531994","citation_count":4,"is_preprint":false},{"pmid":"40503897","id":"PMC_40503897","title":"KSHV reprograms host RNA splicing via FAM50A to activate STAT3 and drive oncogenic cellular transformation.","date":"2025","source":"mBio","url":"https://pubmed.ncbi.nlm.nih.gov/40503897","citation_count":0,"is_preprint":false},{"pmid":"40166334","id":"PMC_40166334","title":"KSHV Reprograms Host RNA Splicing via FAM50A to Activate STAT3 and Drive Oncogenic Cellular Transformation.","date":"2025","source":"bioRxiv : the preprint server for biology","url":"https://pubmed.ncbi.nlm.nih.gov/40166334","citation_count":0,"is_preprint":false},{"pmid":"42014049","id":"PMC_42014049","title":"XAP5 CIRCADIAN TIMEKEEPER coordinates circadian rhythms and anthocyanin biosynthesis independently of splicing.","date":"2026","source":"Plant physiology","url":"https://pubmed.ncbi.nlm.nih.gov/42014049","citation_count":0,"is_preprint":false},{"pmid":null,"id":"bio_10.1101_2025.11.28.691250","title":"An ancient transcription factor functions as the master regulator of primary cilia formation","date":"2025-11-29","source":"bioRxiv","url":"https://doi.org/10.1101/2025.11.28.691250","citation_count":0,"is_preprint":true},{"pmid":null,"id":"bio_10.1101_2025.07.25.666886","title":"Pangenome of U.S. ex-PVP and Wild Sorghum Reveals Structural Variants and Selective Sweeps Shaping Adaptation and Trait Improvement","date":"2025-07-27","source":"bioRxiv","url":"https://doi.org/10.1101/2025.07.25.666886","citation_count":0,"is_preprint":true}],"cost":{"stage1":{"model":"claude-sonnet-4-6","input_tokens":13394,"output_tokens":1823,"usd":0.033764,"stage1_stop_reason":"end_turn"},"stage2":{"model":"claude-opus-4-8","input_tokens":8704,"output_tokens":2521,"usd":0.053273,"stage2_stop_reason":"end_turn"},"total_usd":0.087037,"stage1_batch_id":"msgbatch_01AsR7g8dGTEiEw9U83bJeWn","stage2_batch_id":"msgbatch_01YV5ZyxtFNUZEjxRcpd9jzb","note":"batch pricing = 50% of standard"},"stage1_raw":"```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2020,\n      \"finding\": \"FAM50A missense variants cause Armfield X-linked intellectual disability (XLID) syndrome; fam50a knockout zebrafish show abnormal neurogenesis and craniofacial patterning with augmented spliceosome mRNAs, depletion of neurodevelopmental transcripts, and a preponderance of 3' alternative splicing events, placing FAM50A as a component of the spliceosome C complex involved in mRNA processing during development.\",\n      \"method\": \"Patient variant identification, zebrafish KO model (RNA-seq, in vivo complementation assays), protein-protein interaction data, transcriptomics from patient-derived cell lines\",\n      \"journal\": \"Nature communications\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — multiple orthogonal methods (in vivo KO model, RNA-seq, complementation assays, PPI data, patient cell lines) in a single rigorous study\",\n      \"pmids\": [\"32703943\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2017,\n      \"finding\": \"FAM50A localizes to the nucleus of ameloblasts and physically interacts with Runx2, synergistically increasing Ambn transactivation and enhancing Runx2 binding affinity to the Ambn promoter; forced expression increases enamel matrix protein gene expression and mineralization, while knockdown reduces these effects.\",\n      \"method\": \"Fluorescence microscopy (nuclear localization), Co-IP (FAM50A–Runx2 interaction), promoter transactivation assays, overexpression and knockdown in mouse ameloblast cell line (mALCs)\",\n      \"journal\": \"Journal of cellular physiology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Moderate — Co-IP, fluorescence localization, and functional reporter assays in a single lab study\",\n      \"pmids\": [\"28574578\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 1999,\n      \"finding\": \"The X-linked XAP5 (FAM50A) gene is ubiquitously expressed, contains 13 exons, and gave rise to an autosomal intronless retroposon XAP-5-like (X5L); XAP5 and X5L show differential expression in testis, consistent with the hypothesis that X5L may compensate for XAP5 silencing during spermatogenesis.\",\n      \"method\": \"Phylogenetic analysis, expression profiling across tissues, genomic structure determination\",\n      \"journal\": \"Genomics\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 4 / Weak — expression-based characterization and genomic analysis without direct functional/mechanistic assay of the protein\",\n      \"pmids\": [\"10534398\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 1997,\n      \"finding\": \"The XAP5 (FAM50A) gene encodes a 339-amino-acid nuclear protein with a predicted nuclear localization signal, contains runs of CCG repeats in the 5' UTR, spans 13 exons over 6.5 kb at Xq28, and exhibits markedly enhanced expression in fetal tissues.\",\n      \"method\": \"Full-length cDNA isolation, genomic structure mapping, population polymorphism analysis, expression profiling\",\n      \"journal\": \"Genomics\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 4 / Weak — purely descriptive genomic/expression characterization without functional mechanistic experiment\",\n      \"pmids\": [\"9339379\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2025,\n      \"finding\": \"FAM50A is a component of the spliceosome complex C and is required for KSHV-mediated oncogenic transformation; FAM50A knockout alters SHP2 pre-mRNA splicing, promoting a SHP2 isoform with enhanced phosphatase activity, which reduces STAT3 Y705 phosphorylation in KSHV-transformed cells, thereby suppressing STAT3 activation and cell proliferation/transformation.\",\n      \"method\": \"CRISPR-Cas9 screening and knockout, transcriptomic (RNA-seq) splicing analysis, enzymatic activity assays (SHP2 isoform), phosphorylation assays (STAT3 Y705), tumorigenesis assays\",\n      \"journal\": \"mBio\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — CRISPR KO with orthogonal mechanistic follow-up (splicing profiling, isoform enzymatic activity, phosphorylation readout) replicated in preprint and peer-reviewed publication\",\n      \"pmids\": [\"40503897\", \"40166334\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2025,\n      \"finding\": \"FAM50A forms a complex with C9ORF78 specifically at the S121 residue of C9ORF78, and this complex enhances ASNS transcription to promote L-asparagine biosynthesis, driving breast cancer brain metastasis; genetic suppression of FAM50A or pharmacological inhibition of asparagine synthesis counteracts brain metastasis.\",\n      \"method\": \"Co-immunoprecipitation (FAM50A–C9ORF78 complex), site-specific mutagenesis (S121), ASNS transcription assays, asparagine biosynthesis assays, in vivo metastasis models, genetic knockdown\",\n      \"journal\": \"Science advances\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Moderate — Co-IP with site-specific residue identification, transcriptional and metabolic functional readouts, single lab study\",\n      \"pmids\": [\"40531994\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2023,\n      \"finding\": \"FAM50A knockdown in cancer cells causes DNA damage, induces interferon beta and interleukin-6 expression, and represses proliferation, invasion, and migration; FAM50A encodes a nuclear protein involved in mRNA processing.\",\n      \"method\": \"FAM50A knockdown with phenotypic readouts (DNA damage assays, cytokine expression, proliferation, invasion, migration assays)\",\n      \"journal\": \"Medical oncology\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 3 / Weak — single lab, knockdown with cellular phenotype but limited pathway mechanistic resolution\",\n      \"pmids\": [\"37393403\"],\n      \"is_preprint\": false\n    }\n  ],\n  \"current_model\": \"FAM50A (XAP5) is a nuclear protein and component of the spliceosome C complex that regulates 3' alternative splice site selection; disease-causing missense variants impair its splicing function during neurodevelopment (Armfield XLID syndrome), it modulates SHP2 isoform production to sustain STAT3 signaling in KSHV-transformed cells, it forms a complex with C9ORF78 to transcriptionally upregulate asparagine synthetase (ASNS) promoting brain metastasis, and it physically interacts with Runx2 in the nucleus to enhance ameloblast differentiation.\"\n}\n```","stage2_raw":"{\n  \"mechanistic_narrative\": \"FAM50A (XAP5) is a ubiquitously expressed nuclear protein that functions as a component of the spliceosome C complex and regulates pre-mRNA processing during development [#0]. Loss of FAM50A skews splice-site selection toward 3' alternative splicing events, depleting neurodevelopmental transcripts while augmenting spliceosomal mRNAs, and missense variants in FAM50A cause Armfield X-linked intellectual disability syndrome with abnormal neurogenesis and craniofacial patterning [#0]. Its splicing activity has downstream consequences in disease contexts: in KSHV-transformed cells FAM50A controls SHP2 pre-mRNA splicing such that its loss favors a high-activity SHP2 isoform, lowering STAT3 Y705 phosphorylation and suppressing transformation and proliferation [#4]. FAM50A also acts in the nucleus through partner interactions — it forms a complex with C9ORF78 at C9ORF78 residue S121 to enhance ASNS transcription and L-asparagine biosynthesis, driving breast cancer brain metastasis [#5], and it physically interacts with Runx2 to enhance Ambn promoter transactivation and enamel matrix gene expression during ameloblast differentiation [#1]. Consistent with a role in genome and transcriptome integrity, FAM50A depletion in cancer cells provokes DNA damage and an inflammatory interferon-β/IL-6 response alongside reduced proliferation, invasion, and migration [#6].\",\n  \"teleology\": [\n    {\n      \"year\": 1997,\n      \"claim\": \"Before any functional assignment, the basic molecular identity of the gene needed defining; this established XAP5/FAM50A as an Xq28-encoded nuclear protein with developmental expression bias.\",\n      \"evidence\": \"Full-length cDNA isolation, genomic mapping, and expression profiling showing a predicted NLS and enhanced fetal expression\",\n      \"pmids\": [\"9339379\"],\n      \"confidence\": \"Low\",\n      \"gaps\": [\"Purely descriptive — no functional or mechanistic assay of the protein\", \"Nuclear localization inferred from predicted NLS, not demonstrated\", \"No molecular activity defined\"]\n    },\n    {\n      \"year\": 1999,\n      \"claim\": \"The evolutionary and expression context was extended, raising the question of how the X-linked gene is regulated relative to its retroposon, particularly in germline tissue.\",\n      \"evidence\": \"Phylogenetic analysis, genomic structure determination, and tissue expression profiling identifying the autosomal intronless retroposon X5L with differential testis expression\",\n      \"pmids\": [\"10534398\"],\n      \"confidence\": \"Low\",\n      \"gaps\": [\"Compensation hypothesis untested functionally\", \"No protein-level mechanism\", \"Relationship of X5L to FAM50A function unresolved\"]\n    },\n    {\n      \"year\": 2017,\n      \"claim\": \"The first functional partner was identified, addressing whether FAM50A acts on specific transcriptional programs; it showed FAM50A enhances Runx2-driven enamel gene transactivation in ameloblasts.\",\n      \"evidence\": \"Co-IP, fluorescence microscopy, and promoter transactivation reporter assays with overexpression/knockdown in mouse ameloblast cells\",\n      \"pmids\": [\"28574578\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"Single-lab study without reciprocal validation in other systems\", \"Mechanism of Runx2 affinity enhancement unresolved\", \"Connection to spliceosomal role not established\"]\n    },\n    {\n      \"year\": 2020,\n      \"claim\": \"The core molecular function and disease basis were established together, answering what process FAM50A serves: it is a spliceosome C complex component whose loss disrupts 3' splice-site selection and developmental transcript pools, and its missense variants cause Armfield XLID.\",\n      \"evidence\": \"Patient variant identification, zebrafish knockout with RNA-seq and in vivo complementation, PPI data, and patient-cell transcriptomics\",\n      \"pmids\": [\"32703943\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Precise biochemical step within the C complex not defined\", \"How variants impair splicing at the molecular level not resolved\", \"Target transcript selectivity rules unknown\"]\n    },\n    {\n      \"year\": 2023,\n      \"claim\": \"A consequence of FAM50A loss in cancer cells was probed, linking its mRNA-processing role to genome stability and innate immune signaling.\",\n      \"evidence\": \"Knockdown with DNA damage, cytokine expression, and proliferation/invasion/migration phenotype assays\",\n      \"pmids\": [\"37393403\"],\n      \"confidence\": \"Low\",\n      \"gaps\": [\"Limited pathway mechanistic resolution\", \"Causal link between splicing defects and DNA damage not established\", \"Single-lab phenotypic study\"]\n    },\n    {\n      \"year\": 2025,\n      \"claim\": \"Two studies connected FAM50A's splicing/nuclear activity to disease drivers, showing isoform-level control of SHP2/STAT3 in viral oncogenesis and a C9ORF78-dependent ASNS transcriptional axis in metastasis.\",\n      \"evidence\": \"CRISPR-Cas9 knockout with splicing, SHP2 isoform enzymatic and STAT3 phosphorylation readouts (mBio); Co-IP with S121 site-mapping, ASNS transcription/metabolic assays, and in vivo metastasis models (Science Advances)\",\n      \"pmids\": [\"40503897\", \"40166334\", \"40531994\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"The two disease mechanisms (SHP2 splicing vs ASNS transcription) not integrated into one biochemical model\", \"Whether ASNS upregulation is splicing-dependent or a separate transcriptional function unclear\", \"C9ORF78 complex stoichiometry and structure undefined\"]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"It remains unresolved how FAM50A's role within the spliceosome C complex mechanistically determines its specificity for 3' alternative splice sites and how that single biochemical activity gives rise to its diverse transcriptional and disease outputs.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"High\",\n      \"gaps\": [\"No structural model of FAM50A within the C complex\", \"No defined rule for target transcript/splice-site selection\", \"Unclear whether transcriptional partner functions (Runx2, C9ORF78) are separable from spliceosomal activity\"]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0140098\", \"supporting_discovery_ids\": [0, 4]},\n      {\"term_id\": \"GO:0140110\", \"supporting_discovery_ids\": [1, 5]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005634\", \"supporting_discovery_ids\": [0, 1, 3]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-8953854\", \"supporting_discovery_ids\": [0, 4]}\n    ],\n    \"complexes\": [\"spliceosome C complex\"],\n    \"partners\": [\"RUNX2\", \"C9ORF78\"],\n    \"other_free_text\": []\n  }\n}","audit_flag":null,"evaluation":{"faith_supported":4,"faith_total":5,"faith_pct":80.0}}