{"gene":"SUMF1","run_date":"2026-04-28T21:42:57","timeline":{"discoveries":[{"year":2003,"finding":"SUMF1 encodes the Cα-formylglycine (FGly)-generating enzyme (FGE), which post-translationally converts a conserved cysteine residue in the active site of all sulfatases to FGly, an essential catalytic residue required for sulfatase activity. SUMF1 defines a new gene family conserved from prokaryotes to eukaryotes, with orthologs in vertebrates and a paralog SUMF2 of unknown function.","method":"Bioinformatic/phylogenetic analysis of gene family; biochemical identification of FGE as product of SUMF1","journal":"Gene","confidence":"High","confidence_rationale":"Tier 1/2 — foundational identification of enzymatic activity, replicated across multiple subsequent studies","pmids":["14563551"],"is_preprint":false},{"year":2004,"finding":"SUMF1/FGE strongly enhances the activity of co-expressed sulfatases in COS-7 cells; missense mutations in SUMF1 that cause multiple sulfatase deficiency (MSD) result in severely impaired sulfatase-enhancing activity, with some mutations showing variable effects across different sulfatases.","method":"Transient co-expression of SUMF1 mutants with individual sulfatases in COS-7 cells; enzymatic activity assays","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 — functional co-expression assays with multiple mutants and multiple sulfatase substrates, replicated across labs","pmids":["15146462"],"is_preprint":false},{"year":2007,"finding":"SUMF1 co-expression with sulfatase genes (via AAV or lentiviral vectors) enhances sulfatase activity and improves clearance of intracellular glycosaminoglycan or sulfolipid accumulation in cells from patients with five different sulfatase deficiencies (MLD, CDPX, MPS II, IIIA, VI), and in vivo in MPS-IIIA mouse muscle.","method":"AAV/lentivirus-mediated co-delivery in patient cells and mouse models; enzymatic activity assays; biochemical storage clearance assays","journal":"The Biochemical journal","confidence":"High","confidence_rationale":"Tier 2 — multiple sulfatase substrates, multiple disease models, in vitro and in vivo validation","pmids":["17206939"],"is_preprint":false},{"year":2007,"finding":"Co-delivery of SUMF1 with SGSH (sulfamidase) via AAV2/5 into the brains of MPS-IIIA mice results in synergistic increases in SGSH activity, visible reduction in lysosomal storage and inflammatory markers, and improvement in motor and cognitive functions, demonstrating SUMF1's enhancing role in the CNS context.","method":"Intraventricular AAV2/5 injection in neonatal MPS-IIIA mice; enzymatic activity assays in brain regions; histological storage markers; behavioral testing","journal":"Human molecular genetics","confidence":"High","confidence_rationale":"Tier 2 — in vivo model with multiple orthogonal readouts (biochemical, histological, behavioral)","pmids":["17725987"],"is_preprint":false},{"year":2007,"finding":"MSD is caused by hypomorphic SUMF1 mutations; all known MSD-causing SUMF1 missense mutant proteins are properly localized to the ER and of correct molecular weight, but provide only partial rescue of sulfatase activities when expressed in Sumf1 knockout MEFs. Complete loss of SUMF1 function is likely lethal in humans.","method":"Viral-mediated expression of SUMF1 mutants in Sumf1-/- MEFs; subcellular localization; enzymatic activity rescue assays","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 — KO cell rescue assay with multiple mutants, direct localization, functional readout","pmids":["17657823"],"is_preprint":false},{"year":2008,"finding":"SUMF1 is largely retained in the ER (despite lacking canonical retention signals) where it activates nascent sulfatases, and part of SUMF1 is secreted and taken up paracrinally. SUMF1 physically interacts with PDI, ERGIC-53, and ERp44: PDI couples SUMF1 ER retention and activation; ERGIC-53 favors SUMF1 export from the ER; ERp44 retrieves SUMF1 to the ER. Silencing ERGIC-53 causes proteasomal degradation of SUMF1; down-regulating ERp44 promotes SUMF1 secretion.","method":"Co-immunoprecipitation, subcellular fractionation/localization, siRNA knockdown of interactors, functional sulfatase activation assays, proteasome inhibitor experiments","journal":"Human molecular genetics","confidence":"High","confidence_rationale":"Tier 2 — reciprocal Co-IP with multiple interactors, functional knockdown of each interactor with distinct phenotypic readouts, replicated with multiple orthogonal methods","pmids":["18508857"],"is_preprint":false},{"year":2008,"finding":"SUMF1 missense mutations (p.A177P, p.W179S, p.A279V, p.R349W) do not affect ER localization of FGE but decrease specific enzymatic activity to <1–23% of wild type, and variably decrease protein stability; both reduced enzyme activity and reduced protein stability contribute to MSD clinical severity.","method":"Subcellular localization in MSD fibroblasts; FGE enzymatic activity assays; protein stability/western blot analysis","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 — functional characterization with multiple mutants using activity assays, stability assays, and localization in patient cells","pmids":["18157819"],"is_preprint":false},{"year":2011,"finding":"Clinical phenotypic severity in MSD patients correlates with both residual FGE enzymatic activity and FGE protein stability: near-complete loss of FGE activity with highly unstable protein causes neonatal severe phenotype, while high residual activity with unstable protein causes mild phenotype.","method":"FGE expression, localization, and stability analysis (western blot, immunofluorescence) in patient-derived cells; FGE activity assays; sulfatase activity measurements; clinical correlation","journal":"European journal of human genetics : EJHG","confidence":"High","confidence_rationale":"Tier 2 — multi-parameter functional analysis across 10 patients with clinical correlation; multiple orthogonal methods","pmids":["21224894"],"is_preprint":false},{"year":2019,"finding":"FGE (encoded by SUMF1) is a copper-dependent post-translational protein modifier that generates formylglycine (an aldehyde-containing amino acid) at a specific cysteine in sulfatase active sites; this formylglycine residue is essential for sulfate ester hydrolysis and can serve as a bioconjugation handle in mammalian expression systems.","method":"In vitro and mammalian cell reconstitution of FGE-mediated fGly generation; biochemical characterization; application to antibody-drug conjugate production","journal":"Methods in molecular biology (Clifton, N.J.)","confidence":"Medium","confidence_rationale":"Tier 1/2 — enzymatic mechanism established, but this paper is largely methodological; underlying mechanism well-established by prior studies","pmids":["31161504"],"is_preprint":false},{"year":2020,"finding":"A novel SUMF1 missense variant (p.A348V) produces a highly unstable FGE protein that lacks catalytic function, causing a neonatal severe form of MSD; functional analysis in cell culture confirmed loss of FGE activity.","method":"Expression of SUMF1 variant in cell culture model; FGE protein stability and activity assays; genotype-phenotype correlation","journal":"Molecular genetics & genomic medicine","confidence":"Medium","confidence_rationale":"Tier 2 — single lab, cell culture expression with activity and stability assays; supports established genotype-phenotype model","pmids":["32048457"],"is_preprint":false},{"year":2017,"finding":"A novel SUMF1 variant (p.E113K) correctly localizes to the ER but is retained intracellularly and exhibits only ~15% of wild-type FGE activity when expressed in immortalized MSD cells; structural modeling predicts destabilization of secondary structure affecting disulfide bridging and the active site groove.","method":"Cell culture expression of FGE variant; subcellular localization; FGE activity assay (steroid sulfatase activation); structural analysis based on crystal structure","journal":"Molecular genetics and metabolism","confidence":"Medium","confidence_rationale":"Tier 2/3 — single lab, functional activity assay + localization; structural prediction is computational","pmids":["28566233"],"is_preprint":false},{"year":2024,"finding":"SUMF1 (FGE) does not contain ER retention sequences and relies on ERp44 engagement for proper inter-compartmental distribution in the early secretory pathway, placing it in the same regulatory class as Ero1α, Ero1β, Prx4, and ERAP1.","method":"Biochemical interaction analysis in ER pathway context; comparison to known ERp44 clients","journal":"bioRxiv","confidence":"Low","confidence_rationale":"Tier 3 — preprint, inference from interaction data; mechanism of ERp44-mediated retention was established earlier in PMID:18508857","pmids":[],"is_preprint":true},{"year":2025,"finding":"In zebrafish, sumf1 (ortholog of human SUMF1) acts as a positive regulator of sulfatase activity; sumf1 and sumf2 expression levels invert at gastrulation onset, predicting a reduction in sulfatase activity. Overexpressing sumf1 delays convergence and extension (C&E) onset, while loss of sumf1 function causes precocious C&E. The effector is Sulf1, an extracellular sulfatase modifying heparan sulfate proteoglycans (HSPGs), and altered HSPG sulfation levels suppress sumf1/sumf2 mutant C&E timing phenotypes.","method":"Zebrafish embryonic explants; gain- and loss-of-function experiments (overexpression, morpholino/mutant); genetic epistasis with Sulf1 and HSPG sulfation levels; C&E morphogenesis timing assays","journal":"bioRxiv","confidence":"Medium","confidence_rationale":"Tier 2 — preprint with multiple orthogonal genetic approaches (GOF, LOF, epistasis) in zebrafish ortholog; mechanism is functionally validated","pmids":[],"is_preprint":true}],"current_model":"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent enzyme that resides primarily in the ER where it post-translationally converts a conserved active-site cysteine in all known sulfatases to Cα-formylglycine, an essential catalytic residue; FGE's ER retention and secretion are dynamically regulated by sequential interactions with PDI (which couples retention and activation), ERGIC-53 (which promotes ER export), and ERp44 (which retrieves FGE to the ER), and mutations in SUMF1 that reduce both FGE enzymatic activity and protein stability cause multiple sulfatase deficiency with severity proportional to the degree of residual FGE function."},"narrative":{"teleology":[{"year":2003,"claim":"The gene responsible for generating the catalytic formylglycine residue in all sulfatases was identified as SUMF1, establishing it as the master activator of the entire sulfatase family and defining a new conserved gene family.","evidence":"Bioinformatic and biochemical identification of SUMF1/FGE as the Cα-formylglycine-generating enzyme","pmids":["14563551"],"confidence":"High","gaps":["Catalytic mechanism (cofactor requirement) not yet determined","Paralog SUMF2 function unknown","Structural basis for substrate recognition unclear"]},{"year":2004,"claim":"Functional co-expression studies demonstrated that SUMF1 directly enhances sulfatase activity and that MSD-causing missense mutations impair this activity, establishing the genotype-phenotype link for multiple sulfatase deficiency.","evidence":"Co-expression of SUMF1 wild-type and mutants with individual sulfatases in COS-7 cells; enzymatic activity assays","pmids":["15146462"],"confidence":"High","gaps":["Molecular basis for differential effects of mutations on different sulfatases not resolved","Protein stability vs. catalytic deficiency not yet dissected"]},{"year":2007,"claim":"In vivo co-delivery of SUMF1 with sulfatases via AAV vectors demonstrated therapeutic synergy, clearing lysosomal storage and improving neurological function in MPS-IIIA mice, while knockout rescue experiments established that all MSD mutations are hypomorphic and complete SUMF1 loss is likely lethal.","evidence":"AAV-mediated co-delivery in patient cells and MPS-IIIA mouse brain; Sumf1-/- MEF rescue with MSD mutants; behavioral, histological, and biochemical readouts","pmids":["17206939","17725987","17657823"],"confidence":"High","gaps":["Therapeutic window and long-term efficacy of gene therapy not established","Whether any SUMF1 null human exists remains unclear","Contribution of secreted/paracrine FGE to in vivo correction not quantified"]},{"year":2008,"claim":"The ER trafficking mechanism of FGE was resolved: despite lacking canonical retention signals, FGE is retained in the ER through PDI interaction, exported via ERGIC-53, and retrieved from post-ER compartments by ERp44, with each interaction serving a distinct functional role in balancing intracellular activation versus paracrine secretion.","evidence":"Co-immunoprecipitation with PDI, ERGIC-53, and ERp44; siRNA knockdown of each interactor with distinct phenotypic outcomes; subcellular fractionation","pmids":["18508857"],"confidence":"High","gaps":["Structural basis of FGE interaction with each ER chaperone not determined","How the balance between retention and secretion is physiologically regulated is unknown","Whether MSD mutations alter these interactions not tested"]},{"year":2008,"claim":"Dissecting multiple MSD mutations revealed that both reduced specific enzymatic activity and decreased protein stability independently contribute to disease, resolving why mutations with similar catalytic impairment can produce different clinical severities.","evidence":"FGE activity assays, protein stability measurements, and ER localization in MSD patient fibroblasts for multiple SUMF1 missense variants","pmids":["18157819"],"confidence":"High","gaps":["Structural mechanism by which individual mutations destabilize FGE not resolved at atomic level","Contribution of ER quality control/degradation to residual FGE levels not quantified"]},{"year":2011,"claim":"A systematic genotype-phenotype correlation across MSD patients established that the combination of residual FGE activity and protein stability quantitatively predicts clinical severity, from neonatal lethal to mild late-onset forms.","evidence":"Multi-parameter functional analysis (FGE expression, stability, activity, sulfatase activities) across 10 MSD patients with clinical correlation","pmids":["21224894"],"confidence":"High","gaps":["Modifier genes or environmental factors influencing MSD severity not addressed","Whether pharmacological stabilization of mutant FGE could be therapeutic remains untested"]},{"year":2019,"claim":"The copper dependence of FGE catalysis was established, placing it in the class of copper-dependent oxidases and enabling biotechnological exploitation of formylglycine as a site-specific bioconjugation handle.","evidence":"In vitro and mammalian cell reconstitution of copper-dependent FGE-mediated fGly generation; application to antibody-drug conjugate production","pmids":["31161504"],"confidence":"Medium","gaps":["How copper loading of FGE is regulated in the ER lumen not determined","Whether copper availability limits FGE activity in disease states unknown"]},{"year":null,"claim":"Key unresolved questions include the structural basis for FGE substrate recognition across diverse sulfatases, how copper is delivered to FGE in the ER, the function of paralog SUMF2 and its relationship to SUMF1, and whether pharmacological stabilization of mutant FGE proteins is a viable therapeutic strategy for MSD.","evidence":"","pmids":[],"confidence":"Low","gaps":["No high-resolution structure of FGE-sulfatase complex","SUMF2 function remains uncharacterized","No pharmacological chaperone strategy validated for MSD"]}],"mechanism_profile":{"molecular_activity":[{"term_id":"GO:0140096","term_label":"catalytic activity, acting on a protein","supporting_discovery_ids":[0,1,8]},{"term_id":"GO:0016491","term_label":"oxidoreductase activity","supporting_discovery_ids":[0,8]}],"localization":[{"term_id":"GO:0005783","term_label":"endoplasmic reticulum","supporting_discovery_ids":[4,5,6,10]},{"term_id":"GO:0005576","term_label":"extracellular region","supporting_discovery_ids":[5]}],"pathway":[{"term_id":"R-HSA-392499","term_label":"Metabolism of proteins","supporting_discovery_ids":[0,1,8]},{"term_id":"R-HSA-9609507","term_label":"Protein localization","supporting_discovery_ids":[5]}],"complexes":[],"partners":["PDI","ERGIC-53","ERP44"],"other_free_text":[]},"mechanistic_narrative":"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent oxidase that post-translationally converts a conserved active-site cysteine to Cα-formylglycine in all known sulfatases, a modification essential for sulfate ester hydrolysis [PMID:14563551, PMID:31161504]. FGE resides primarily in the endoplasmic reticulum, where its retention and secretion are dynamically regulated by sequential interactions with PDI (which couples retention and activation), ERGIC-53 (which promotes ER export and prevents proteasomal degradation), and ERp44 (which retrieves secreted FGE back to the ER), enabling both cell-autonomous sulfatase activation and paracrine activity [PMID:18508857]. Co-expression of SUMF1 with individual sulfatases synergistically enhances their enzymatic activity in vitro and in vivo, including correction of glycosaminoglycan storage and neurological deficits in mouse models of mucopolysaccharidosis IIIA [PMID:17206939, PMID:17725987]. Hypomorphic SUMF1 mutations that reduce both FGE catalytic activity and protein stability cause multiple sulfatase deficiency (MSD), with clinical severity proportional to the degree of residual FGE function; complete loss of SUMF1 is predicted to be lethal [PMID:17657823, PMID:21224894]."},"prefetch_data":{"uniprot":{"accession":"Q8NBK3","full_name":"Formylglycine-generating enzyme","aliases":["C-alpha-formylglycine-generating enzyme 1","Sulfatase-modifying factor 1"],"length_aa":374,"mass_kda":40.6,"function":"Oxidase that catalyzes the conversion of cysteine to 3-oxoalanine on target proteins, using molecular oxygen and an unidentified reducing agent (PubMed:12757706, PubMed:15657036, PubMed:15907468, PubMed:16368756, PubMed:21224894, PubMed:25931126). 3-oxoalanine modification, which is also named formylglycine (fGly), occurs in the maturation of arylsulfatases and some alkaline phosphatases that use the hydrated form of 3-oxoalanine as a catalytic nucleophile (PubMed:12757706, PubMed:15657036, PubMed:15907468, PubMed:16368756, PubMed:25931126). Known substrates include GALNS, ARSA, STS and ARSE (PubMed:12757706, PubMed:15657036, PubMed:15907468)","subcellular_location":"Endoplasmic reticulum lumen","url":"https://www.uniprot.org/uniprotkb/Q8NBK3/entry"},"depmap":{"release":"DepMap","has_data":true,"is_common_essential":false,"resolved_as":"","url":"https://depmap.org/portal/gene/SUMF1","classification":"Not Classified","n_dependent_lines":0,"n_total_lines":1208,"dependency_fraction":0.0},"opencell":{"profiled":false,"resolved_as":"","ensg_id":"","cell_line_id":"","localizations":[],"interactors":[],"url":"https://opencell.sf.czbiohub.org/search/SUMF1","total_profiled":1310},"omim":[{"mim_id":"607940","title":"SULFATASE-MODIFYING FACTOR 2; SUMF2","url":"https://www.omim.org/entry/607940"},{"mim_id":"607939","title":"SULFATASE-MODIFYING FACTOR 1; SUMF1","url":"https://www.omim.org/entry/607939"},{"mim_id":"607280","title":"CONTACTIN 4; CNTN4","url":"https://www.omim.org/entry/607280"},{"mim_id":"606658","title":"SPINOCEREBELLAR ATAXIA 15; SCA15","url":"https://www.omim.org/entry/606658"},{"mim_id":"272200","title":"MULTIPLE SULFATASE DEFICIENCY; MSD","url":"https://www.omim.org/entry/272200"}],"hpa":{"profiled":true,"resolved_as":"","reliability":"","locations":[],"tissue_specificity":"Low tissue specificity","tissue_distribution":"Detected in all","driving_tissues":[],"url":"https://www.proteinatlas.org/search/SUMF1"},"hgnc":{"alias_symbol":["FGE","UNQ3037"],"prev_symbol":[]},"alphafold":{"accession":"Q8NBK3","domains":[{"cath_id":"3.90.1580.10","chopping":"90-147_208-368","consensus_level":"high","plddt":98.2349,"start":90,"end":368}],"viewer_url":"https://alphafold.ebi.ac.uk/entry/Q8NBK3","model_url":"https://alphafold.ebi.ac.uk/files/AF-Q8NBK3-F1-model_v6.cif","pae_url":"https://alphafold.ebi.ac.uk/files/AF-Q8NBK3-F1-predicted_aligned_error_v6.png","plddt_mean":83.56},"mouse_models":{"mgi_url":"https://www.informatics.jax.org/marker/summary?nomen=SUMF1","jax_strain_url":"https://www.jax.org/strain/search?query=SUMF1"},"sequence":{"accession":"Q8NBK3","fasta_url":"https://rest.uniprot.org/uniprotkb/Q8NBK3.fasta","uniprot_url":"https://www.uniprot.org/uniprotkb/Q8NBK3/entry","alphafold_viewer_url":"https://alphafold.ebi.ac.uk/entry/Q8NBK3"}},"corpus_meta":[{"pmid":"24524415","id":"PMC_24524415","title":"Intracerebral administration of adeno-associated viral vector serotype rh.10 carrying human SGSH and SUMF1 cDNAs in children with mucopolysaccharidosis type IIIA disease: results of a phase I/II trial.","date":"2014","source":"Human gene therapy","url":"https://pubmed.ncbi.nlm.nih.gov/24524415","citation_count":205,"is_preprint":false},{"pmid":"17725987","id":"PMC_17725987","title":"Functional correction of CNS lesions in an MPS-IIIA mouse model by intracerebral AAV-mediated delivery of sulfamidase and SUMF1 genes.","date":"2007","source":"Human molecular genetics","url":"https://pubmed.ncbi.nlm.nih.gov/17725987","citation_count":104,"is_preprint":false},{"pmid":"17206939","id":"PMC_17206939","title":"SUMF1 enhances sulfatase activities in vivo in five sulfatase deficiencies.","date":"2007","source":"The Biochemical journal","url":"https://pubmed.ncbi.nlm.nih.gov/17206939","citation_count":65,"is_preprint":false},{"pmid":"14563551","id":"PMC_14563551","title":"The human SUMF1 gene, required for posttranslational sulfatase modification, defines a new gene family which is conserved from pro- to eukaryotes.","date":"2003","source":"Gene","url":"https://pubmed.ncbi.nlm.nih.gov/14563551","citation_count":59,"is_preprint":false},{"pmid":"15146462","id":"PMC_15146462","title":"Molecular and functional analysis of SUMF1 mutations in multiple sulfatase deficiency.","date":"2004","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/15146462","citation_count":59,"is_preprint":false},{"pmid":"18508857","id":"PMC_18508857","title":"Multistep, sequential control of the trafficking and function of the multiple sulfatase deficiency gene product, SUMF1 by PDI, ERGIC-53 and ERp44.","date":"2008","source":"Human molecular genetics","url":"https://pubmed.ncbi.nlm.nih.gov/18508857","citation_count":58,"is_preprint":false},{"pmid":"21224894","id":"PMC_21224894","title":"SUMF1 mutations affecting stability and activity of formylglycine generating enzyme predict clinical outcome in multiple sulfatase deficiency.","date":"2011","source":"European journal of human genetics : EJHG","url":"https://pubmed.ncbi.nlm.nih.gov/21224894","citation_count":52,"is_preprint":false},{"pmid":"17657823","id":"PMC_17657823","title":"Multiple sulfatase deficiency is due to hypomorphic mutations of the SUMF1 gene.","date":"2007","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/17657823","citation_count":35,"is_preprint":false},{"pmid":"18157819","id":"PMC_18157819","title":"Molecular analysis of SUMF1 mutations: stability and residual activity of mutant formylglycine-generating enzyme determine disease severity in multiple sulfatase deficiency.","date":"2008","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/18157819","citation_count":35,"is_preprint":false},{"pmid":"25885655","id":"PMC_25885655","title":"Natural disease history and characterisation of SUMF1 molecular defects in ten unrelated patients with multiple sulfatase deficiency.","date":"2015","source":"Orphanet journal of rare diseases","url":"https://pubmed.ncbi.nlm.nih.gov/25885655","citation_count":29,"is_preprint":false},{"pmid":"18989752","id":"PMC_18989752","title":"Effect of elongation factor 1alpha promoter and SUMF1 over in vitro expression of N-acetylgalactosamine-6-sulfate sulfatase.","date":"2008","source":"Molecular biology reports","url":"https://pubmed.ncbi.nlm.nih.gov/18989752","citation_count":26,"is_preprint":false},{"pmid":"28464818","id":"PMC_28464818","title":"Sulfatase modifying factor 1 (SUMF1) is associated with Chronic Obstructive Pulmonary Disease.","date":"2017","source":"Respiratory research","url":"https://pubmed.ncbi.nlm.nih.gov/28464818","citation_count":10,"is_preprint":false},{"pmid":"28566233","id":"PMC_28566233","title":"Expanding the genetic cause of multiple sulfatase deficiency: A novel SUMF1 variant in a patient displaying a severe late infantile form of the disease.","date":"2017","source":"Molecular genetics and metabolism","url":"https://pubmed.ncbi.nlm.nih.gov/28566233","citation_count":9,"is_preprint":false},{"pmid":"32626305","id":"PMC_32626305","title":"Scientific Opinion on Flavouring Group Evaluation 210 Revision 3 (FGE.210Rev3): Consideration of genotoxic potential for α,β-unsaturated alicyclic ketones and precursors from chemical subgroup 2.4 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626305","citation_count":9,"is_preprint":false},{"pmid":"32625708","id":"PMC_32625708","title":"Scientific Opinion on Flavouring Group Evaluation 201 Revision 2 (FGE.201Rev2): 2-alkylated, aliphatic, acyclic alpha,beta-unsaturated aldehydes and precursors, with or without additional double-bonds, from chemical subgroup 1.1.2 of FGE.19.","date":"2018","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625708","citation_count":9,"is_preprint":false},{"pmid":"30598261","id":"PMC_30598261","title":"Integrative expression analysis identifies a novel interplay between CFTR and linc-SUMF1-2 that involves CF-associated gene dysregulation.","date":"2018","source":"Biochemical and biophysical research communications","url":"https://pubmed.ncbi.nlm.nih.gov/30598261","citation_count":8,"is_preprint":false},{"pmid":"32625963","id":"PMC_32625963","title":"Scientific Opinion on Flavouring Group Evaluation 203, Revision 2 (FGE.203Rev2): α,β-unsaturated aliphatic aldehydes and precursors from chemical subgroup 1.1.4 of FGE.19 with two or more conjugated double-bonds and with or without additional non-conjugated double-bonds.","date":"2018","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625963","citation_count":8,"is_preprint":false},{"pmid":"36433920","id":"PMC_36433920","title":"New mouse models with hypomorphic SUMF1 variants mimic attenuated forms of multiple sulfatase deficiency.","date":"2022","source":"Journal of inherited metabolic disease","url":"https://pubmed.ncbi.nlm.nih.gov/36433920","citation_count":6,"is_preprint":false},{"pmid":"36794062","id":"PMC_36794062","title":"Flavouring Group Evaluation 21 Revision 6 (FGE.21Rev6): thiazoles, thiophenes, thiazoline and thienyl derivatives from chemical groups 29 and 30.","date":"2023","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/36794062","citation_count":6,"is_preprint":false},{"pmid":"32645339","id":"PMC_32645339","title":"Detection of 2,4-dichlorophenoxyacetic acid herbicide using a FGE-sulfatase based whole-cell Agrobacterium biosensor.","date":"2020","source":"Journal of microbiological methods","url":"https://pubmed.ncbi.nlm.nih.gov/32645339","citation_count":6,"is_preprint":false},{"pmid":"32625501","id":"PMC_32625501","title":"Scientific Opinion on Flavouring Group Evaluation 226 Revision 1 (FGE.226Rev1): consideration of genotoxicity data on one α,β-unsaturated aldehyde from chemical subgroup 1.1.1(b) of FGE.19.","date":"2017","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625501","citation_count":6,"is_preprint":false},{"pmid":"32625330","id":"PMC_32625330","title":"Scientific Opinion of Flavouring Group Evaluation 410 (FGE.410): 4',5,7-trihydroxyflavanone from chemical group 25 (phenol derivatives containing ring-alkyl, ring-alkoxy, and side-chains with an oxygenated functional group).","date":"2017","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625330","citation_count":6,"is_preprint":false},{"pmid":"32626370","id":"PMC_32626370","title":"Scientific Opinion on Flavouring Group Evaluation 204 Revision 1 (FGE.204Rev1): consideration of genotoxicity data on representatives for 17 monounsaturated, aliphatic, α,β-unsaturated ketones and precursors from chemical subgroup 1.2.1 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626370","citation_count":5,"is_preprint":false},{"pmid":"32626109","id":"PMC_32626109","title":"Scientific Opinion on Flavouring Group Evaluation 208 Revision 3 (FGE.208Rev3): consideration of genotoxicity data on alicyclic aldehydes with α,β-unsaturation in ring/side-chain and precursors from chemical subgroup 2.2 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626109","citation_count":5,"is_preprint":false},{"pmid":"25222778","id":"PMC_25222778","title":"Molecular evaluation of a novel missense mutation & an insertional truncating mutation in SUMF1 gene.","date":"2014","source":"The Indian journal of medical research","url":"https://pubmed.ncbi.nlm.nih.gov/25222778","citation_count":5,"is_preprint":false},{"pmid":"36959582","id":"PMC_36959582","title":"Late infantile form of multiple sulfatase deficiency with a novel missense variant in the SUMF1 gene: case report and review.","date":"2023","source":"BMC pediatrics","url":"https://pubmed.ncbi.nlm.nih.gov/36959582","citation_count":4,"is_preprint":false},{"pmid":"31161504","id":"PMC_31161504","title":"Site-Specific Labeling of Proteins Using the Formylglycine-Generating Enzyme (FGE).","date":"2019","source":"Methods in molecular biology (Clifton, N.J.)","url":"https://pubmed.ncbi.nlm.nih.gov/31161504","citation_count":4,"is_preprint":false},{"pmid":"38711805","id":"PMC_38711805","title":"Flavouring group evaluation 419 (FGE.419): 2-methyl-1-(2-(5-(p-tolyl)-1H-imidazol-2-yl)piperidin-1-yl)butan-1-one.","date":"2024","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/38711805","citation_count":3,"is_preprint":false},{"pmid":"32048457","id":"PMC_32048457","title":"A homozygous missense variant of SUMF1 in the Bedouin population extends the clinical spectrum in ultrarare neonatal multiple sulfatase deficiency.","date":"2020","source":"Molecular genetics & genomic medicine","url":"https://pubmed.ncbi.nlm.nih.gov/32048457","citation_count":3,"is_preprint":false},{"pmid":"37089172","id":"PMC_37089172","title":"Flavouring Group Evaluation 217 Revision 3 (FGE.217Rev3): consideration of genotoxic potential for α,β-unsaturated ketones and precursors from chemical subgroup 4.1 of FGE.19: lactones.","date":"2023","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/37089172","citation_count":3,"is_preprint":false},{"pmid":"35991962","id":"PMC_35991962","title":"Scientific opinion on Flavouring group evaluation 216 revision 2 (FGE.216Rev2): consideration of the genotoxicity potential of α,β-unsaturated 2-phenyl-2-alkenals from subgroup 3.3 of FGE.19.","date":"2022","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/35991962","citation_count":2,"is_preprint":false},{"pmid":"35814922","id":"PMC_35814922","title":"Scientific opinion on flavouring group evaluation 415 (FGE.415): (E)-3-benzo[1,3]dioxol-5-yl-N,N-diphenyl-2-propenamide.","date":"2022","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/35814922","citation_count":1,"is_preprint":false},{"pmid":"37344788","id":"PMC_37344788","title":"Association between SUMF1 polymorphisms and COVID-19 severity.","date":"2023","source":"BMC genomic data","url":"https://pubmed.ncbi.nlm.nih.gov/37344788","citation_count":1,"is_preprint":false},{"pmid":"36441600","id":"PMC_36441600","title":"Genetic analysis of a novel SUMF1 variation associated with a late infantile form of multiple sulfatase deficiency.","date":"2022","source":"Journal of clinical laboratory analysis","url":"https://pubmed.ncbi.nlm.nih.gov/36441600","citation_count":1,"is_preprint":false},{"pmid":"40772760","id":"PMC_40772760","title":"Structure of the T9SS PorKN ring complex reveals conformational plasticity based on the repurposed FGE fold.","date":"2025","source":"mBio","url":"https://pubmed.ncbi.nlm.nih.gov/40772760","citation_count":1,"is_preprint":false},{"pmid":"38863195","id":"PMC_38863195","title":"Non-syndromic retinal dystrophy associated with biallelic variation of SUMF1 and reduced leukocyte sulfatase activity.","date":"2024","source":"Clinical genetics","url":"https://pubmed.ncbi.nlm.nih.gov/38863195","citation_count":1,"is_preprint":false},{"pmid":"38460946","id":"PMC_38460946","title":"SUMF1 overexpression promotes tumorous cell growth and migration and is correlated with the immune status of patients with glioma.","date":"2024","source":"Aging","url":"https://pubmed.ncbi.nlm.nih.gov/38460946","citation_count":1,"is_preprint":false},{"pmid":"39870870","id":"PMC_39870870","title":"Preclinical use of a clinically-relevant scAAV9/SUMF1 vector for the treatment of multiple sulfatase deficiency.","date":"2025","source":"Communications medicine","url":"https://pubmed.ncbi.nlm.nih.gov/39870870","citation_count":0,"is_preprint":false},{"pmid":"33795796","id":"PMC_33795796","title":"Family-based gene-environment interaction using sequence kernel association test (FGE-SKAT) for complex quantitative traits.","date":"2021","source":"Scientific reports","url":"https://pubmed.ncbi.nlm.nih.gov/33795796","citation_count":0,"is_preprint":false},{"pmid":"39896350","id":"PMC_39896350","title":"Flavouring group evaluation 418 (FGE. 418): 3-[3-(2-isopropyl-5-methyl-cyclohexyl)-ureido]-butyric acid ethyl ester.","date":"2025","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/39896350","citation_count":0,"is_preprint":false},{"pmid":"42016130","id":"PMC_42016130","title":"Scientific Opinion on Flavouring Group Evaluation 226 (FGE.226): Consideration of genotoxicity data on one α,β-unsaturated aldehyde from chemical subgroup 1.1.1(b) of FGE.19 by EFSA.","date":"2012","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/42016130","citation_count":0,"is_preprint":false},{"pmid":"41155512","id":"PMC_41155512","title":"SUMF1 Common Variant rs793391 Is Associated with Response to Inhaled Corticosteroids in Patients with COPD.","date":"2025","source":"International journal of molecular sciences","url":"https://pubmed.ncbi.nlm.nih.gov/41155512","citation_count":0,"is_preprint":false},{"pmid":null,"id":"bio_10.1101_2025.10.09.681375","title":"Sulfatase modifying factors control the timing of zebrafish convergence and extension morphogenesis","date":"2025-10-10","source":"bioRxiv","url":"https://doi.org/10.1101/2025.10.09.681375","citation_count":0,"is_preprint":true},{"pmid":null,"id":"bio_10.1101_2024.06.12.598629","title":"TMX5/TXNDC15, a natural trapping mutant of the PDI family is a client of the proteostatic factor ERp44","date":"2024-06-13","source":"bioRxiv","url":"https://doi.org/10.1101/2024.06.12.598629","citation_count":0,"is_preprint":true}],"cost":{"stage1":{"model":"claude-sonnet-4-6","input_tokens":25818,"output_tokens":3242,"usd":0.063042},"stage2":{"model":"claude-opus-4-6","input_tokens":6607,"output_tokens":2088,"usd":0.127853},"total_usd":0.190895,"stage1_batch_id":"msgbatch_01RPbVc9PDH9zQmYK5Hs6srm","stage2_batch_id":"msgbatch_01JVbVLzeKN2yVrsGp3zC8aM","note":"batch pricing = 50% of standard"},"stage1_raw":"```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2003,\n      \"finding\": \"SUMF1 encodes the Cα-formylglycine (FGly)-generating enzyme (FGE), which post-translationally converts a conserved cysteine residue in the active site of all sulfatases to FGly, an essential catalytic residue required for sulfatase activity. SUMF1 defines a new gene family conserved from prokaryotes to eukaryotes, with orthologs in vertebrates and a paralog SUMF2 of unknown function.\",\n      \"method\": \"Bioinformatic/phylogenetic analysis of gene family; biochemical identification of FGE as product of SUMF1\",\n      \"journal\": \"Gene\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1/2 — foundational identification of enzymatic activity, replicated across multiple subsequent studies\",\n      \"pmids\": [\"14563551\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2004,\n      \"finding\": \"SUMF1/FGE strongly enhances the activity of co-expressed sulfatases in COS-7 cells; missense mutations in SUMF1 that cause multiple sulfatase deficiency (MSD) result in severely impaired sulfatase-enhancing activity, with some mutations showing variable effects across different sulfatases.\",\n      \"method\": \"Transient co-expression of SUMF1 mutants with individual sulfatases in COS-7 cells; enzymatic activity assays\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — functional co-expression assays with multiple mutants and multiple sulfatase substrates, replicated across labs\",\n      \"pmids\": [\"15146462\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"SUMF1 co-expression with sulfatase genes (via AAV or lentiviral vectors) enhances sulfatase activity and improves clearance of intracellular glycosaminoglycan or sulfolipid accumulation in cells from patients with five different sulfatase deficiencies (MLD, CDPX, MPS II, IIIA, VI), and in vivo in MPS-IIIA mouse muscle.\",\n      \"method\": \"AAV/lentivirus-mediated co-delivery in patient cells and mouse models; enzymatic activity assays; biochemical storage clearance assays\",\n      \"journal\": \"The Biochemical journal\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple sulfatase substrates, multiple disease models, in vitro and in vivo validation\",\n      \"pmids\": [\"17206939\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"Co-delivery of SUMF1 with SGSH (sulfamidase) via AAV2/5 into the brains of MPS-IIIA mice results in synergistic increases in SGSH activity, visible reduction in lysosomal storage and inflammatory markers, and improvement in motor and cognitive functions, demonstrating SUMF1's enhancing role in the CNS context.\",\n      \"method\": \"Intraventricular AAV2/5 injection in neonatal MPS-IIIA mice; enzymatic activity assays in brain regions; histological storage markers; behavioral testing\",\n      \"journal\": \"Human molecular genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — in vivo model with multiple orthogonal readouts (biochemical, histological, behavioral)\",\n      \"pmids\": [\"17725987\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"MSD is caused by hypomorphic SUMF1 mutations; all known MSD-causing SUMF1 missense mutant proteins are properly localized to the ER and of correct molecular weight, but provide only partial rescue of sulfatase activities when expressed in Sumf1 knockout MEFs. Complete loss of SUMF1 function is likely lethal in humans.\",\n      \"method\": \"Viral-mediated expression of SUMF1 mutants in Sumf1-/- MEFs; subcellular localization; enzymatic activity rescue assays\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — KO cell rescue assay with multiple mutants, direct localization, functional readout\",\n      \"pmids\": [\"17657823\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2008,\n      \"finding\": \"SUMF1 is largely retained in the ER (despite lacking canonical retention signals) where it activates nascent sulfatases, and part of SUMF1 is secreted and taken up paracrinally. SUMF1 physically interacts with PDI, ERGIC-53, and ERp44: PDI couples SUMF1 ER retention and activation; ERGIC-53 favors SUMF1 export from the ER; ERp44 retrieves SUMF1 to the ER. Silencing ERGIC-53 causes proteasomal degradation of SUMF1; down-regulating ERp44 promotes SUMF1 secretion.\",\n      \"method\": \"Co-immunoprecipitation, subcellular fractionation/localization, siRNA knockdown of interactors, functional sulfatase activation assays, proteasome inhibitor experiments\",\n      \"journal\": \"Human molecular genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — reciprocal Co-IP with multiple interactors, functional knockdown of each interactor with distinct phenotypic readouts, replicated with multiple orthogonal methods\",\n      \"pmids\": [\"18508857\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2008,\n      \"finding\": \"SUMF1 missense mutations (p.A177P, p.W179S, p.A279V, p.R349W) do not affect ER localization of FGE but decrease specific enzymatic activity to <1–23% of wild type, and variably decrease protein stability; both reduced enzyme activity and reduced protein stability contribute to MSD clinical severity.\",\n      \"method\": \"Subcellular localization in MSD fibroblasts; FGE enzymatic activity assays; protein stability/western blot analysis\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — functional characterization with multiple mutants using activity assays, stability assays, and localization in patient cells\",\n      \"pmids\": [\"18157819\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2011,\n      \"finding\": \"Clinical phenotypic severity in MSD patients correlates with both residual FGE enzymatic activity and FGE protein stability: near-complete loss of FGE activity with highly unstable protein causes neonatal severe phenotype, while high residual activity with unstable protein causes mild phenotype.\",\n      \"method\": \"FGE expression, localization, and stability analysis (western blot, immunofluorescence) in patient-derived cells; FGE activity assays; sulfatase activity measurements; clinical correlation\",\n      \"journal\": \"European journal of human genetics : EJHG\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multi-parameter functional analysis across 10 patients with clinical correlation; multiple orthogonal methods\",\n      \"pmids\": [\"21224894\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2019,\n      \"finding\": \"FGE (encoded by SUMF1) is a copper-dependent post-translational protein modifier that generates formylglycine (an aldehyde-containing amino acid) at a specific cysteine in sulfatase active sites; this formylglycine residue is essential for sulfate ester hydrolysis and can serve as a bioconjugation handle in mammalian expression systems.\",\n      \"method\": \"In vitro and mammalian cell reconstitution of FGE-mediated fGly generation; biochemical characterization; application to antibody-drug conjugate production\",\n      \"journal\": \"Methods in molecular biology (Clifton, N.J.)\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 1/2 — enzymatic mechanism established, but this paper is largely methodological; underlying mechanism well-established by prior studies\",\n      \"pmids\": [\"31161504\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"A novel SUMF1 missense variant (p.A348V) produces a highly unstable FGE protein that lacks catalytic function, causing a neonatal severe form of MSD; functional analysis in cell culture confirmed loss of FGE activity.\",\n      \"method\": \"Expression of SUMF1 variant in cell culture model; FGE protein stability and activity assays; genotype-phenotype correlation\",\n      \"journal\": \"Molecular genetics & genomic medicine\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — single lab, cell culture expression with activity and stability assays; supports established genotype-phenotype model\",\n      \"pmids\": [\"32048457\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2017,\n      \"finding\": \"A novel SUMF1 variant (p.E113K) correctly localizes to the ER but is retained intracellularly and exhibits only ~15% of wild-type FGE activity when expressed in immortalized MSD cells; structural modeling predicts destabilization of secondary structure affecting disulfide bridging and the active site groove.\",\n      \"method\": \"Cell culture expression of FGE variant; subcellular localization; FGE activity assay (steroid sulfatase activation); structural analysis based on crystal structure\",\n      \"journal\": \"Molecular genetics and metabolism\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2/3 — single lab, functional activity assay + localization; structural prediction is computational\",\n      \"pmids\": [\"28566233\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2024,\n      \"finding\": \"SUMF1 (FGE) does not contain ER retention sequences and relies on ERp44 engagement for proper inter-compartmental distribution in the early secretory pathway, placing it in the same regulatory class as Ero1α, Ero1β, Prx4, and ERAP1.\",\n      \"method\": \"Biochemical interaction analysis in ER pathway context; comparison to known ERp44 clients\",\n      \"journal\": \"bioRxiv\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 3 — preprint, inference from interaction data; mechanism of ERp44-mediated retention was established earlier in PMID:18508857\",\n      \"pmids\": [],\n      \"is_preprint\": true\n    },\n    {\n      \"year\": 2025,\n      \"finding\": \"In zebrafish, sumf1 (ortholog of human SUMF1) acts as a positive regulator of sulfatase activity; sumf1 and sumf2 expression levels invert at gastrulation onset, predicting a reduction in sulfatase activity. Overexpressing sumf1 delays convergence and extension (C&E) onset, while loss of sumf1 function causes precocious C&E. The effector is Sulf1, an extracellular sulfatase modifying heparan sulfate proteoglycans (HSPGs), and altered HSPG sulfation levels suppress sumf1/sumf2 mutant C&E timing phenotypes.\",\n      \"method\": \"Zebrafish embryonic explants; gain- and loss-of-function experiments (overexpression, morpholino/mutant); genetic epistasis with Sulf1 and HSPG sulfation levels; C&E morphogenesis timing assays\",\n      \"journal\": \"bioRxiv\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — preprint with multiple orthogonal genetic approaches (GOF, LOF, epistasis) in zebrafish ortholog; mechanism is functionally validated\",\n      \"pmids\": [],\n      \"is_preprint\": true\n    }\n  ],\n  \"current_model\": \"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent enzyme that resides primarily in the ER where it post-translationally converts a conserved active-site cysteine in all known sulfatases to Cα-formylglycine, an essential catalytic residue; FGE's ER retention and secretion are dynamically regulated by sequential interactions with PDI (which couples retention and activation), ERGIC-53 (which promotes ER export), and ERp44 (which retrieves FGE to the ER), and mutations in SUMF1 that reduce both FGE enzymatic activity and protein stability cause multiple sulfatase deficiency with severity proportional to the degree of residual FGE function.\"\n}\n```","stage2_raw":"```json\n{\n  \"mechanistic_narrative\": \"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent oxidase that post-translationally converts a conserved active-site cysteine to Cα-formylglycine in all known sulfatases, a modification essential for sulfate ester hydrolysis [PMID:14563551, PMID:31161504]. FGE resides primarily in the endoplasmic reticulum, where its retention and secretion are dynamically regulated by sequential interactions with PDI (which couples retention and activation), ERGIC-53 (which promotes ER export and prevents proteasomal degradation), and ERp44 (which retrieves secreted FGE back to the ER), enabling both cell-autonomous sulfatase activation and paracrine activity [PMID:18508857]. Co-expression of SUMF1 with individual sulfatases synergistically enhances their enzymatic activity in vitro and in vivo, including correction of glycosaminoglycan storage and neurological deficits in mouse models of mucopolysaccharidosis IIIA [PMID:17206939, PMID:17725987]. Hypomorphic SUMF1 mutations that reduce both FGE catalytic activity and protein stability cause multiple sulfatase deficiency (MSD), with clinical severity proportional to the degree of residual FGE function; complete loss of SUMF1 is predicted to be lethal [PMID:17657823, PMID:21224894].\",\n  \"teleology\": [\n    {\n      \"year\": 2003,\n      \"claim\": \"The gene responsible for generating the catalytic formylglycine residue in all sulfatases was identified as SUMF1, establishing it as the master activator of the entire sulfatase family and defining a new conserved gene family.\",\n      \"evidence\": \"Bioinformatic and biochemical identification of SUMF1/FGE as the Cα-formylglycine-generating enzyme\",\n      \"pmids\": [\"14563551\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Catalytic mechanism (cofactor requirement) not yet determined\", \"Paralog SUMF2 function unknown\", \"Structural basis for substrate recognition unclear\"]\n    },\n    {\n      \"year\": 2004,\n      \"claim\": \"Functional co-expression studies demonstrated that SUMF1 directly enhances sulfatase activity and that MSD-causing missense mutations impair this activity, establishing the genotype-phenotype link for multiple sulfatase deficiency.\",\n      \"evidence\": \"Co-expression of SUMF1 wild-type and mutants with individual sulfatases in COS-7 cells; enzymatic activity assays\",\n      \"pmids\": [\"15146462\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Molecular basis for differential effects of mutations on different sulfatases not resolved\", \"Protein stability vs. catalytic deficiency not yet dissected\"]\n    },\n    {\n      \"year\": 2007,\n      \"claim\": \"In vivo co-delivery of SUMF1 with sulfatases via AAV vectors demonstrated therapeutic synergy, clearing lysosomal storage and improving neurological function in MPS-IIIA mice, while knockout rescue experiments established that all MSD mutations are hypomorphic and complete SUMF1 loss is likely lethal.\",\n      \"evidence\": \"AAV-mediated co-delivery in patient cells and MPS-IIIA mouse brain; Sumf1-/- MEF rescue with MSD mutants; behavioral, histological, and biochemical readouts\",\n      \"pmids\": [\"17206939\", \"17725987\", \"17657823\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Therapeutic window and long-term efficacy of gene therapy not established\", \"Whether any SUMF1 null human exists remains unclear\", \"Contribution of secreted/paracrine FGE to in vivo correction not quantified\"]\n    },\n    {\n      \"year\": 2008,\n      \"claim\": \"The ER trafficking mechanism of FGE was resolved: despite lacking canonical retention signals, FGE is retained in the ER through PDI interaction, exported via ERGIC-53, and retrieved from post-ER compartments by ERp44, with each interaction serving a distinct functional role in balancing intracellular activation versus paracrine secretion.\",\n      \"evidence\": \"Co-immunoprecipitation with PDI, ERGIC-53, and ERp44; siRNA knockdown of each interactor with distinct phenotypic outcomes; subcellular fractionation\",\n      \"pmids\": [\"18508857\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Structural basis of FGE interaction with each ER chaperone not determined\", \"How the balance between retention and secretion is physiologically regulated is unknown\", \"Whether MSD mutations alter these interactions not tested\"]\n    },\n    {\n      \"year\": 2008,\n      \"claim\": \"Dissecting multiple MSD mutations revealed that both reduced specific enzymatic activity and decreased protein stability independently contribute to disease, resolving why mutations with similar catalytic impairment can produce different clinical severities.\",\n      \"evidence\": \"FGE activity assays, protein stability measurements, and ER localization in MSD patient fibroblasts for multiple SUMF1 missense variants\",\n      \"pmids\": [\"18157819\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Structural mechanism by which individual mutations destabilize FGE not resolved at atomic level\", \"Contribution of ER quality control/degradation to residual FGE levels not quantified\"]\n    },\n    {\n      \"year\": 2011,\n      \"claim\": \"A systematic genotype-phenotype correlation across MSD patients established that the combination of residual FGE activity and protein stability quantitatively predicts clinical severity, from neonatal lethal to mild late-onset forms.\",\n      \"evidence\": \"Multi-parameter functional analysis (FGE expression, stability, activity, sulfatase activities) across 10 MSD patients with clinical correlation\",\n      \"pmids\": [\"21224894\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Modifier genes or environmental factors influencing MSD severity not addressed\", \"Whether pharmacological stabilization of mutant FGE could be therapeutic remains untested\"]\n    },\n    {\n      \"year\": 2019,\n      \"claim\": \"The copper dependence of FGE catalysis was established, placing it in the class of copper-dependent oxidases and enabling biotechnological exploitation of formylglycine as a site-specific bioconjugation handle.\",\n      \"evidence\": \"In vitro and mammalian cell reconstitution of copper-dependent FGE-mediated fGly generation; application to antibody-drug conjugate production\",\n      \"pmids\": [\"31161504\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"How copper loading of FGE is regulated in the ER lumen not determined\", \"Whether copper availability limits FGE activity in disease states unknown\"]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"Key unresolved questions include the structural basis for FGE substrate recognition across diverse sulfatases, how copper is delivered to FGE in the ER, the function of paralog SUMF2 and its relationship to SUMF1, and whether pharmacological stabilization of mutant FGE proteins is a viable therapeutic strategy for MSD.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"Low\",\n      \"gaps\": [\"No high-resolution structure of FGE-sulfatase complex\", \"SUMF2 function remains uncharacterized\", \"No pharmacological chaperone strategy validated for MSD\"]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0140096\", \"supporting_discovery_ids\": [0, 1, 8]},\n      {\"term_id\": \"GO:0016491\", \"supporting_discovery_ids\": [0, 8]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005783\", \"supporting_discovery_ids\": [4, 5, 6, 10]},\n      {\"term_id\": \"GO:0005576\", \"supporting_discovery_ids\": [5]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-392499\", \"supporting_discovery_ids\": [0, 1, 8]},\n      {\"term_id\": \"R-HSA-9609507\", \"supporting_discovery_ids\": [5]}\n    ],\n    \"complexes\": [],\n    \"partners\": [\"PDI\", \"ERGIC-53\", \"ERp44\"],\n    \"other_free_text\": []\n  }\n}\n```"}