{"gene":"SUMF1","run_date":"2026-06-10T10:51:54","timeline":{"discoveries":[{"year":2003,"finding":"SUMF1 encodes the Cα-formylglycine (FGly)-generating enzyme (FGE) that post-translationally converts a conserved cysteine residue to FGly in the active sites of all sulfatases, and this modification is essential for sulfatase catalytic activity. SUMF1 defines a new gene family conserved from prokaryotes to eukaryotes, with vertebrate genomes also containing a paralog SUMF2.","method":"Gene family bioinformatic analysis combined with established biochemistry of FGly modification; functional context from identification of the MSD disease gene","journal":"Gene","confidence":"High","confidence_rationale":"Tier 1 / Strong — foundational identification of the enzymatic function, independently replicated across multiple labs in subsequent papers","pmids":["14563551"],"is_preprint":false},{"year":2004,"finding":"SUMF1 strongly enhances the activity of sulfatases when co-expressed in cells. Eleven missense mutations (p.R349W, p.R224W, p.L20F, p.A348P, p.S155P, p.C218Y, p.N259I, p.A279V, p.R349Q, p.C336R, p.A177P) resulted in severely impaired sulfatase-enhancing activity, while two mutations (p.R345C, p.P266L) showed high residual activity on some but not all sulfatases tested.","method":"Expression of SUMF1 missense mutants in COS-7 cells with functional sulfatase activity assays","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 / Strong — functional cell-based assays across 11 mutations in 20 patients, replicated in subsequent studies","pmids":["15146462"],"is_preprint":false},{"year":2007,"finding":"SUMF1 exhibits an enhancing effect on sulfatase activity when co-expressed with sulfatases in multiple disease cell types (MLD, CDPX, MPS II, IIIA, and VI). In vivo, AAV-mediated co-delivery of SUMF1 with sulfamidase in muscle of MPS-IIIA mice resulted in more efficient rescue of the phenotype compared to sulfamidase alone.","method":"AAV and lentiviral vector co-delivery in patient-derived cells and mouse models; sulfatase activity assays; GAG/sulfolipid clearance measurements","journal":"The Biochemical journal","confidence":"High","confidence_rationale":"Tier 2 / Strong — multiple orthogonal methods (viral delivery, enzymatic assays, in vivo mouse rescue) across five different sulfatase deficiency diseases","pmids":["17206939"],"is_preprint":false},{"year":2007,"finding":"Co-delivery of SUMF1 and SGSH via AAV2/5 to the CNS results in a synergistic increase in SGSH (sulfamidase) activity in primary neural cells and murine brain, demonstrating SUMF1's enhancing function on sulfatase activity in the central nervous system.","method":"AAV2/5-CMV-SGSH-IRES-SUMF1 vector delivery in primary neural cells and intraventricular injection in neonatal MPS-IIIA mice; SGSH activity assays; lysosomal storage and inflammatory marker analysis","journal":"Human molecular genetics","confidence":"High","confidence_rationale":"Tier 2 / Strong — in vivo and in vitro experiments with functional readouts, replicated across multiple brain regions","pmids":["17725987"],"is_preprint":false},{"year":2007,"finding":"MSD is caused by hypomorphic SUMF1 mutations: four FGE mutants (p.S155P, p.R224W, p.R345C, p.R349W) expressed in Sumf1 KO mouse embryonic fibroblasts produced stable proteins of appropriate molecular weight that localized correctly to the ER, and each partially rescued sulfatase activities, indicating residual enzymatic function.","method":"Viral-mediated over-expression of SUMF1 mutants in Sumf1-/- MEFs; Western blot for protein stability; ER localization; sulfatase activity assays","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 / Strong — clean KO rescue experiment with four mutants, multiple orthogonal methods (localization, stability, activity)","pmids":["17657823"],"is_preprint":false},{"year":2008,"finding":"SUMF1 is largely retained in the endoplasmic reticulum (ER) despite lacking canonical retention/retrieval signals, where it activates nascent sulfatases. SUMF1 physically interacts with PDI, ERp44, and ERGIC-53. PDI couples SUMF1 retention and activation in the ER; ERGIC-53 favors SUMF1 export from the ER (silencing ERGIC-53 causes proteasomal degradation of SUMF1); ERp44 retrieves SUMF1 to the ER (down-regulating ERp44 promotes SUMF1 secretion). Part of SUMF1 is secreted and paracrinally taken up by distant cells.","method":"Co-immunoprecipitation; RNAi silencing; functional secretion assays; subcellular fractionation and localization; proteasome inhibition experiments","journal":"Human molecular genetics","confidence":"High","confidence_rationale":"Tier 2 / Strong — reciprocal Co-IP, multiple RNAi knockdowns with defined functional consequences, multiple orthogonal methods in one study","pmids":["18508857"],"is_preprint":false},{"year":2008,"finding":"SUMF1 missense mutations p.A177P, p.W179S, p.A279V, and p.R349W do not affect ER localization of FGE in MSD fibroblasts but differentially affect enzymatic activity (p.A177P and p.R349W: <1%; p.W179S: ~3%; p.A279V: ~23%) and protein stability. Both residual enzyme activity and protein stability of FGE mutants contribute to MSD clinical phenotype.","method":"Subcellular localization studies in MSD patient fibroblasts; FGE-specific enzymatic activity assays; Western blot for protein stability assessment","journal":"Human mutation","confidence":"High","confidence_rationale":"Tier 2 / Moderate — direct functional characterization of four mutants with enzymatic assays, localization, and stability in patient cells, single lab with multiple orthogonal methods","pmids":["18157819"],"is_preprint":false},{"year":2011,"finding":"Both residual FGE enzymatic activity and FGE protein stability are required for MSD phenotypic outcome. Nonsense mutations causing near-total loss of FGE activity and highly unstable protein produce severe neonatal phenotype; missense mutation G263V causes unstable protein but high residual FGE activity and a mild phenotype.","method":"FGE expression analysis, localization, stability (Western blot), and enzymatic activity assays in patient-derived cells from 10 MSD patients","journal":"European journal of human genetics : EJHG","confidence":"High","confidence_rationale":"Tier 2 / Strong — systematic characterization of 10 patients with multiple orthogonal methods, replicated across multiple studies","pmids":["21224894"],"is_preprint":false},{"year":2019,"finding":"FGE (SUMF1-encoded protein) is a copper-dependent post-translational modifier that generates formylglycine (fGly) in the active sites of sulfatases. The fGly aldehyde can serve as a reactive handle for chemical bioconjugation when a sulfatase-consensus sequence is engineered into target proteins.","method":"Site-specific labeling assay using FGE in mammalian expression systems; antibody-drug conjugate production","journal":"Methods in molecular biology (Clifton, N.J.)","confidence":"Medium","confidence_rationale":"Tier 2 / Weak — confirms copper-dependency and aldehyde chemistry of FGE in a methods paper, single lab application","pmids":["31161504"],"is_preprint":false},{"year":2017,"finding":"Novel SUMF1 variant FGE-E113K correctly localizes to the ER but is retained intracellularly (unlike wild-type FGE) and exhibits only ~15% of wild-type FGE activity in cell culture assays activating steroid sulfatase.","method":"Cell culture expression of FGE-E113K; ER localization by immunofluorescence; FGE activity assay on steroid sulfatase in immortalized MSD cells; crystal structure-based structural modeling","journal":"Molecular genetics and metabolism","confidence":"Medium","confidence_rationale":"Tier 2 / Weak — direct functional and localization assays in cell model, single lab with two orthogonal methods","pmids":["28566233"],"is_preprint":false},{"year":2020,"finding":"Novel SUMF1 variant A348V produces a formylglycine-generating enzyme (FGE) that is highly unstable and lacks catalytic function, resulting in the most severe (neonatal) MSD phenotype.","method":"Expression of SUMF1-A348V in cell culture; protein stability and catalytic activity assays","journal":"Molecular genetics & genomic medicine","confidence":"Medium","confidence_rationale":"Tier 2 / Weak — functional characterization in cell model, single lab","pmids":["32048457"],"is_preprint":false},{"year":2022,"finding":"Knock-in mice carrying patient-derived hypomorphic SUMF1 variants p.Ser153Pro and p.Ala277Val show partial reduction of sulfatase activities, confirming these variants produce residual but reduced FGE function in vivo.","method":"Genetic engineering of knock-in mouse lines; biochemical measurement of sulfatase activities; histopathological analysis","journal":"Journal of inherited metabolic disease","confidence":"Medium","confidence_rationale":"Tier 2 / Moderate — in vivo knock-in model with biochemical validation of residual activity, direct functional consequence demonstrated","pmids":["36433920"],"is_preprint":false},{"year":2025,"finding":"In zebrafish, sumf1 and sumf2 encode positive and negative regulators of sulfatase activity respectively; their expression levels invert at gastrulation onset. Overexpressing sumf1 delays convergence and extension (C&E) onset, while loss of sumf1 function causes precocious C&E. The extracellular sulfatase Sulf1 (which modifies heparan sulfate proteoglycans) is identified as the key effector downstream of sumf1/sumf2 in timing C&E morphogenesis.","method":"Zebrafish embryonic explants; mRNA overexpression; morpholino/genetic loss-of-function; C&E timing assays; HSPG sulfation manipulation","journal":"bioRxiv","confidence":"Medium","confidence_rationale":"Tier 2 / Moderate — multiple orthogonal genetic approaches in zebrafish with defined morphogenetic readout, but preprint not yet peer-reviewed","pmids":["bio_10.1101_2025.10.09.681375"],"is_preprint":true}],"current_model":"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent post-translational modifier that resides in the endoplasmic reticulum (ER) and converts a conserved cysteine to formylglycine in the catalytic sites of all cellular sulfatases; FGE activity is regulated by sequential interactions with PDI (which couples ER retention to activation), ERGIC-53 (which facilitates ER export), and ERp44 (which retrieves secreted FGE back to the ER), while the degree of residual FGE enzymatic activity combined with mutant protein stability determines the severity of multiple sulfatase deficiency."},"narrative":{"mechanistic_narrative":"SUMF1 encodes the formylglycine-generating enzyme (FGE), a post-translational modifier that converts a conserved active-site cysteine to Cα-formylglycine (FGly) in all cellular sulfatases, a modification essential for sulfatase catalytic activity, and which defines a gene family conserved from prokaryotes to eukaryotes [PMID:14563551]. Co-expression of SUMF1 strongly enhances sulfatase activity across diverse sulfatase-deficiency contexts, and co-delivery of SUMF1 with individual sulfatases synergistically boosts enzyme activity and corrects disease phenotypes in cell and mouse models, including in the CNS [PMID:15146462, PMID:17206939, PMID:17725987]. FGE acts as a copper-dependent enzyme that installs an FGly aldehyde at a sulfatase consensus motif [PMID:31161504]. Although it lacks canonical ER retention signals, FGE is largely retained in the ER where it activates nascent sulfatases, with localization and trafficking controlled by sequential physical interactions: PDI couples ER retention to activation, ERGIC-53 promotes ER export (its loss triggers proteasomal degradation of FGE), and ERp44 retrieves secreted FGE to the ER; a fraction of FGE is secreted and taken up paracrinally by distant cells [PMID:18508857]. Loss of FGE function causes multiple sulfatase deficiency (MSD): hypomorphic SUMF1 mutations produce ER-localized FGE with variable residual activity, and the combination of residual enzymatic activity and mutant protein stability determines clinical severity, ranging from severe neonatal to milder forms [PMID:17657823, PMID:18157819, PMID:21224894]. Beyond its conserved enzymatic role, sumf1 acts as a developmental timer of convergence-extension morphogenesis in zebrafish through the extracellular heparan-sulfate sulfatase Sulf1 [PMID:bio_10.1101_2025.10.09.681375].","teleology":[{"year":2003,"claim":"Established the molecular identity and enzymatic function of SUMF1, answering what gene product is responsible for the universal FGly modification required for sulfatase activity.","evidence":"Gene family bioinformatic analysis combined with biochemistry of FGly modification and MSD disease gene identification","pmids":["14563551"],"confidence":"High","gaps":["Catalytic mechanism and cofactor requirement not yet defined","Subcellular site of modification not yet established"]},{"year":2004,"claim":"Demonstrated that SUMF1 enhances sulfatase activity in cells and that disease-associated missense mutations differentially impair this enhancing function, linking specific residues to enzyme competence.","evidence":"Expression of 11 SUMF1 missense mutants in COS-7 cells with sulfatase activity assays across 20 patients","pmids":["15146462"],"confidence":"High","gaps":["Did not distinguish loss of stability from loss of catalysis","No structural basis for mutation effects"]},{"year":2007,"claim":"Showed that SUMF1 co-delivery synergistically enhances multiple distinct sulfatases in patient cells and rescues lysosomal storage phenotypes in vivo, including the CNS, establishing therapeutic relevance of co-expression.","evidence":"AAV/lentiviral co-delivery of SUMF1 with sulfatases in patient cells and MPS-IIIA mouse muscle and brain; GAG/sulfolipid clearance and activity assays","pmids":["17206939","17725987"],"confidence":"High","gaps":["Mechanism of synergy at the molecular level not dissected","Optimal stoichiometry of FGE to sulfatase undefined"]},{"year":2007,"claim":"Defined MSD as a hypomorphic disorder by showing disease mutants retain correct ER localization and partial activity, reframing severity as a function of residual function rather than complete loss.","evidence":"Viral overexpression of four SUMF1 mutants in Sumf1-/- MEFs with stability, localization, and rescue assays","pmids":["17657823"],"confidence":"High","gaps":["Did not separate contributions of stability versus intrinsic catalytic defect for all mutants"]},{"year":2008,"claim":"Resolved how FGE is retained in the ER despite lacking canonical signals, identifying a tripartite interactome (PDI, ERGIC-53, ERp44) that couples retention, export, retrieval, and paracrine secretion.","evidence":"Reciprocal Co-IP, RNAi silencing, secretion and proteasome inhibition assays, subcellular fractionation","pmids":["18508857"],"confidence":"High","gaps":["Functional consequence of paracrine FGE uptake for sulfatase activation in recipient cells not quantified","Sequence determinants of each interaction not mapped"]},{"year":2008,"claim":"Established that MSD clinical phenotype reflects both residual enzymatic activity and protein stability, by showing localization-competent mutants vary independently in activity and stability.","evidence":"Localization, FGE-specific activity, and stability assays of four mutants in MSD patient fibroblasts","pmids":["18157819","21224894"],"confidence":"High","gaps":["Quantitative genotype-phenotype prediction model not derived"]},{"year":2019,"claim":"Confirmed FGE is a copper-dependent enzyme and that the FGly aldehyde it generates can be exploited as a bioconjugation handle, extending mechanistic understanding to cofactor dependence and reactive chemistry.","evidence":"Site-specific labeling assay in mammalian expression systems for antibody-drug conjugate production","pmids":["31161504"],"confidence":"Medium","gaps":["Role of copper in cellular FGE activation not addressed in vivo","Methods-focused, not a mechanistic structural study"]},{"year":2020,"claim":"Extended the activity-stability model with additional variants, including a highly unstable, catalytically dead mutant producing the most severe neonatal phenotype and a partly-retained variant with reduced activity.","evidence":"Cell-culture expression, localization, stability, and activity assays of FGE-E113K and A348V variants","pmids":["28566233","32048457"],"confidence":"Medium","gaps":["Single-lab variant characterizations without in vivo confirmation","Structural rationale only partially modeled"]},{"year":2022,"claim":"Validated the residual-function model in vivo by generating patient-variant knock-in mice that show partial reduction of sulfatase activities, confirming hypomorphic behavior of specific alleles.","evidence":"Knock-in mouse lines with biochemical sulfatase activity measurement and histopathology","pmids":["36433920"],"confidence":"Medium","gaps":["Tissue-specific differences in residual activity not fully mapped","Genotype-phenotype correlation across alleles incomplete"]},{"year":2025,"claim":"Revealed a developmental role beyond housekeeping sulfatase activation, with sumf1/sumf2 acting as opposing temporal regulators of convergence-extension morphogenesis via the extracellular sulfatase Sulf1.","evidence":"Zebrafish explants, mRNA overexpression, loss-of-function, and HSPG sulfation manipulation (preprint)","pmids":["bio_10.1101_2025.10.09.681375"],"confidence":"Medium","gaps":["Preprint not yet peer-reviewed","Whether this developmental timing role is conserved in mammals unknown","Molecular basis of sumf1/sumf2 expression inversion not defined"]},{"year":null,"claim":"The structural and catalytic determinants linking specific copper-coordination and FGE-substrate recognition to graded residual activity in MSD remain incompletely defined.","evidence":"","pmids":[],"confidence":"Medium","gaps":["No high-resolution mechanism connecting cofactor handling to mutant activity loss","Predictive activity-stability-to-severity framework not established"]}],"mechanism_profile":{"molecular_activity":[{"term_id":"GO:0140096","term_label":"catalytic activity, acting on a protein","supporting_discovery_ids":[0,1,8]},{"term_id":"GO:0016491","term_label":"oxidoreductase activity","supporting_discovery_ids":[8]}],"localization":[{"term_id":"GO:0005783","term_label":"endoplasmic reticulum","supporting_discovery_ids":[4,5,6]}],"pathway":[{"term_id":"R-HSA-392499","term_label":"Metabolism of proteins","supporting_discovery_ids":[0,5]},{"term_id":"R-HSA-1643685","term_label":"Disease","supporting_discovery_ids":[4,7]}],"complexes":[],"partners":["PDI","ERGIC-53","ERP44"],"other_free_text":[]}},"prefetch_data":{"uniprot":{"accession":"Q8NBK3","full_name":"Formylglycine-generating enzyme","aliases":["C-alpha-formylglycine-generating enzyme 1","Sulfatase-modifying factor 1"],"length_aa":374,"mass_kda":40.6,"function":"Oxidase that catalyzes the conversion of cysteine to 3-oxoalanine on target proteins, using molecular oxygen and an unidentified reducing agent (PubMed:12757706, PubMed:15657036, PubMed:15907468, PubMed:16368756, PubMed:21224894, PubMed:25931126). 3-oxoalanine modification, which is also named formylglycine (fGly), occurs in the maturation of arylsulfatases and some alkaline phosphatases that use the hydrated form of 3-oxoalanine as a catalytic nucleophile (PubMed:12757706, PubMed:15657036, PubMed:15907468, PubMed:16368756, PubMed:25931126). Known substrates include GALNS, ARSA, STS and ARSE (PubMed:12757706, PubMed:15657036, PubMed:15907468)","subcellular_location":"Endoplasmic reticulum lumen","url":"https://www.uniprot.org/uniprotkb/Q8NBK3/entry"},"depmap":{"release":"DepMap","has_data":true,"is_common_essential":false,"resolved_as":"","url":"https://depmap.org/portal/gene/SUMF1","classification":"Not Classified","n_dependent_lines":0,"n_total_lines":1208,"dependency_fraction":0.0},"opencell":{"profiled":false,"resolved_as":"","ensg_id":"","cell_line_id":"","localizations":[],"interactors":[],"url":"https://opencell.sf.czbiohub.org/search/SUMF1","total_profiled":1310},"omim":[{"mim_id":"607940","title":"SULFATASE-MODIFYING FACTOR 2; SUMF2","url":"https://www.omim.org/entry/607940"},{"mim_id":"607939","title":"SULFATASE-MODIFYING FACTOR 1; SUMF1","url":"https://www.omim.org/entry/607939"},{"mim_id":"607280","title":"CONTACTIN 4; CNTN4","url":"https://www.omim.org/entry/607280"},{"mim_id":"606658","title":"SPINOCEREBELLAR ATAXIA 15; SCA15","url":"https://www.omim.org/entry/606658"},{"mim_id":"272200","title":"MULTIPLE SULFATASE DEFICIENCY; MSD","url":"https://www.omim.org/entry/272200"}],"hpa":{"profiled":true,"resolved_as":"","reliability":"","locations":[],"tissue_specificity":"Low tissue specificity","tissue_distribution":"Detected in all","driving_tissues":[],"url":"https://www.proteinatlas.org/search/SUMF1"},"hgnc":{"alias_symbol":["FGE","UNQ3037"],"prev_symbol":[]},"alphafold":{"accession":"Q8NBK3","domains":[{"cath_id":"3.90.1580.10","chopping":"90-147_208-368","consensus_level":"high","plddt":98.2349,"start":90,"end":368}],"viewer_url":"https://alphafold.ebi.ac.uk/entry/Q8NBK3","model_url":"https://alphafold.ebi.ac.uk/files/AF-Q8NBK3-F1-model_v6.cif","pae_url":"https://alphafold.ebi.ac.uk/files/AF-Q8NBK3-F1-predicted_aligned_error_v6.png","plddt_mean":83.56},"mouse_models":{"mgi_url":"https://www.informatics.jax.org/marker/summary?nomen=SUMF1","jax_strain_url":"https://www.jax.org/strain/search?query=SUMF1"},"sequence":{"accession":"Q8NBK3","fasta_url":"https://rest.uniprot.org/uniprotkb/Q8NBK3.fasta","uniprot_url":"https://www.uniprot.org/uniprotkb/Q8NBK3/entry","alphafold_viewer_url":"https://alphafold.ebi.ac.uk/entry/Q8NBK3"}},"corpus_meta":[{"pmid":"24524415","id":"PMC_24524415","title":"Intracerebral administration of adeno-associated viral vector serotype rh.10 carrying human SGSH and SUMF1 cDNAs in children with mucopolysaccharidosis type IIIA disease: results of a phase I/II trial.","date":"2014","source":"Human gene therapy","url":"https://pubmed.ncbi.nlm.nih.gov/24524415","citation_count":208,"is_preprint":false},{"pmid":"17725987","id":"PMC_17725987","title":"Functional correction of CNS lesions in an MPS-IIIA mouse model by intracerebral AAV-mediated delivery of sulfamidase and SUMF1 genes.","date":"2007","source":"Human molecular genetics","url":"https://pubmed.ncbi.nlm.nih.gov/17725987","citation_count":104,"is_preprint":false},{"pmid":"17206939","id":"PMC_17206939","title":"SUMF1 enhances sulfatase activities in vivo in five sulfatase deficiencies.","date":"2007","source":"The Biochemical journal","url":"https://pubmed.ncbi.nlm.nih.gov/17206939","citation_count":66,"is_preprint":false},{"pmid":"14563551","id":"PMC_14563551","title":"The human SUMF1 gene, required for posttranslational sulfatase modification, defines a new gene family which is conserved from pro- to eukaryotes.","date":"2003","source":"Gene","url":"https://pubmed.ncbi.nlm.nih.gov/14563551","citation_count":59,"is_preprint":false},{"pmid":"18508857","id":"PMC_18508857","title":"Multistep, sequential control of the trafficking and function of the multiple sulfatase deficiency gene product, SUMF1 by PDI, ERGIC-53 and ERp44.","date":"2008","source":"Human molecular genetics","url":"https://pubmed.ncbi.nlm.nih.gov/18508857","citation_count":59,"is_preprint":false},{"pmid":"15146462","id":"PMC_15146462","title":"Molecular and functional analysis of SUMF1 mutations in multiple sulfatase deficiency.","date":"2004","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/15146462","citation_count":59,"is_preprint":false},{"pmid":"21224894","id":"PMC_21224894","title":"SUMF1 mutations affecting stability and activity of formylglycine generating enzyme predict clinical outcome in multiple sulfatase deficiency.","date":"2011","source":"European journal of human genetics : EJHG","url":"https://pubmed.ncbi.nlm.nih.gov/21224894","citation_count":52,"is_preprint":false},{"pmid":"17657823","id":"PMC_17657823","title":"Multiple sulfatase deficiency is due to hypomorphic mutations of the SUMF1 gene.","date":"2007","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/17657823","citation_count":35,"is_preprint":false},{"pmid":"18157819","id":"PMC_18157819","title":"Molecular analysis of SUMF1 mutations: stability and residual activity of mutant formylglycine-generating enzyme determine disease severity in multiple sulfatase deficiency.","date":"2008","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/18157819","citation_count":35,"is_preprint":false},{"pmid":"25885655","id":"PMC_25885655","title":"Natural disease history and characterisation of SUMF1 molecular defects in ten unrelated patients with multiple sulfatase deficiency.","date":"2015","source":"Orphanet journal of rare diseases","url":"https://pubmed.ncbi.nlm.nih.gov/25885655","citation_count":29,"is_preprint":false},{"pmid":"18989752","id":"PMC_18989752","title":"Effect of elongation factor 1alpha promoter and SUMF1 over in vitro expression of N-acetylgalactosamine-6-sulfate sulfatase.","date":"2008","source":"Molecular biology reports","url":"https://pubmed.ncbi.nlm.nih.gov/18989752","citation_count":26,"is_preprint":false},{"pmid":"28464818","id":"PMC_28464818","title":"Sulfatase modifying factor 1 (SUMF1) is associated with Chronic Obstructive Pulmonary Disease.","date":"2017","source":"Respiratory research","url":"https://pubmed.ncbi.nlm.nih.gov/28464818","citation_count":10,"is_preprint":false},{"pmid":"32625708","id":"PMC_32625708","title":"Scientific Opinion on Flavouring Group Evaluation 201 Revision 2 (FGE.201Rev2): 2-alkylated, aliphatic, acyclic alpha,beta-unsaturated aldehydes and precursors, with or without additional double-bonds, from chemical subgroup 1.1.2 of FGE.19.","date":"2018","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625708","citation_count":9,"is_preprint":false},{"pmid":"32626305","id":"PMC_32626305","title":"Scientific Opinion on Flavouring Group Evaluation 210 Revision 3 (FGE.210Rev3): Consideration of genotoxic potential for α,β-unsaturated alicyclic ketones and precursors from chemical subgroup 2.4 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626305","citation_count":9,"is_preprint":false},{"pmid":"28566233","id":"PMC_28566233","title":"Expanding the genetic cause of multiple sulfatase deficiency: A novel SUMF1 variant in a patient displaying a severe late infantile form of the disease.","date":"2017","source":"Molecular genetics and metabolism","url":"https://pubmed.ncbi.nlm.nih.gov/28566233","citation_count":9,"is_preprint":false},{"pmid":"30598261","id":"PMC_30598261","title":"Integrative expression analysis identifies a novel interplay between CFTR and linc-SUMF1-2 that involves CF-associated gene dysregulation.","date":"2018","source":"Biochemical and biophysical research communications","url":"https://pubmed.ncbi.nlm.nih.gov/30598261","citation_count":8,"is_preprint":false},{"pmid":"32625963","id":"PMC_32625963","title":"Scientific Opinion on Flavouring Group Evaluation 203, Revision 2 (FGE.203Rev2): α,β-unsaturated aliphatic aldehydes and precursors from chemical subgroup 1.1.4 of FGE.19 with two or more conjugated double-bonds and with or without additional non-conjugated double-bonds.","date":"2018","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625963","citation_count":8,"is_preprint":false},{"pmid":"32625501","id":"PMC_32625501","title":"Scientific Opinion on Flavouring Group Evaluation 226 Revision 1 (FGE.226Rev1): consideration of genotoxicity data on one α,β-unsaturated aldehyde from chemical subgroup 1.1.1(b) of FGE.19.","date":"2017","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625501","citation_count":7,"is_preprint":false},{"pmid":"36433920","id":"PMC_36433920","title":"New mouse models with hypomorphic SUMF1 variants mimic attenuated forms of multiple sulfatase deficiency.","date":"2022","source":"Journal of inherited metabolic disease","url":"https://pubmed.ncbi.nlm.nih.gov/36433920","citation_count":6,"is_preprint":false},{"pmid":"36794062","id":"PMC_36794062","title":"Flavouring Group Evaluation 21 Revision 6 (FGE.21Rev6): thiazoles, thiophenes, thiazoline and thienyl derivatives from chemical groups 29 and 30.","date":"2023","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/36794062","citation_count":6,"is_preprint":false},{"pmid":"32645339","id":"PMC_32645339","title":"Detection of 2,4-dichlorophenoxyacetic acid herbicide using a FGE-sulfatase based whole-cell Agrobacterium biosensor.","date":"2020","source":"Journal of microbiological methods","url":"https://pubmed.ncbi.nlm.nih.gov/32645339","citation_count":6,"is_preprint":false},{"pmid":"32625330","id":"PMC_32625330","title":"Scientific Opinion of Flavouring Group Evaluation 410 (FGE.410): 4',5,7-trihydroxyflavanone from chemical group 25 (phenol derivatives containing ring-alkyl, ring-alkoxy, and side-chains with an oxygenated functional group).","date":"2017","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32625330","citation_count":6,"is_preprint":false},{"pmid":"25222778","id":"PMC_25222778","title":"Molecular evaluation of a novel missense mutation & an insertional truncating mutation in SUMF1 gene.","date":"2014","source":"The Indian journal of medical research","url":"https://pubmed.ncbi.nlm.nih.gov/25222778","citation_count":5,"is_preprint":false},{"pmid":"32626370","id":"PMC_32626370","title":"Scientific Opinion on Flavouring Group Evaluation 204 Revision 1 (FGE.204Rev1): consideration of genotoxicity data on representatives for 17 monounsaturated, aliphatic, α,β-unsaturated ketones and precursors from chemical subgroup 1.2.1 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626370","citation_count":5,"is_preprint":false},{"pmid":"32626109","id":"PMC_32626109","title":"Scientific Opinion on Flavouring Group Evaluation 208 Revision 3 (FGE.208Rev3): consideration of genotoxicity data on alicyclic aldehydes with α,β-unsaturation in ring/side-chain and precursors from chemical subgroup 2.2 of FGE.19.","date":"2019","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/32626109","citation_count":5,"is_preprint":false},{"pmid":"36959582","id":"PMC_36959582","title":"Late infantile form of multiple sulfatase deficiency with a novel missense variant in the SUMF1 gene: case report and review.","date":"2023","source":"BMC pediatrics","url":"https://pubmed.ncbi.nlm.nih.gov/36959582","citation_count":4,"is_preprint":false},{"pmid":"31161504","id":"PMC_31161504","title":"Site-Specific Labeling of Proteins Using the Formylglycine-Generating Enzyme (FGE).","date":"2019","source":"Methods in molecular biology (Clifton, N.J.)","url":"https://pubmed.ncbi.nlm.nih.gov/31161504","citation_count":4,"is_preprint":false},{"pmid":"38711805","id":"PMC_38711805","title":"Flavouring group evaluation 419 (FGE.419): 2-methyl-1-(2-(5-(p-tolyl)-1H-imidazol-2-yl)piperidin-1-yl)butan-1-one.","date":"2024","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/38711805","citation_count":3,"is_preprint":false},{"pmid":"32048457","id":"PMC_32048457","title":"A homozygous missense variant of SUMF1 in the Bedouin population extends the clinical spectrum in ultrarare neonatal multiple sulfatase deficiency.","date":"2020","source":"Molecular genetics & genomic medicine","url":"https://pubmed.ncbi.nlm.nih.gov/32048457","citation_count":3,"is_preprint":false},{"pmid":"37089172","id":"PMC_37089172","title":"Flavouring Group Evaluation 217 Revision 3 (FGE.217Rev3): consideration of genotoxic potential for α,β-unsaturated ketones and precursors from chemical subgroup 4.1 of FGE.19: lactones.","date":"2023","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/37089172","citation_count":3,"is_preprint":false},{"pmid":"35991962","id":"PMC_35991962","title":"Scientific opinion on Flavouring group evaluation 216 revision 2 (FGE.216Rev2): consideration of the genotoxicity potential of α,β-unsaturated 2-phenyl-2-alkenals from subgroup 3.3 of FGE.19.","date":"2022","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/35991962","citation_count":3,"is_preprint":false},{"pmid":"38863195","id":"PMC_38863195","title":"Non-syndromic retinal dystrophy associated with biallelic variation of SUMF1 and reduced leukocyte sulfatase activity.","date":"2024","source":"Clinical genetics","url":"https://pubmed.ncbi.nlm.nih.gov/38863195","citation_count":2,"is_preprint":false},{"pmid":"37344788","id":"PMC_37344788","title":"Association between SUMF1 polymorphisms and COVID-19 severity.","date":"2023","source":"BMC genomic data","url":"https://pubmed.ncbi.nlm.nih.gov/37344788","citation_count":1,"is_preprint":false},{"pmid":"38460946","id":"PMC_38460946","title":"SUMF1 overexpression promotes tumorous cell growth and migration and is correlated with the immune status of patients with glioma.","date":"2024","source":"Aging","url":"https://pubmed.ncbi.nlm.nih.gov/38460946","citation_count":1,"is_preprint":false},{"pmid":"36441600","id":"PMC_36441600","title":"Genetic analysis of a novel SUMF1 variation associated with a late infantile form of multiple sulfatase deficiency.","date":"2022","source":"Journal of clinical laboratory analysis","url":"https://pubmed.ncbi.nlm.nih.gov/36441600","citation_count":1,"is_preprint":false},{"pmid":"40772760","id":"PMC_40772760","title":"Structure of the T9SS PorKN ring complex reveals conformational plasticity based on the repurposed FGE fold.","date":"2025","source":"mBio","url":"https://pubmed.ncbi.nlm.nih.gov/40772760","citation_count":1,"is_preprint":false},{"pmid":"39870870","id":"PMC_39870870","title":"Preclinical use of a clinically-relevant scAAV9/SUMF1 vector for the treatment of multiple sulfatase deficiency.","date":"2025","source":"Communications medicine","url":"https://pubmed.ncbi.nlm.nih.gov/39870870","citation_count":1,"is_preprint":false},{"pmid":"35814922","id":"PMC_35814922","title":"Scientific opinion on flavouring group evaluation 415 (FGE.415): (E)-3-benzo[1,3]dioxol-5-yl-N,N-diphenyl-2-propenamide.","date":"2022","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/35814922","citation_count":1,"is_preprint":false},{"pmid":"41155512","id":"PMC_41155512","title":"SUMF1 Common Variant rs793391 Is Associated with Response to Inhaled Corticosteroids in Patients with COPD.","date":"2025","source":"International journal of molecular sciences","url":"https://pubmed.ncbi.nlm.nih.gov/41155512","citation_count":0,"is_preprint":false},{"pmid":"33795796","id":"PMC_33795796","title":"Family-based gene-environment interaction using sequence kernel association test (FGE-SKAT) for complex quantitative traits.","date":"2021","source":"Scientific reports","url":"https://pubmed.ncbi.nlm.nih.gov/33795796","citation_count":0,"is_preprint":false},{"pmid":"39896350","id":"PMC_39896350","title":"Flavouring group evaluation 418 (FGE. 418): 3-[3-(2-isopropyl-5-methyl-cyclohexyl)-ureido]-butyric acid ethyl ester.","date":"2025","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/39896350","citation_count":0,"is_preprint":false},{"pmid":"42109808","id":"PMC_42109808","title":"Scientific Opinion on Flavouring Group Evaluation 208 Revision 1 (FGE.208Rev1): Consideration of genotoxicity data on representatives for 10 alicyclic aldehydes with the α,β-unsaturation in ring / side-chain and precursors from chemical subgroup 2.2 of FGE.19.","date":"2015","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/42109808","citation_count":0,"is_preprint":false},{"pmid":"42016130","id":"PMC_42016130","title":"Scientific Opinion on Flavouring Group Evaluation 226 (FGE.226): Consideration of genotoxicity data on one α,β-unsaturated aldehyde from chemical subgroup 1.1.1(b) of FGE.19 by EFSA.","date":"2012","source":"EFSA journal. European Food Safety Authority","url":"https://pubmed.ncbi.nlm.nih.gov/42016130","citation_count":0,"is_preprint":false},{"pmid":null,"id":"bio_10.1101_2025.10.09.681375","title":"Sulfatase modifying factors control the timing of zebrafish convergence and extension morphogenesis","date":"2025-10-10","source":"bioRxiv","url":"https://doi.org/10.1101/2025.10.09.681375","citation_count":0,"is_preprint":true}],"cost":{"stage1":{"model":"claude-sonnet-4-6","input_tokens":26757,"output_tokens":3320,"usd":0.065035,"stage1_stop_reason":"end_turn"},"stage2":{"model":"claude-opus-4-8","input_tokens":10804,"output_tokens":3292,"usd":0.06816,"stage2_stop_reason":"end_turn"},"total_usd":0.133195,"stage1_batch_id":"msgbatch_017qqVz33Ga63SCDwzb35XPb","stage2_batch_id":"msgbatch_01Scr2Ei9x4HJMiRGxwrmvLK","note":"batch pricing = 50% of standard"},"stage1_raw":"```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2003,\n      \"finding\": \"SUMF1 encodes the Cα-formylglycine (FGly)-generating enzyme (FGE) that post-translationally converts a conserved cysteine residue to FGly in the active sites of all sulfatases, and this modification is essential for sulfatase catalytic activity. SUMF1 defines a new gene family conserved from prokaryotes to eukaryotes, with vertebrate genomes also containing a paralog SUMF2.\",\n      \"method\": \"Gene family bioinformatic analysis combined with established biochemistry of FGly modification; functional context from identification of the MSD disease gene\",\n      \"journal\": \"Gene\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 / Strong — foundational identification of the enzymatic function, independently replicated across multiple labs in subsequent papers\",\n      \"pmids\": [\"14563551\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2004,\n      \"finding\": \"SUMF1 strongly enhances the activity of sulfatases when co-expressed in cells. Eleven missense mutations (p.R349W, p.R224W, p.L20F, p.A348P, p.S155P, p.C218Y, p.N259I, p.A279V, p.R349Q, p.C336R, p.A177P) resulted in severely impaired sulfatase-enhancing activity, while two mutations (p.R345C, p.P266L) showed high residual activity on some but not all sulfatases tested.\",\n      \"method\": \"Expression of SUMF1 missense mutants in COS-7 cells with functional sulfatase activity assays\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — functional cell-based assays across 11 mutations in 20 patients, replicated in subsequent studies\",\n      \"pmids\": [\"15146462\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"SUMF1 exhibits an enhancing effect on sulfatase activity when co-expressed with sulfatases in multiple disease cell types (MLD, CDPX, MPS II, IIIA, and VI). In vivo, AAV-mediated co-delivery of SUMF1 with sulfamidase in muscle of MPS-IIIA mice resulted in more efficient rescue of the phenotype compared to sulfamidase alone.\",\n      \"method\": \"AAV and lentiviral vector co-delivery in patient-derived cells and mouse models; sulfatase activity assays; GAG/sulfolipid clearance measurements\",\n      \"journal\": \"The Biochemical journal\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — multiple orthogonal methods (viral delivery, enzymatic assays, in vivo mouse rescue) across five different sulfatase deficiency diseases\",\n      \"pmids\": [\"17206939\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"Co-delivery of SUMF1 and SGSH via AAV2/5 to the CNS results in a synergistic increase in SGSH (sulfamidase) activity in primary neural cells and murine brain, demonstrating SUMF1's enhancing function on sulfatase activity in the central nervous system.\",\n      \"method\": \"AAV2/5-CMV-SGSH-IRES-SUMF1 vector delivery in primary neural cells and intraventricular injection in neonatal MPS-IIIA mice; SGSH activity assays; lysosomal storage and inflammatory marker analysis\",\n      \"journal\": \"Human molecular genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — in vivo and in vitro experiments with functional readouts, replicated across multiple brain regions\",\n      \"pmids\": [\"17725987\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"MSD is caused by hypomorphic SUMF1 mutations: four FGE mutants (p.S155P, p.R224W, p.R345C, p.R349W) expressed in Sumf1 KO mouse embryonic fibroblasts produced stable proteins of appropriate molecular weight that localized correctly to the ER, and each partially rescued sulfatase activities, indicating residual enzymatic function.\",\n      \"method\": \"Viral-mediated over-expression of SUMF1 mutants in Sumf1-/- MEFs; Western blot for protein stability; ER localization; sulfatase activity assays\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — clean KO rescue experiment with four mutants, multiple orthogonal methods (localization, stability, activity)\",\n      \"pmids\": [\"17657823\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2008,\n      \"finding\": \"SUMF1 is largely retained in the endoplasmic reticulum (ER) despite lacking canonical retention/retrieval signals, where it activates nascent sulfatases. SUMF1 physically interacts with PDI, ERp44, and ERGIC-53. PDI couples SUMF1 retention and activation in the ER; ERGIC-53 favors SUMF1 export from the ER (silencing ERGIC-53 causes proteasomal degradation of SUMF1); ERp44 retrieves SUMF1 to the ER (down-regulating ERp44 promotes SUMF1 secretion). Part of SUMF1 is secreted and paracrinally taken up by distant cells.\",\n      \"method\": \"Co-immunoprecipitation; RNAi silencing; functional secretion assays; subcellular fractionation and localization; proteasome inhibition experiments\",\n      \"journal\": \"Human molecular genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — reciprocal Co-IP, multiple RNAi knockdowns with defined functional consequences, multiple orthogonal methods in one study\",\n      \"pmids\": [\"18508857\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2008,\n      \"finding\": \"SUMF1 missense mutations p.A177P, p.W179S, p.A279V, and p.R349W do not affect ER localization of FGE in MSD fibroblasts but differentially affect enzymatic activity (p.A177P and p.R349W: <1%; p.W179S: ~3%; p.A279V: ~23%) and protein stability. Both residual enzyme activity and protein stability of FGE mutants contribute to MSD clinical phenotype.\",\n      \"method\": \"Subcellular localization studies in MSD patient fibroblasts; FGE-specific enzymatic activity assays; Western blot for protein stability assessment\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Moderate — direct functional characterization of four mutants with enzymatic assays, localization, and stability in patient cells, single lab with multiple orthogonal methods\",\n      \"pmids\": [\"18157819\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2011,\n      \"finding\": \"Both residual FGE enzymatic activity and FGE protein stability are required for MSD phenotypic outcome. Nonsense mutations causing near-total loss of FGE activity and highly unstable protein produce severe neonatal phenotype; missense mutation G263V causes unstable protein but high residual FGE activity and a mild phenotype.\",\n      \"method\": \"FGE expression analysis, localization, stability (Western blot), and enzymatic activity assays in patient-derived cells from 10 MSD patients\",\n      \"journal\": \"European journal of human genetics : EJHG\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 / Strong — systematic characterization of 10 patients with multiple orthogonal methods, replicated across multiple studies\",\n      \"pmids\": [\"21224894\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2019,\n      \"finding\": \"FGE (SUMF1-encoded protein) is a copper-dependent post-translational modifier that generates formylglycine (fGly) in the active sites of sulfatases. The fGly aldehyde can serve as a reactive handle for chemical bioconjugation when a sulfatase-consensus sequence is engineered into target proteins.\",\n      \"method\": \"Site-specific labeling assay using FGE in mammalian expression systems; antibody-drug conjugate production\",\n      \"journal\": \"Methods in molecular biology (Clifton, N.J.)\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Weak — confirms copper-dependency and aldehyde chemistry of FGE in a methods paper, single lab application\",\n      \"pmids\": [\"31161504\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2017,\n      \"finding\": \"Novel SUMF1 variant FGE-E113K correctly localizes to the ER but is retained intracellularly (unlike wild-type FGE) and exhibits only ~15% of wild-type FGE activity in cell culture assays activating steroid sulfatase.\",\n      \"method\": \"Cell culture expression of FGE-E113K; ER localization by immunofluorescence; FGE activity assay on steroid sulfatase in immortalized MSD cells; crystal structure-based structural modeling\",\n      \"journal\": \"Molecular genetics and metabolism\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Weak — direct functional and localization assays in cell model, single lab with two orthogonal methods\",\n      \"pmids\": [\"28566233\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"Novel SUMF1 variant A348V produces a formylglycine-generating enzyme (FGE) that is highly unstable and lacks catalytic function, resulting in the most severe (neonatal) MSD phenotype.\",\n      \"method\": \"Expression of SUMF1-A348V in cell culture; protein stability and catalytic activity assays\",\n      \"journal\": \"Molecular genetics & genomic medicine\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Weak — functional characterization in cell model, single lab\",\n      \"pmids\": [\"32048457\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2022,\n      \"finding\": \"Knock-in mice carrying patient-derived hypomorphic SUMF1 variants p.Ser153Pro and p.Ala277Val show partial reduction of sulfatase activities, confirming these variants produce residual but reduced FGE function in vivo.\",\n      \"method\": \"Genetic engineering of knock-in mouse lines; biochemical measurement of sulfatase activities; histopathological analysis\",\n      \"journal\": \"Journal of inherited metabolic disease\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Moderate — in vivo knock-in model with biochemical validation of residual activity, direct functional consequence demonstrated\",\n      \"pmids\": [\"36433920\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2025,\n      \"finding\": \"In zebrafish, sumf1 and sumf2 encode positive and negative regulators of sulfatase activity respectively; their expression levels invert at gastrulation onset. Overexpressing sumf1 delays convergence and extension (C&E) onset, while loss of sumf1 function causes precocious C&E. The extracellular sulfatase Sulf1 (which modifies heparan sulfate proteoglycans) is identified as the key effector downstream of sumf1/sumf2 in timing C&E morphogenesis.\",\n      \"method\": \"Zebrafish embryonic explants; mRNA overexpression; morpholino/genetic loss-of-function; C&E timing assays; HSPG sulfation manipulation\",\n      \"journal\": \"bioRxiv\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 / Moderate — multiple orthogonal genetic approaches in zebrafish with defined morphogenetic readout, but preprint not yet peer-reviewed\",\n      \"pmids\": [\"bio_10.1101_2025.10.09.681375\"],\n      \"is_preprint\": true\n    }\n  ],\n  \"current_model\": \"SUMF1 encodes the formylglycine-generating enzyme (FGE), a copper-dependent post-translational modifier that resides in the endoplasmic reticulum (ER) and converts a conserved cysteine to formylglycine in the catalytic sites of all cellular sulfatases; FGE activity is regulated by sequential interactions with PDI (which couples ER retention to activation), ERGIC-53 (which facilitates ER export), and ERp44 (which retrieves secreted FGE back to the ER), while the degree of residual FGE enzymatic activity combined with mutant protein stability determines the severity of multiple sulfatase deficiency.\"\n}\n```","stage2_raw":"{\n  \"mechanistic_narrative\": \"SUMF1 encodes the formylglycine-generating enzyme (FGE), a post-translational modifier that converts a conserved active-site cysteine to Cα-formylglycine (FGly) in all cellular sulfatases, a modification essential for sulfatase catalytic activity, and which defines a gene family conserved from prokaryotes to eukaryotes [#0]. Co-expression of SUMF1 strongly enhances sulfatase activity across diverse sulfatase-deficiency contexts, and co-delivery of SUMF1 with individual sulfatases synergistically boosts enzyme activity and corrects disease phenotypes in cell and mouse models, including in the CNS [#1, #2, #3]. FGE acts as a copper-dependent enzyme that installs an FGly aldehyde at a sulfatase consensus motif [#8]. Although it lacks canonical ER retention signals, FGE is largely retained in the ER where it activates nascent sulfatases, with localization and trafficking controlled by sequential physical interactions: PDI couples ER retention to activation, ERGIC-53 promotes ER export (its loss triggers proteasomal degradation of FGE), and ERp44 retrieves secreted FGE to the ER; a fraction of FGE is secreted and taken up paracrinally by distant cells [#5]. Loss of FGE function causes multiple sulfatase deficiency (MSD): hypomorphic SUMF1 mutations produce ER-localized FGE with variable residual activity, and the combination of residual enzymatic activity and mutant protein stability determines clinical severity, ranging from severe neonatal to milder forms [#4, #6, #7]. Beyond its conserved enzymatic role, sumf1 acts as a developmental timer of convergence-extension morphogenesis in zebrafish through the extracellular heparan-sulfate sulfatase Sulf1 [#12].\",\n  \"teleology\": [\n    {\n      \"year\": 2003,\n      \"claim\": \"Established the molecular identity and enzymatic function of SUMF1, answering what gene product is responsible for the universal FGly modification required for sulfatase activity.\",\n      \"evidence\": \"Gene family bioinformatic analysis combined with biochemistry of FGly modification and MSD disease gene identification\",\n      \"pmids\": [\"14563551\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Catalytic mechanism and cofactor requirement not yet defined\", \"Subcellular site of modification not yet established\"]\n    },\n    {\n      \"year\": 2004,\n      \"claim\": \"Demonstrated that SUMF1 enhances sulfatase activity in cells and that disease-associated missense mutations differentially impair this enhancing function, linking specific residues to enzyme competence.\",\n      \"evidence\": \"Expression of 11 SUMF1 missense mutants in COS-7 cells with sulfatase activity assays across 20 patients\",\n      \"pmids\": [\"15146462\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Did not distinguish loss of stability from loss of catalysis\", \"No structural basis for mutation effects\"]\n    },\n    {\n      \"year\": 2007,\n      \"claim\": \"Showed that SUMF1 co-delivery synergistically enhances multiple distinct sulfatases in patient cells and rescues lysosomal storage phenotypes in vivo, including the CNS, establishing therapeutic relevance of co-expression.\",\n      \"evidence\": \"AAV/lentiviral co-delivery of SUMF1 with sulfatases in patient cells and MPS-IIIA mouse muscle and brain; GAG/sulfolipid clearance and activity assays\",\n      \"pmids\": [\"17206939\", \"17725987\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Mechanism of synergy at the molecular level not dissected\", \"Optimal stoichiometry of FGE to sulfatase undefined\"]\n    },\n    {\n      \"year\": 2007,\n      \"claim\": \"Defined MSD as a hypomorphic disorder by showing disease mutants retain correct ER localization and partial activity, reframing severity as a function of residual function rather than complete loss.\",\n      \"evidence\": \"Viral overexpression of four SUMF1 mutants in Sumf1-/- MEFs with stability, localization, and rescue assays\",\n      \"pmids\": [\"17657823\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Did not separate contributions of stability versus intrinsic catalytic defect for all mutants\"]\n    },\n    {\n      \"year\": 2008,\n      \"claim\": \"Resolved how FGE is retained in the ER despite lacking canonical signals, identifying a tripartite interactome (PDI, ERGIC-53, ERp44) that couples retention, export, retrieval, and paracrine secretion.\",\n      \"evidence\": \"Reciprocal Co-IP, RNAi silencing, secretion and proteasome inhibition assays, subcellular fractionation\",\n      \"pmids\": [\"18508857\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Functional consequence of paracrine FGE uptake for sulfatase activation in recipient cells not quantified\", \"Sequence determinants of each interaction not mapped\"]\n    },\n    {\n      \"year\": 2008,\n      \"claim\": \"Established that MSD clinical phenotype reflects both residual enzymatic activity and protein stability, by showing localization-competent mutants vary independently in activity and stability.\",\n      \"evidence\": \"Localization, FGE-specific activity, and stability assays of four mutants in MSD patient fibroblasts\",\n      \"pmids\": [\"18157819\", \"21224894\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\"Quantitative genotype-phenotype prediction model not derived\"]\n    },\n    {\n      \"year\": 2019,\n      \"claim\": \"Confirmed FGE is a copper-dependent enzyme and that the FGly aldehyde it generates can be exploited as a bioconjugation handle, extending mechanistic understanding to cofactor dependence and reactive chemistry.\",\n      \"evidence\": \"Site-specific labeling assay in mammalian expression systems for antibody-drug conjugate production\",\n      \"pmids\": [\"31161504\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"Role of copper in cellular FGE activation not addressed in vivo\", \"Methods-focused, not a mechanistic structural study\"]\n    },\n    {\n      \"year\": 2020,\n      \"claim\": \"Extended the activity-stability model with additional variants, including a highly unstable, catalytically dead mutant producing the most severe neonatal phenotype and a partly-retained variant with reduced activity.\",\n      \"evidence\": \"Cell-culture expression, localization, stability, and activity assays of FGE-E113K and A348V variants\",\n      \"pmids\": [\"28566233\", \"32048457\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"Single-lab variant characterizations without in vivo confirmation\", \"Structural rationale only partially modeled\"]\n    },\n    {\n      \"year\": 2022,\n      \"claim\": \"Validated the residual-function model in vivo by generating patient-variant knock-in mice that show partial reduction of sulfatase activities, confirming hypomorphic behavior of specific alleles.\",\n      \"evidence\": \"Knock-in mouse lines with biochemical sulfatase activity measurement and histopathology\",\n      \"pmids\": [\"36433920\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"Tissue-specific differences in residual activity not fully mapped\", \"Genotype-phenotype correlation across alleles incomplete\"]\n    },\n    {\n      \"year\": 2025,\n      \"claim\": \"Revealed a developmental role beyond housekeeping sulfatase activation, with sumf1/sumf2 acting as opposing temporal regulators of convergence-extension morphogenesis via the extracellular sulfatase Sulf1.\",\n      \"evidence\": \"Zebrafish explants, mRNA overexpression, loss-of-function, and HSPG sulfation manipulation (preprint)\",\n      \"pmids\": [\"bio_10.1101_2025.10.09.681375\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"Preprint not yet peer-reviewed\", \"Whether this developmental timing role is conserved in mammals unknown\", \"Molecular basis of sumf1/sumf2 expression inversion not defined\"]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"The structural and catalytic determinants linking specific copper-coordination and FGE-substrate recognition to graded residual activity in MSD remain incompletely defined.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\"No high-resolution mechanism connecting cofactor handling to mutant activity loss\", \"Predictive activity-stability-to-severity framework not established\"]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0140096\", \"supporting_discovery_ids\": [0, 1, 8]},\n      {\"term_id\": \"GO:0016491\", \"supporting_discovery_ids\": [8]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005783\", \"supporting_discovery_ids\": [4, 5, 6]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-392499\", \"supporting_discovery_ids\": [0, 5]},\n      {\"term_id\": \"R-HSA-1643685\", \"supporting_discovery_ids\": [4, 7]}\n    ],\n    \"complexes\": [],\n    \"partners\": [\"PDI\", \"ERGIC-53\", \"ERp44\"],\n    \"other_free_text\": []\n  }\n}","audit_flag":null,"evaluation":{"pairwise":"win","faith_supported":5,"faith_total":5,"faith_pct":100.0}}