{"gene":"CTSC","run_date":"2026-04-28T17:28:53","timeline":{"discoveries":[{"year":2001,"finding":"Crystal structure of human dipeptidyl peptidase I (cathepsin C/CTSC) revealed that an exclusion domain transforms the papain-like endopeptidase framework into a tetrameric enzyme. The four active sites are exposed to solvent in a tetrahedral arrangement, the exclusion domain blocks access of polypeptide chains except at their termini, and Asp1 positions the N-terminal amino group of the substrate, explaining the strict dipeptidyl aminopeptidase specificity. Missense mutations causing Haim-Munk and Papillon-Lefèvre syndromes were mapped to positions that disrupt the fold.","method":"X-ray crystallography with functional/structural interpretation; mapping of disease mutations","journal":"The EMBO journal","confidence":"High","confidence_rationale":"Tier 1 — crystal structure with mechanistic interpretation, disease mutation mapping; seminal single paper with multiple orthogonal insights","pmids":["11726493"],"is_preprint":false},{"year":1993,"finding":"CTSC (dipeptidyl peptidase I, DPPI) is required for post-translational activation of granule serine proteases in immune cells. Inhibition of DPPI in CD8+ T cells, LAK cells, mast cells, and myeloid cells impaired generation of cathepsin G, granzyme, and other serine protease activities. In U-937 cells, DPPI inhibition caused accumulation of the pro-enzyme form of cathepsin G bearing its N-terminal dipeptide extension, demonstrating that DPPI cleaves the two-residue activation peptide.","method":"Pharmacological inhibition of DPPI in cell lines, pulse-chase radiolabeling, immunoblotting for pro- vs. active enzyme forms","journal":"The Journal of biological chemistry","confidence":"High","confidence_rationale":"Tier 2 — multiple cell types, direct biochemical demonstration of pro-enzyme accumulation upon DPPI inhibition; foundational mechanistic paper","pmids":["8428921"],"is_preprint":false},{"year":1999,"finding":"Loss-of-function mutations in CTSC (cathepsin C gene at 11q14) cause Papillon-Lefèvre syndrome (autosomal recessive palmoplantar keratosis and periodontitis). Functional assays in patient leukocytes demonstrated near-total loss of cathepsin C enzymatic activity; obligate carriers showed reduced activity, confirming the causal relationship.","method":"Homozygosity mapping, genomic sequencing, functional enzyme activity assay in patient leukocytes","journal":"Nature genetics","confidence":"High","confidence_rationale":"Tier 1-2 — genetic mapping combined with direct enzymatic activity measurement in patient cells; independently confirmed in the same year by Hart et al.","pmids":["10581027","10593994"],"is_preprint":false},{"year":2001,"finding":"Human pro-DPPI (pro-cathepsin C) expressed in insect cells is a dimer incapable of autoactivation. Cathepsin L efficiently activates pro-DPPI at pH 4.5 via two cleavage pathways: (1) initial cleavage within the pro-region followed by removal of the activation peptide and then separation into heavy/light chains, or (2) separation of the pro-region from the catalytic domain first. Cathepsin S is a less efficient activator; cathepsin B and DPPI itself cannot activate the proenzyme. Cystatin C and stefins A and B inhibit active DPPI with Ki values of 0.5–1.1 nM.","method":"Baculovirus expression, affinity purification of active and precursor forms, in vitro activation assays, kinetic inhibition measurements, CD spectroscopy, glycosylation analysis","journal":"Biochemistry","confidence":"High","confidence_rationale":"Tier 1 — reconstituted in vitro activation system with multiple protease combinations, kinetic characterization; rigorous biochemical study","pmids":["11327826"],"is_preprint":false},{"year":2000,"finding":"Haim-Munk syndrome (HMS) and Papillon-Lefèvre syndrome (PLS) are allelic disorders caused by mutations in CTSC. A mutation in exon 6 of CTSC (2127A→G, changing a conserved amino acid) segregates with HMS in four nuclear families from the Cochin isolate. A mutation at the same codon (2126C→T) causes classical PLS in a Turkish family, demonstrating that different mutations at the same residue cause phenotypically distinct syndromes.","method":"Sequencing of CTSC in affected families, haplotype analysis, co-segregation analysis","journal":"Journal of medical genetics","confidence":"High","confidence_rationale":"Tier 2 — multiple families, allelic mapping; confirms CTSC as causal gene for both syndromes","pmids":["10662807"],"is_preprint":false},{"year":2004,"finding":"In humans with Papillon-Lefèvre syndrome (CTSC/DPPI loss-of-function), neutrophil-derived serine proteases (cathepsin G, neutrophil elastase) are severely reduced in activity and stability, confirming DPPI's essential role in their activation. Surprisingly, granzyme activities in cytotoxic lymphocytes (LAK cells) are retained at significant levels, and LAK-mediated cytotoxicity against K562 is normal. Neutrophils from PLS patients do not uniformly fail to kill S. aureus and E. coli, indicating that serine proteases are not the principal bactericidal mechanism.","method":"Enzymatic activity assays in primary patient cells (neutrophils, LAK cells), bactericidal assays, immunological functional assays","journal":"Journal of immunology","confidence":"High","confidence_rationale":"Tier 2 — primary patient cells, multiple orthogonal functional assays; defines the human cell-type specificity of CTSC-dependent serine protease activation","pmids":["15585850"],"is_preprint":false},{"year":1997,"finding":"The human CTSC (dipeptidyl-peptidase I) gene spans ~3.5 kb and consists of two exons and one intron, a genomic organization distinct from other papain-type cysteine proteinases. By FISH, the gene maps to chromosomal region 11q14.1–q14.3. Northern analysis shows highest mRNA expression in lung, kidney, and placenta; high levels in polymorphonuclear leukocytes and alveolar macrophages. IL-2 treatment of lymphocytes significantly increases DPPI mRNA levels, indicating transcriptional regulation.","method":"Genomic cloning, FISH chromosomal mapping, Northern blot analysis, IL-2 stimulation experiment","journal":"The Journal of biological chemistry","confidence":"High","confidence_rationale":"Tier 2 — direct experimental characterization of gene structure, chromosomal location, and regulated expression; foundational molecular genetics paper","pmids":["9092576"],"is_preprint":false},{"year":2014,"finding":"CTSC mRNA is negatively regulated by miR-23a. In NK cells, activation down-regulates miR-23a, which de-represses CTSC expression, leading to increased cathepsin C protein and granzyme B activity required for cytotoxicity. Treatment with all-trans retinoic acid (ATRA) induces miR-23a expression, decreases CTSC mRNA and protein levels, reduces granzyme B activity, and impairs NK cell cytotoxicity in a mouse model.","method":"miRNA/mRNA expression profiling, functional validation with ATRA treatment, in vivo mouse cytotoxicity model, granzyme B activity assay","journal":"The international journal of biochemistry & cell biology","confidence":"Medium","confidence_rationale":"Tier 2-3 — miR-23a/CTSC regulatory link validated in NK cells and in vivo, but mechanism of miRNA targeting not fully reconstituted","pmids":["24440757"],"is_preprint":false},{"year":2021,"finding":"Tumor-secreted CTSC promotes breast cancer lung metastasis by enzymatically activating neutrophil membrane-bound proteinase 3 (PR3). Activated PR3 processes IL-1β, which activates NF-κB, upregulating IL-6 and CCL3 for neutrophil recruitment. The CTSC–PR3–IL-1β axis also induces neutrophil ROS production and NET formation; NETs degrade thrombospondin-1 and support metastatic colonization. Pharmacological targeting of CTSC with compound AZD7986 suppresses lung metastasis in a mouse model.","method":"In vitro enzymatic activation assays, co-culture experiments, mouse metastasis models (genetic and pharmacological CTSC inhibition), NET quantification, thrombospondin-1 degradation assays, human tumor correlations","journal":"Cancer cell","confidence":"High","confidence_rationale":"Tier 1-2 — multiple orthogonal methods including enzymatic reconstitution, genetic knockdown, pharmacological inhibition, and in vivo models; independently consistent results","pmids":["33450198"],"is_preprint":false},{"year":2020,"finding":"In a phase 2 clinical trial, brensocatib (INS1007), an oral reversible inhibitor of DPP-1 (cathepsin C/CTSC), reduced sputum neutrophil elastase activity in patients with bronchiectasis and prolonged time to first exacerbation, demonstrating that pharmacological inhibition of CTSC reduces downstream neutrophil serine protease activity in vivo in humans.","method":"Randomized double-blind placebo-controlled phase 2 trial; sputum neutrophil elastase activity measured as pharmacodynamic endpoint","journal":"The New England journal of medicine","confidence":"High","confidence_rationale":"Tier 2 — large randomized controlled trial directly measuring CTSC-dependent neutrophil elastase activity as pharmacodynamic readout in humans","pmids":["32897034"],"is_preprint":false},{"year":2002,"finding":"CTSC mutations in North American PLS families (including novel p.G139R) are associated with dramatically reduced CTSC protease enzyme activity in patient leukocytes, with almost no detectable activity. Biochemical analysis confirmed that mutations altering restriction enzyme sites in conserved regions of CTSC abolish enzymatic function.","method":"CTSC gene sequencing, restriction enzyme analysis, direct enzymatic activity measurement in patient leukocytes","journal":"Human mutation","confidence":"Medium","confidence_rationale":"Tier 2 — direct functional assay in patient cells confirming loss of enzymatic activity for specific mutations","pmids":["12112662"],"is_preprint":false},{"year":2014,"finding":"Mutations in the CTSC gene cluster predominantly in exons 5–7, which encode the heavy chain of cathepsin C. This region is implicated in tetramerization, suggesting that tetramer formation is important for CTSC enzymatic activity. Genotype–phenotype analysis showed that the same CTSC mutation can produce different phenotypic severity, implicating modifier genes or environmental factors outside CTSC.","method":"Mutational database analysis, genotype-phenotype correlation across 75 published mutations","journal":"Molecular genetics & genomic medicine","confidence":"Low","confidence_rationale":"Tier 4 — database/correlative analysis without direct experimental validation of tetramerization role","pmids":["24936511"],"is_preprint":false},{"year":2021,"finding":"In a PLS patient with a 503A>G substitution in CTSC exon 4 (p.Tyr168Cys), neutrophil lysates lacked CTSC protein and showed no CTSC or neutrophil serine protease (NSP) activity. Neutrophil counts, morphology, priming, chemotaxis, radical production, and apoptosis regulation were normal, but NET formation upon PMA stimulation was severely depressed, identifying NSP-dependent NET formation as a specific functional deficit in CTSC-deficient neutrophils.","method":"Patient neutrophil functional assays: enzyme activity, chemotaxis, oxidative burst, NET quantification, apoptosis; comparison with another PLS mutation and healthy controls","journal":"PloS one","confidence":"Medium","confidence_rationale":"Tier 2 — primary patient cells, multiple orthogonal functional readouts; demonstrates selective NET formation defect","pmids":["34932608"],"is_preprint":false},{"year":2024,"finding":"Novel indolinone-based CTSC inhibitor B22 was shown to inhibit CTSC enzymatic activity by binding to the S2 pocket and S1 site of cathepsin C. B22 further inhibits downstream neutrophil serine protease activity and exerts anti-inflammatory effects by modulating cytokine levels in inflammatory bowel disease models in vitro and in vivo.","method":"In vitro CTSC activity assay, molecular docking, in vivo IBD mouse model, cytokine measurement","journal":"European journal of medicinal chemistry","confidence":"Medium","confidence_rationale":"Tier 2-3 — direct enzymatic inhibition assay with binding site identification and in vivo validation; single study","pmids":["39383651"],"is_preprint":false},{"year":2022,"finding":"Fish CTSC (TroCTSC from Trachinotus ovatus) has three conserved catalytic residues (Cys251, His397, Asn419); mutation of any of these dramatically reduces proteolytic activity. Recombinant TroCTSC has optimal hydrolase activity at 40°C and pH 5.5, is promoted by Zn2+ and Ca2+ but inhibited by Fe2+ and Cu2+. TroCTSC localizes to the cytoplasm and partially co-localizes with lysosomes; after V. harveyi stimulation it translocates to the nucleus. Overexpression enhances bacterial clearance and pro-inflammatory cytokine expression; knockdown reduces antibacterial capacity.","method":"Site-directed mutagenesis of catalytic residues, recombinant protein activity assay, immunofluorescence/co-localization, in vivo overexpression/knockdown in goldfish","journal":"Fish & shellfish immunology","confidence":"Medium","confidence_rationale":"Tier 1-2 — catalytic residue mutagenesis with direct activity measurement; ortholog study with functional in vivo validation","pmids":["35952999"],"is_preprint":false},{"year":2020,"finding":"Praeruptorin B reduces migration and invasion of renal carcinoma cells by suppressing the EGFR–MEK–ERK signaling pathway, which leads to downregulation of CTSC mRNA and protein expression. EGF-induced upregulation of CTSC was blocked by praeruptorin B, placing CTSC downstream of EGFR–MEK–ERK signaling in the regulation of tumor cell invasiveness.","method":"Cell migration/invasion assays, Western blot for CTSC and signaling proteins, EGF stimulation and praeruptorin B inhibition experiments","journal":"International journal of molecular sciences","confidence":"Medium","confidence_rationale":"Tier 3 — pathway placement via pharmacological inhibition in cell lines; single study","pmids":["32331211"],"is_preprint":false}],"current_model":"CTSC (cathepsin C/dipeptidyl peptidase I) is a lysosomal cysteine protease that functions as a tetramer with a unique exclusion domain that restricts substrate access to N-terminal dipeptides; it is the obligate activator of granule serine proteases (neutrophil elastase, cathepsin G, proteinase 3, granzymes) in immune cells by cleaving their two-residue activation peptides, a role confirmed by loss-of-function mutations causing Papillon-Lefèvre and Haim-Munk syndromes; active CTSC is generated by cathepsin L-mediated processing of its proenzyme, and its activity is regulated post-transcriptionally by miR-23a; in cancer contexts, secreted CTSC drives metastasis by activating neutrophil PR3 to process IL-1β, promoting NF-κB-dependent neutrophil recruitment and NET formation that degrades anti-metastatic thrombospondin-1."},"narrative":{"teleology":[{"year":1993,"claim":"Establishing that CTSC is the enzyme responsible for converting pro-forms of granule serine proteases to their active forms resolved how immune effector cells generate mature cathepsin G, granzymes, and related proteases.","evidence":"Pharmacological DPPI inhibition in CD8+ T cells, LAK cells, mast cells, and myeloid lines with pulse-chase labeling showing pro-cathepsin G accumulation","pmids":["8428921"],"confidence":"High","gaps":["Structural basis of dipeptide specificity was unknown","Whether CTSC was essential for all granule serine proteases in all immune lineages was untested genetically"]},{"year":1997,"claim":"Characterization of the CTSC gene structure and chromosomal location provided the molecular framework for subsequent disease-gene mapping and showed that CTSC expression is transcriptionally regulated by immune signals such as IL-2.","evidence":"Genomic cloning, FISH mapping to 11q14, Northern blot tissue profiling, and IL-2 stimulation of lymphocytes","pmids":["9092576"],"confidence":"High","gaps":["Promoter elements and transcription factors driving tissue-specific expression were not defined","Whether IL-2-induced upregulation has functional consequences for serine protease activation was not tested"]},{"year":1999,"claim":"Identification of CTSC loss-of-function mutations as the cause of Papillon-Lefèvre syndrome, and subsequently Haim-Munk syndrome, proved that CTSC is essential for normal skin and periodontal homeostasis in humans and that the two syndromes are allelic.","evidence":"Homozygosity mapping, CTSC sequencing in affected families, enzymatic activity assays in patient leukocytes; allelic mutations at the same codon causing PLS vs. HMS","pmids":["10581027","10593994","10662807"],"confidence":"High","gaps":["How CTSC deficiency leads specifically to periodontal destruction and keratosis remained mechanistically unclear","Modifier genes explaining phenotypic variability between allelic disorders were not identified"]},{"year":2001,"claim":"The crystal structure explained how the exclusion domain converts the papain-fold endopeptidase into a strict dipeptidyl aminopeptidase by blocking internal polypeptide access, and showed that disease mutations disrupt this fold; concurrently, in vitro reconstitution demonstrated that cathepsin L is the physiological activator of pro-CTSC.","evidence":"X-ray crystallography of human CTSC tetramer with disease mutation mapping; baculovirus expression of pro-DPPI with in vitro activation by cathepsin L and kinetic inhibition by cystatins","pmids":["11726493","11327826"],"confidence":"High","gaps":["Dynamics of tetramer assembly in vivo and whether monomeric/dimeric intermediates are functional were not addressed","In vivo confirmation that cathepsin L is the predominant activator in specific cell types was lacking"]},{"year":2004,"claim":"Analysis of PLS patient immune cells revealed that CTSC dependence for serine protease activation is cell-type-specific: neutrophil cathepsin G and elastase are severely affected, whereas cytotoxic lymphocyte granzymes retain significant activity, resolving an apparent paradox of selective immunodeficiency.","evidence":"Enzymatic activity assays and bactericidal/cytotoxicity functional assays in primary neutrophils and LAK cells from PLS patients","pmids":["15585850"],"confidence":"High","gaps":["Alternative activation pathways for granzymes in lymphocytes were not identified","Whether residual granzyme activity is sufficient for all cytotoxic lymphocyte functions in vivo was not determined"]},{"year":2014,"claim":"Discovery that miR-23a negatively regulates CTSC mRNA introduced a post-transcriptional control layer linking NK cell activation status to granzyme B maturation and cytotoxic potential.","evidence":"miRNA profiling in NK cells, ATRA-mediated miR-23a induction reducing CTSC protein and granzyme B activity, in vivo mouse cytotoxicity model","pmids":["24440757"],"confidence":"Medium","gaps":["Direct miR-23a binding site validation (e.g., luciferase reporter with mutation) was not shown","Relevance of this regulatory axis in neutrophils or other CTSC-dependent cell types was not tested"]},{"year":2020,"claim":"A phase 2 clinical trial demonstrated that pharmacological inhibition of CTSC (brensocatib) reduces neutrophil elastase activity in vivo in humans and improves clinical outcomes in bronchiectasis, validating the CTSC–serine protease axis as a therapeutic target.","evidence":"Randomized double-blind placebo-controlled trial measuring sputum neutrophil elastase activity as pharmacodynamic endpoint","pmids":["32897034"],"confidence":"High","gaps":["Long-term immunological consequences of sustained CTSC inhibition in humans were not assessed","Whether brensocatib affects granzyme-dependent immunity was not evaluated"]},{"year":2021,"claim":"Two advances clarified CTSC's roles in neutrophil biology and cancer: CTSC-deficient neutrophils were shown to have a selective defect in NET formation, and tumor-secreted CTSC was found to promote metastasis via a PR3–IL-1β–NF-κB cascade driving neutrophil recruitment and NET-mediated thrombospondin-1 degradation.","evidence":"PLS patient neutrophil functional profiling showing selective NET impairment; mouse metastasis models with genetic/pharmacological CTSC inhibition, enzymatic reconstitution of CTSC–PR3 activation, NET quantification","pmids":["34932608","33450198"],"confidence":"High","gaps":["Whether tumor-secreted CTSC activates serine proteases beyond PR3 in the tumor microenvironment is unknown","The mechanism by which CTSC is secreted by tumor cells rather than retained in lysosomes was not elucidated"]},{"year":null,"claim":"Key unresolved questions include the structural basis of CTSC tetramer assembly dynamics in vivo, the identity of alternative granzyme-activating proteases in lymphocytes, and the long-term immunological safety profile of chronic CTSC inhibition.","evidence":"","pmids":[],"confidence":"Low","gaps":["No in vivo evidence identifying the protease(s) that activate granzymes independently of CTSC","No structural data on conformational changes during CTSC proenzyme processing in a cellular context","Mechanism of CTSC secretion by tumor cells into the extracellular space is undefined"]}],"mechanism_profile":{"molecular_activity":[{"term_id":"GO:0140096","term_label":"catalytic activity, acting on a protein","supporting_discovery_ids":[0,1,3,5,8,9,14]},{"term_id":"GO:0016787","term_label":"hydrolase activity","supporting_discovery_ids":[0,1,3,14]}],"localization":[{"term_id":"GO:0005764","term_label":"lysosome","supporting_discovery_ids":[0,3,14]},{"term_id":"GO:0005576","term_label":"extracellular region","supporting_discovery_ids":[8]}],"pathway":[{"term_id":"GO:0005576","term_label":"extracellular region","supporting_discovery_ids":[8]},{"term_id":"R-HSA-168256","term_label":"Immune System","supporting_discovery_ids":[1,5,8,9,12]},{"term_id":"R-HSA-392499","term_label":"Metabolism of proteins","supporting_discovery_ids":[0,1,3]},{"term_id":"R-HSA-1643685","term_label":"Disease","supporting_discovery_ids":[2,4,8,10]}],"complexes":[],"partners":["CTSG","ELANE","PRTN3","GZMB","CTSL"],"other_free_text":[]},"mechanistic_narrative":"CTSC (cathepsin C / dipeptidyl peptidase 1) is a lysosomal cysteine protease that functions as a tetramer whose unique exclusion domain restricts catalysis to the sequential removal of N-terminal dipeptides from polypeptide substrates [PMID:11726493]. CTSC is the obligate activator of granule serine proteases—neutrophil elastase, cathepsin G, proteinase 3, and granzymes—by cleaving their two-residue activation peptides; its proenzyme is itself processed by cathepsin L, while endogenous inhibitors (cystatins) and the microRNA miR-23a regulate its activity post-translationally and post-transcriptionally [PMID:8428921, PMID:11327826, PMID:24440757]. Loss-of-function mutations in CTSC cause the allelic Mendelian disorders Papillon-Lefèvre syndrome and Haim-Munk syndrome, characterized by palmoplantar keratosis, severe periodontitis, and impaired neutrophil serine protease activity and NET formation [PMID:10581027, PMID:10662807, PMID:34932608]. In cancer, tumor-secreted CTSC activates neutrophil PR3 to drive an IL-1β–NF-κB inflammatory cascade and NET-dependent degradation of thrombospondin-1, promoting metastatic colonization, a pathway pharmacologically targetable by DPP-1 inhibitors such as brensocatib [PMID:33450198, PMID:32897034]."},"prefetch_data":{"uniprot":{"accession":"P53634","full_name":"Dipeptidyl peptidase 1","aliases":["Cathepsin C","Cathepsin J","Dipeptidyl peptidase I","DPP-I","DPPI","Dipeptidyl transferase"],"length_aa":463,"mass_kda":51.9,"function":"Thiol protease (PubMed:1586157). Has dipeptidylpeptidase activity (PubMed:1586157). Active against a broad range of dipeptide substrates composed of both polar and hydrophobic amino acids (PubMed:1586157). Proline cannot occupy the P1 position and arginine cannot occupy the P2 position of the substrate (PubMed:1586157). Can act as both an exopeptidase and endopeptidase (PubMed:1586157). Activates serine proteases such as elastase, cathepsin G and granzymes A and B (PubMed:8428921)","subcellular_location":"Lysosome","url":"https://www.uniprot.org/uniprotkb/P53634/entry"},"depmap":{"release":"DepMap","has_data":true,"is_common_essential":false,"resolved_as":"","url":"https://depmap.org/portal/gene/CTSC","classification":"Not Classified","n_dependent_lines":0,"n_total_lines":1208,"dependency_fraction":0.0},"opencell":{"profiled":false,"resolved_as":"","ensg_id":"","cell_line_id":"","localizations":[],"interactors":[],"url":"https://opencell.sf.czbiohub.org/search/CTSC","total_profiled":1310},"omim":[{"mim_id":"621351","title":"SERINE PROTEASE 57; PRSS57","url":"https://www.omim.org/entry/621351"},{"mim_id":"607658","title":"HYPOTRICHOSIS-OSTEOLYSIS-PERIODONTITIS-PALMOPLANTAR KERATODERMA SYNDROME","url":"https://www.omim.org/entry/607658"},{"mim_id":"602365","title":"CATHEPSIN C; CTSC","url":"https://www.omim.org/entry/602365"},{"mim_id":"301031","title":"CONGENITAL DISORDER OF GLYCOSYLATION, TYPE Icc; CDG1CC","url":"https://www.omim.org/entry/301031"},{"mim_id":"300853","title":"IMMUNODEFICIENCY, X-LINKED, WITH MAGNESIUM DEFECT, EPSTEIN-BARR VIRUS INFECTION, AND NEOPLASIA; XMEN","url":"https://www.omim.org/entry/300853"}],"hpa":{"profiled":true,"resolved_as":"","reliability":"Supported","locations":[{"location":"Vesicles","reliability":"Supported"}],"tissue_specificity":"Low tissue specificity","tissue_distribution":"Detected in all","driving_tissues":[],"url":"https://www.proteinatlas.org/search/CTSC"},"hgnc":{"alias_symbol":["DPP1"],"prev_symbol":["PLS","PALS"]},"alphafold":{"accession":"P53634","domains":[{"cath_id":"2.40.128.80","chopping":"33-156","consensus_level":"high","plddt":90.8852,"start":33,"end":156},{"cath_id":"3.90.70.10","chopping":"186-461","consensus_level":"high","plddt":94.8904,"start":186,"end":461}],"viewer_url":"https://alphafold.ebi.ac.uk/entry/P53634","model_url":"https://alphafold.ebi.ac.uk/files/AF-P53634-F1-model_v6.cif","pae_url":"https://alphafold.ebi.ac.uk/files/AF-P53634-F1-predicted_aligned_error_v6.png","plddt_mean":90.12},"mouse_models":{"mgi_url":"https://www.informatics.jax.org/marker/summary?nomen=CTSC","jax_strain_url":"https://www.jax.org/strain/search?query=CTSC"},"sequence":{"accession":"P53634","fasta_url":"https://rest.uniprot.org/uniprotkb/P53634.fasta","uniprot_url":"https://www.uniprot.org/uniprotkb/P53634/entry","alphafold_viewer_url":"https://alphafold.ebi.ac.uk/entry/P53634"}},"corpus_meta":[{"pmid":"21693065","id":"PMC_21693065","title":"Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems.","date":"2011","source":"BMC bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/21693065","citation_count":660,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"32897034","id":"PMC_32897034","title":"Phase 2 Trial of the DPP-1 Inhibitor Brensocatib in Bronchiectasis.","date":"2020","source":"The New England journal of medicine","url":"https://pubmed.ncbi.nlm.nih.gov/32897034","citation_count":282,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12607117","id":"PMC_12607117","title":"Prediction of clinical outcome with microarray data: a partial least squares discriminant analysis (PLS-DA) approach.","date":"2003","source":"Human genetics","url":"https://pubmed.ncbi.nlm.nih.gov/12607117","citation_count":249,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"10753959","id":"PMC_10753959","title":"Molecular cloning and characterization of Pals, proteins associated with mLin-7.","date":"2000","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/10753959","citation_count":147,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"27342308","id":"PMC_27342308","title":"FERONIA and Her Pals: Functions and Mechanisms.","date":"2016","source":"Plant physiology","url":"https://pubmed.ncbi.nlm.nih.gov/27342308","citation_count":139,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"9452443","id":"PMC_9452443","title":"Isolation and characterization of the Saccharomyces cerevisiae DPP1 gene encoding diacylglycerol pyrophosphate phosphatase.","date":"1998","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/9452443","citation_count":113,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"10329682","id":"PMC_10329682","title":"The LPP1 and DPP1 gene products account for most of the isoprenoid phosphate phosphatase activities in Saccharomyces cerevisiae.","date":"1999","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/10329682","citation_count":112,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"21644597","id":"PMC_21644597","title":"Theoretical Justification of Wavelength Selection in PLS Calibration:  Development of a New Algorithm.","date":"1998","source":"Analytical chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/21644597","citation_count":102,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"2790931","id":"PMC_2790931","title":"Splenic outer periarterial lymphoid sheath (PALS): an immunoproliferative microenvironment constituted by antigen-laden marginal metallophils and ED2-positive macrophages in the rat.","date":"1989","source":"Cell and tissue research","url":"https://pubmed.ncbi.nlm.nih.gov/2790931","citation_count":88,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"28394309","id":"PMC_28394309","title":"MORF9 increases the RNA-binding activity of PLS-type pentatricopeptide repeat protein in plastid RNA editing.","date":"2017","source":"Nature plants","url":"https://pubmed.ncbi.nlm.nih.gov/28394309","citation_count":84,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34193885","id":"PMC_34193885","title":"Lipidomics study of plasma from patients suggest that ALS and PLS are part of a continuum of motor neuron disorders.","date":"2021","source":"Scientific reports","url":"https://pubmed.ncbi.nlm.nih.gov/34193885","citation_count":55,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25460796","id":"PMC_25460796","title":"Liaison alcaline: Pals entice non-endosomal ESCRTs to the plasma membrane for pH signaling.","date":"2014","source":"Current opinion in microbiology","url":"https://pubmed.ncbi.nlm.nih.gov/25460796","citation_count":54,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"11113652","id":"PMC_11113652","title":"Prediction of drug transport processes using simple parameters and PLS statistics. The use of ACD/logP and ACD/ChemSketch descriptors.","date":"2001","source":"European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences","url":"https://pubmed.ncbi.nlm.nih.gov/11113652","citation_count":53,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"11139591","id":"PMC_11139591","title":"Regulation of the Saccharomyces cerevisiae DPP1-encoded diacylglycerol pyrophosphate phosphatase by zinc.","date":"2001","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/11139591","citation_count":52,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"24047899","id":"PMC_24047899","title":"Structure of a PLS-class pentatricopeptide repeat protein provides insights into mechanism of RNA recognition.","date":"2013","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/24047899","citation_count":51,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"24936511","id":"PMC_24936511","title":"CTSC and Papillon-Lefèvre syndrome: detection of recurrent mutations in Hungarian patients, a review of published variants and database update.","date":"2014","source":"Molecular genetics & genomic medicine","url":"https://pubmed.ncbi.nlm.nih.gov/24936511","citation_count":49,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"27095196","id":"PMC_27095196","title":"A PPR protein in the PLS subfamily stabilizes the 5'-end of processed rpl16 mRNAs in maize chloroplasts.","date":"2016","source":"Nucleic acids research","url":"https://pubmed.ncbi.nlm.nih.gov/27095196","citation_count":46,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"11752289","id":"PMC_11752289","title":"PALS db: Putative Alternative Splicing database.","date":"2002","source":"Nucleic acids research","url":"https://pubmed.ncbi.nlm.nih.gov/11752289","citation_count":43,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12696101","id":"PMC_12696101","title":"PLS-ANN based classification model for oral submucous fibrosis and oral carcinogenesis.","date":"2003","source":"Lasers in surgery and medicine","url":"https://pubmed.ncbi.nlm.nih.gov/12696101","citation_count":40,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"11536212","id":"PMC_11536212","title":"Theoretical calculation and prediction of drug transport processes using simple parameters and partial least squares projections to latent structures (PLS) statistics. The use of electrotopological state indices.","date":"2001","source":"Journal of pharmaceutical sciences","url":"https://pubmed.ncbi.nlm.nih.gov/11536212","citation_count":39,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"28031031","id":"PMC_28031031","title":"Gene regulatory network inference using PLS-based methods.","date":"2016","source":"BMC bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/28031031","citation_count":39,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25286920","id":"PMC_25286920","title":"Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data.","date":"2014","source":"Bioinformatics (Oxford, England)","url":"https://pubmed.ncbi.nlm.nih.gov/25286920","citation_count":38,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34738744","id":"PMC_34738744","title":"Precise Diabetic Wound Therapy: PLS Nanospheres Eliminate Senescent Cells via DPP4 Targeting and PARP1 Activation.","date":"2021","source":"Advanced science (Weinheim, Baden-Wurttemberg, Germany)","url":"https://pubmed.ncbi.nlm.nih.gov/34738744","citation_count":37,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"21595958","id":"PMC_21595958","title":"Modeling microRNA-mRNA interactions using PLS regression in human colon cancer.","date":"2011","source":"BMC medical genomics","url":"https://pubmed.ncbi.nlm.nih.gov/21595958","citation_count":37,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12112662","id":"PMC_12112662","title":"Biochemical and mutational analyses of the cathepsin c gene (CTSC) in three North American families with Papillon Lefèvre syndrome.","date":"2002","source":"Human mutation","url":"https://pubmed.ncbi.nlm.nih.gov/12112662","citation_count":36,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"24440757","id":"PMC_24440757","title":"All-trans retinoic acid (ATRA) induces miR-23a expression, decreases CTSC expression and granzyme B activity leading to impaired NK cell cytotoxicity.","date":"2014","source":"The international journal of biochemistry & cell biology","url":"https://pubmed.ncbi.nlm.nih.gov/24440757","citation_count":36,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23577016","id":"PMC_23577016","title":"Interaction of PLS and PIN and hormonal crosstalk in Arabidopsis root development.","date":"2013","source":"Frontiers in plant science","url":"https://pubmed.ncbi.nlm.nih.gov/23577016","citation_count":35,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"30605550","id":"PMC_30605550","title":"Comprehensive Prediction of Target RNA Editing Sites for PLS-Class PPR Proteins in Arabidopsis thaliana.","date":"2019","source":"Plant & cell physiology","url":"https://pubmed.ncbi.nlm.nih.gov/30605550","citation_count":33,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12653535","id":"PMC_12653535","title":"The comparative molecular surface analysis (CoMSA) with modified uniformative variable elimination-PLS (UVE-PLS) method: application to the steroids binding the aromatase enzyme.","date":"2003","source":"Journal of chemical information and computer sciences","url":"https://pubmed.ncbi.nlm.nih.gov/12653535","citation_count":33,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"21358586","id":"PMC_21358586","title":"QSAR models for CXCR2 receptor antagonists based on the genetic algorithm for data preprocessing prior to application of the PLS linear regression method and design of the new compounds using in silico virtual screening.","date":"2011","source":"Molecules (Basel, Switzerland)","url":"https://pubmed.ncbi.nlm.nih.gov/21358586","citation_count":32,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"14630917","id":"PMC_14630917","title":"Vacuole membrane topography of the DPP1-encoded diacylglycerol pyrophosphate phosphatase catalytic site from Saccharomyces cerevisiae.","date":"2003","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/14630917","citation_count":31,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"18488986","id":"PMC_18488986","title":"Determination of the secondary structure of proteins in different environments by FTIR-ATR spectroscopy and PLS regression.","date":"2008","source":"Biopolymers","url":"https://pubmed.ncbi.nlm.nih.gov/18488986","citation_count":31,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"35364075","id":"PMC_35364075","title":"Relative contributions of different sources to DOM in Erhai Lake as revealed by PLS-PM.","date":"2022","source":"Chemosphere","url":"https://pubmed.ncbi.nlm.nih.gov/35364075","citation_count":29,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"28081265","id":"PMC_28081265","title":"The Plasmin-Sensitive Protein Pls in Methicillin-Resistant Staphylococcus aureus (MRSA) Is a Glycoprotein.","date":"2017","source":"PLoS pathogens","url":"https://pubmed.ncbi.nlm.nih.gov/28081265","citation_count":28,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23880463","id":"PMC_23880463","title":"The multi-PDZ domain protein-1 (MUPP-1) expression regulates cellular levels of the PALS-1/PATJ polarity complex.","date":"2013","source":"Experimental cell research","url":"https://pubmed.ncbi.nlm.nih.gov/23880463","citation_count":28,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"22459857","id":"PMC_22459857","title":"A comparison of partial least squares (PLS) and sparse PLS regressions in genomic selection in French dairy cattle.","date":"2012","source":"Journal of dairy science","url":"https://pubmed.ncbi.nlm.nih.gov/22459857","citation_count":26,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"10703966","id":"PMC_10703966","title":"Simultaneous determination of paracetamol, caffeine and acetylsalicylic acid by means of a FI ultraviolet pls multioptosensing device.","date":"1999","source":"Journal of pharmaceutical and biomedical analysis","url":"https://pubmed.ncbi.nlm.nih.gov/10703966","citation_count":26,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"38876073","id":"PMC_38876073","title":"Optimizing feature selection with gradient boosting machines in PLS regression for predicting moisture and protein in multi-country corn kernels via NIR spectroscopy.","date":"2024","source":"Food chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/38876073","citation_count":25,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"15845485","id":"PMC_15845485","title":"The surface protein Pls of methicillin-resistant Staphylococcus aureus is a virulence factor in septic arthritis.","date":"2005","source":"Infection and immunity","url":"https://pubmed.ncbi.nlm.nih.gov/15845485","citation_count":25,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"30450510","id":"PMC_30450510","title":"Fast discrimination of bacteria using a filter paper-based SERS platform and PLS-DA with uncertainty estimation.","date":"2018","source":"Analytical and bioanalytical chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/30450510","citation_count":25,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"37463170","id":"PMC_37463170","title":"Multiple pals gene modules control a balance between immunity and development in Caenorhabditis elegans.","date":"2023","source":"PLoS pathogens","url":"https://pubmed.ncbi.nlm.nih.gov/37463170","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12799368","id":"PMC_12799368","title":"Regulation of the yeast DPP1-encoded diacylglycerol pyrophosphate phosphatase by transcription factor Gis1p.","date":"2003","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/12799368","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"18585168","id":"PMC_18585168","title":"Flow-batch technique for the simultaneous enzymatic determination of levodopa and carbidopa in pharmaceuticals using PLS and successive projections algorithm.","date":"2007","source":"Talanta","url":"https://pubmed.ncbi.nlm.nih.gov/18585168","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12385475","id":"PMC_12385475","title":"New molecular surface-based 3D-QSAR method using Kohonen neural network and 3-way PLS.","date":"2002","source":"Computers & chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/12385475","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"28454077","id":"PMC_28454077","title":"Comparison of FTIR-ATR and Raman spectroscopy in determination of VLDL triglycerides in blood serum with PLS regression.","date":"2017","source":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","url":"https://pubmed.ncbi.nlm.nih.gov/28454077","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"14689256","id":"PMC_14689256","title":"A quantitative structure-antifungal activity relationship study of oxygenated aromatic essential oil compounds using data structuring and PLS regression analysis.","date":"2003","source":"Journal of molecular modeling","url":"https://pubmed.ncbi.nlm.nih.gov/14689256","citation_count":24,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"15997377","id":"PMC_15997377","title":"Development of a PLS based method for determination of the quality of beers by use of NIR: spectral ranges and sample-introduction considerations.","date":"2005","source":"Analytical and bioanalytical chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/15997377","citation_count":22,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"16711735","id":"PMC_16711735","title":"Hierarchical PLS modeling for predicting the binding of a comprehensive set of structurally diverse protein-ligand complexes.","date":"2006","source":"Journal of chemical information and modeling","url":"https://pubmed.ncbi.nlm.nih.gov/16711735","citation_count":22,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"16564691","id":"PMC_16564691","title":"The application of a 3D-QSAR (autoMEP/PLS) approach as an efficient pharmacodynamic-driven filtering method for small-sized virtual library: application to a lead optimization of a human A3 adenosine receptor antagonist.","date":"2006","source":"Bioorganic & medicinal chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/16564691","citation_count":22,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"11016943","id":"PMC_11016943","title":"Regulation of the DPP1-encoded diacylglycerol pyrophosphate (DGPP) phosphatase by inositol and growth phase. Inhibition of DGPP phosphatase activity by CDP-diacylglyceron and activation of phosphatidylserine synthase activity by DGPP.","date":"2000","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/11016943","citation_count":21,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"36191002","id":"PMC_36191002","title":"A pals-25 gain-of-function allele triggers systemic resistance against natural pathogens of C. elegans.","date":"2022","source":"PLoS genetics","url":"https://pubmed.ncbi.nlm.nih.gov/36191002","citation_count":21,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"37096170","id":"PMC_37096170","title":"Determination of corn protein content using near-infrared spectroscopy combined with A-CARS-PLS.","date":"2023","source":"Food chemistry: X","url":"https://pubmed.ncbi.nlm.nih.gov/37096170","citation_count":20,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"18831790","id":"PMC_18831790","title":"Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis.","date":"2008","source":"BMC genomics","url":"https://pubmed.ncbi.nlm.nih.gov/18831790","citation_count":20,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"17471438","id":"PMC_17471438","title":"Staphylococcal chromosomal cassette mec type I, spa type, and expression of Pls are determinants of reduced cellular invasiveness of methicillin-resistant Staphylococcus aureus isolates.","date":"2007","source":"The Journal of infectious diseases","url":"https://pubmed.ncbi.nlm.nih.gov/17471438","citation_count":19,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"22750820","id":"PMC_22750820","title":"Preliminary antifungal and cytotoxic evaluation of synthetic cycloalkyl[b]thiophene derivatives with PLS-DA analysis.","date":"2012","source":"Acta pharmaceutica (Zagreb, Croatia)","url":"https://pubmed.ncbi.nlm.nih.gov/22750820","citation_count":19,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"32331211","id":"PMC_32331211","title":"Praeruptorin B Mitigates the Metastatic Ability of Human Renal Carcinoma Cells through Targeting CTSC and CTSV Expression.","date":"2020","source":"International journal of molecular sciences","url":"https://pubmed.ncbi.nlm.nih.gov/32331211","citation_count":17,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"17238252","id":"PMC_17238252","title":"Novel, customizable scoring functions, parameterized using N-PLS, for structure-based drug discovery.","date":"2007","source":"Journal of chemical information and modeling","url":"https://pubmed.ncbi.nlm.nih.gov/17238252","citation_count":17,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"20431145","id":"PMC_20431145","title":"Gene association networks from microarray data using a regularized estimation of partial correlation based on PLS regression.","date":"2010","source":"IEEE/ACM transactions on computational biology and bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/20431145","citation_count":17,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"35850283","id":"PMC_35850283","title":"WAL3 encoding a PLS-type PPR protein regulates chloroplast development in rice.","date":"2022","source":"Plant science : an international journal of experimental plant biology","url":"https://pubmed.ncbi.nlm.nih.gov/35850283","citation_count":15,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"19563532","id":"PMC_19563532","title":"Pals-associated tight junction protein functionally links dopamine and angiotensin II to the regulation of sodium transport in renal epithelial cells.","date":"2009","source":"British journal of pharmacology","url":"https://pubmed.ncbi.nlm.nih.gov/19563532","citation_count":15,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25944695","id":"PMC_25944695","title":"Irreversible dual inhibitory mode: the novel Btk inhibitor PLS-123 demonstrates promising anti-tumor activity in human B-cell lymphoma.","date":"2015","source":"Oncotarget","url":"https://pubmed.ncbi.nlm.nih.gov/25944695","citation_count":14,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"15022635","id":"PMC_15022635","title":"The operons, a criterion to compare the reliability of transcriptome analysis tools: ICA is more reliable than ANOVA, PLS and PCA.","date":"2004","source":"Computational biology and chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/15022635","citation_count":14,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"26807706","id":"PMC_26807706","title":"UPLC-QTOFMS based metabolomics followed by stepwise partial least square-discriminant analysis (PLS-DA) explore the possible relation between the variations in secondary metabolites and the phylogenetic divergences of the genus Panax.","date":"2016","source":"Journal of chromatography. B, Analytical technologies in the biomedical and life sciences","url":"https://pubmed.ncbi.nlm.nih.gov/26807706","citation_count":14,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"30969466","id":"PMC_30969466","title":"Decision Tree-PLS (DT-PLS) algorithm for the development of process: Specific local prediction models.","date":"2019","source":"Biotechnology progress","url":"https://pubmed.ncbi.nlm.nih.gov/30969466","citation_count":13,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"17654340","id":"PMC_17654340","title":"DFT study on the structure-toxicity relationship of dioxin compounds using PLS analysis.","date":"2007","source":"SAR and QSAR in environmental research","url":"https://pubmed.ncbi.nlm.nih.gov/17654340","citation_count":13,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"26205983","id":"PMC_26205983","title":"One mutation, two phenotypes: a single nonsense mutation of the CTSC gene causes two clinically distinct phenotypes.","date":"2015","source":"Clinical and experimental dermatology","url":"https://pubmed.ncbi.nlm.nih.gov/26205983","citation_count":13,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"16538457","id":"PMC_16538457","title":"Parallel factor analysis combined with PLS regression applied to the on-line monitoring of Pichia pastoris cultures.","date":"2006","source":"Analytical and bioanalytical chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/16538457","citation_count":13,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"32140593","id":"PMC_32140593","title":"Modelling the vigour of maize seeds submitted to artificial accelerated ageing based on ATR-FTIR data and chemometric tools (PCA, HCA and PLS-DA).","date":"2020","source":"Heliyon","url":"https://pubmed.ncbi.nlm.nih.gov/32140593","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"33985392","id":"PMC_33985392","title":"Phospholipids (PLs) know-how: exploring and exploiting phospholipase D for its industrial dissemination.","date":"2021","source":"Critical reviews in biotechnology","url":"https://pubmed.ncbi.nlm.nih.gov/33985392","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34366940","id":"PMC_34366940","title":"The Cognitive Model of Negative Symptoms in Schizophrenia: A Hierarchical Component Model With PLS-SEM.","date":"2021","source":"Frontiers in psychiatry","url":"https://pubmed.ncbi.nlm.nih.gov/34366940","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"31155560","id":"PMC_31155560","title":"A 3D-QSAR Analysis of CDK2 Inhibitors Using FMO Calculations and PLS Regression.","date":"2019","source":"Chemical & pharmaceutical bulletin","url":"https://pubmed.ncbi.nlm.nih.gov/31155560","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25226219","id":"PMC_25226219","title":"Aconitum and delphinium diterpenoid alkaloids of local anesthetic activity: comparative QSAR analysis based on GA-MLRA/PLS and optimal descriptors approach.","date":"2014","source":"Journal of environmental science and health. Part C, Environmental carcinogenesis & ecotoxicology reviews","url":"https://pubmed.ncbi.nlm.nih.gov/25226219","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23870404","id":"PMC_23870404","title":"Simultaneous measurement of two enzyme activities using infrared spectroscopy: A comparative evaluation of PARAFAC, TUCKER and N-PLS modeling.","date":"2013","source":"Analytica chimica acta","url":"https://pubmed.ncbi.nlm.nih.gov/23870404","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"24257141","id":"PMC_24257141","title":"Complementary PLS and KNN algorithms for improved 3D-QSDAR consensus modeling of AhR binding.","date":"2013","source":"Journal of cheminformatics","url":"https://pubmed.ncbi.nlm.nih.gov/24257141","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34002606","id":"PMC_34002606","title":"Sparse PLS-Based Method for Overlapping Metabolite Set Enrichment Analysis.","date":"2021","source":"Journal of proteome research","url":"https://pubmed.ncbi.nlm.nih.gov/34002606","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"25591104","id":"PMC_25591104","title":"Robust Ultraviolet-Visible (UV-Vis) Partial Least-Squares (PLS) Models for Tannin Quantification in Red Wine.","date":"2015","source":"Journal of agricultural and food chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/25591104","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"18082646","id":"PMC_18082646","title":"Potential of front face fluorescence associated to PLS regression to predict nutritional parameters in heat treated infant formula models.","date":"2007","source":"Analytica chimica acta","url":"https://pubmed.ncbi.nlm.nih.gov/18082646","citation_count":12,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"21088147","id":"PMC_21088147","title":"Development and validation of a simple cell-based fluorescence assay for dipeptidyl peptidase 1 (DPP1) activity.","date":"2010","source":"Journal of biomolecular screening","url":"https://pubmed.ncbi.nlm.nih.gov/21088147","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23397598","id":"PMC_23397598","title":"A novel seven-base deletion of the CTSC gene identified in a Hungarian family with Papillon-Lefèvre syndrome.","date":"2013","source":"Archives of dermatological research","url":"https://pubmed.ncbi.nlm.nih.gov/23397598","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34839038","id":"PMC_34839038","title":"Liver proteome profiling in dairy cows during the transition from gestation to lactation: Effects of supplementation with essential fatty acids and conjugated linoleic acids as explored by PLS-DA.","date":"2021","source":"Journal of proteomics","url":"https://pubmed.ncbi.nlm.nih.gov/34839038","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12927112","id":"PMC_12927112","title":"Multi-way PLS modeling of structure-activity data by incorporating electrostatic and lipophilic potentials on molecular surface.","date":"2003","source":"Computational biology and chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/12927112","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23452493","id":"PMC_23452493","title":"A nonlinearized multivariate dominant factor-based partial least squares (PLS) model for coal analysis by using laser-induced breakdown spectroscopy.","date":"2013","source":"Applied spectroscopy","url":"https://pubmed.ncbi.nlm.nih.gov/23452493","citation_count":11,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"35952999","id":"PMC_35952999","title":"Cathepsin C (CTSC) contributes to the antibacterial immunity in golden pompano (Trachinotus ovatus).","date":"2022","source":"Fish & shellfish immunology","url":"https://pubmed.ncbi.nlm.nih.gov/35952999","citation_count":10,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"24236725","id":"PMC_24236725","title":"Docking and PLS studies on a set of thiophenes RNA polymerase inhibitors against Staphylococcus aureus.","date":"2014","source":"Current topics in medicinal chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/24236725","citation_count":10,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"31068678","id":"PMC_31068678","title":"Exome sequencing identifies a novel missense variant in CTSC causing nonsyndromic aggressive periodontitis.","date":"2019","source":"Journal of human genetics","url":"https://pubmed.ncbi.nlm.nih.gov/31068678","citation_count":10,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"22066983","id":"PMC_22066983","title":"PLS-DA - Docking Optimized Combined Energetic Terms (PLSDA-DOCET) protocol: a brief evaluation.","date":"2011","source":"Journal of chemical information and modeling","url":"https://pubmed.ncbi.nlm.nih.gov/22066983","citation_count":10,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"34932608","id":"PMC_34932608","title":"A rare CTSC mutation in Papillon-Lefèvre Syndrome results in abolished serine protease activity and reduced NET formation but otherwise normal neutrophil function.","date":"2021","source":"PloS one","url":"https://pubmed.ncbi.nlm.nih.gov/34932608","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"36702253","id":"PMC_36702253","title":"Staphylococcal Periscope proteins Aap, SasG, and Pls project noncanonical legume-like lectin adhesin domains from the bacterial surface.","date":"2023","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/36702253","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"16883848","id":"PMC_16883848","title":"[The PLS calibration model optimization and determination of rice protein content by near-infrared reflectance spectroscopy].","date":"2006","source":"Guang pu xue yu guang pu fen xi = Guang pu","url":"https://pubmed.ncbi.nlm.nih.gov/16883848","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"17307386","id":"PMC_17307386","title":"A PLS-based extractive spectrophotometric method for simultaneous determination of carbamazepine and carbamazepine-10,11-epoxide in plasma and comparison with HPLC.","date":"2007","source":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","url":"https://pubmed.ncbi.nlm.nih.gov/17307386","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"15349970","id":"PMC_15349970","title":"Retrospective statistical analysis of lyophilized protein formulations of progenipoietin using PLS: determination of the critical parameters for long-term storage stability.","date":"2004","source":"Journal of pharmaceutical sciences","url":"https://pubmed.ncbi.nlm.nih.gov/15349970","citation_count":9,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"39414666","id":"PMC_39414666","title":"Biomarkers in cardiogenic shock: old pals, new friends.","date":"2024","source":"Annals of intensive care","url":"https://pubmed.ncbi.nlm.nih.gov/39414666","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"36119701","id":"PMC_36119701","title":"Poor sleep quality is associated with fatigue and depression in early Parkinson's disease: A longitudinal study in the PALS cohort.","date":"2022","source":"Frontiers in neurology","url":"https://pubmed.ncbi.nlm.nih.gov/36119701","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"36476235","id":"PMC_36476235","title":"Growth, physiological, and temperature characteristics in chinese cabbage pakchoi as affected by Cd- stressed conditions and identifying its main controlling factors using PLS model.","date":"2022","source":"BMC plant biology","url":"https://pubmed.ncbi.nlm.nih.gov/36476235","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"17943190","id":"PMC_17943190","title":"Detection of an intragenic deletion expands the spectrum of CTSC mutations in Papillon-Lefèvre syndrome.","date":"2007","source":"The Journal of investigative dermatology","url":"https://pubmed.ncbi.nlm.nih.gov/17943190","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"38353455","id":"PMC_38353455","title":"The Prostate Cancer Active Lifestyle Study (PALS): A randomized controlled trial of diet and exercise in overweight and obese men on active surveillance.","date":"2024","source":"Cancer","url":"https://pubmed.ncbi.nlm.nih.gov/38353455","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"23216988","id":"PMC_23216988","title":"Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression.","date":"2012","source":"BMC bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/23216988","citation_count":8,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"38951798","id":"PMC_38951798","title":"Rare variant analyses validate known ALS genes in a multi-ethnic population and identifies ANTXR2 as a candidate in PLS.","date":"2024","source":"BMC genomics","url":"https://pubmed.ncbi.nlm.nih.gov/38951798","citation_count":7,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"18467422","id":"PMC_18467422","title":"PaLS: filtering common literature, biological terms and pathway information.","date":"2008","source":"Nucleic acids research","url":"https://pubmed.ncbi.nlm.nih.gov/18467422","citation_count":7,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"39383651","id":"PMC_39383651","title":"Identification of novel indolinone derivatives as CTSC inhibitors to treat inflammatory bowel disease by modulating inflammatory factors.","date":"2024","source":"European journal of medicinal chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/39383651","citation_count":7,"is_preprint":false,"source_track":"pubmed_title"},{"pmid":"12477932","id":"PMC_12477932","title":"Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences.","date":"2002","source":"Proceedings of the National Academy of Sciences of the United States of America","url":"https://pubmed.ncbi.nlm.nih.gov/12477932","citation_count":1479,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"28514442","id":"PMC_28514442","title":"Architecture of the human interactome defines protein communities and disease networks.","date":"2017","source":"Nature","url":"https://pubmed.ncbi.nlm.nih.gov/28514442","citation_count":1085,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26496610","id":"PMC_26496610","title":"A human interactome in three quantitative dimensions organized by stoichiometries and abundances.","date":"2015","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/26496610","citation_count":1015,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"11076863","id":"PMC_11076863","title":"DNA cloning using in vitro site-specific recombination.","date":"2000","source":"Genome research","url":"https://pubmed.ncbi.nlm.nih.gov/11076863","citation_count":815,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33961781","id":"PMC_33961781","title":"Dual proteome-scale networks reveal cell-specific remodeling of the human interactome.","date":"2021","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/33961781","citation_count":705,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"22939629","id":"PMC_22939629","title":"A census of human soluble protein complexes.","date":"2012","source":"Cell","url":"https://pubmed.ncbi.nlm.nih.gov/22939629","citation_count":689,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"21873635","id":"PMC_21873635","title":"Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.","date":"2011","source":"Briefings in bioinformatics","url":"https://pubmed.ncbi.nlm.nih.gov/21873635","citation_count":656,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"19056867","id":"PMC_19056867","title":"Large-scale proteomics and phosphoproteomics of urinary exosomes.","date":"2008","source":"Journal of the American Society of Nephrology : JASN","url":"https://pubmed.ncbi.nlm.nih.gov/19056867","citation_count":607,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33450198","id":"PMC_33450198","title":"Cathepsin C promotes breast cancer lung metastasis by modulating neutrophil infiltration and neutrophil extracellular trap formation.","date":"2021","source":"Cancer cell","url":"https://pubmed.ncbi.nlm.nih.gov/33450198","citation_count":508,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"8125298","id":"PMC_8125298","title":"Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides.","date":"1994","source":"Gene","url":"https://pubmed.ncbi.nlm.nih.gov/8125298","citation_count":492,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"10581027","id":"PMC_10581027","title":"Loss-of-function mutations in the cathepsin C gene result in periodontal disease and palmoplantar keratosis.","date":"1999","source":"Nature genetics","url":"https://pubmed.ncbi.nlm.nih.gov/10581027","citation_count":365,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"22863883","id":"PMC_22863883","title":"A high-throughput approach for measuring temporal changes in the interactome.","date":"2012","source":"Nature methods","url":"https://pubmed.ncbi.nlm.nih.gov/22863883","citation_count":273,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"15146197","id":"PMC_15146197","title":"Transcriptome characterization elucidates signaling networks that control human ES cell growth and differentiation.","date":"2004","source":"Nature biotechnology","url":"https://pubmed.ncbi.nlm.nih.gov/15146197","citation_count":266,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"19199708","id":"PMC_19199708","title":"Proteomic analysis of human parotid gland exosomes by multidimensional protein identification technology (MudPIT).","date":"2009","source":"Journal of proteome research","url":"https://pubmed.ncbi.nlm.nih.gov/19199708","citation_count":237,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"11726493","id":"PMC_11726493","title":"Structure of human dipeptidyl peptidase I (cathepsin C): exclusion domain added to an endopeptidase framework creates the machine for activation of granular serine proteases.","date":"2001","source":"The EMBO journal","url":"https://pubmed.ncbi.nlm.nih.gov/11726493","citation_count":217,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"10593994","id":"PMC_10593994","title":"Mutations of the cathepsin C gene are responsible for Papillon-Lefèvre syndrome.","date":"1999","source":"Journal of medical genetics","url":"https://pubmed.ncbi.nlm.nih.gov/10593994","citation_count":213,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"8428921","id":"PMC_8428921","title":"Generation of active myeloid and lymphoid granule serine proteases requires processing by the granule thiol protease dipeptidyl peptidase I.","date":"1993","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/8428921","citation_count":180,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"15585850","id":"PMC_15585850","title":"Papillon-Lefèvre syndrome: correlating the molecular, cellular, and clinical consequences of cathepsin C/dipeptidyl peptidase I deficiency in humans.","date":"2004","source":"Journal of immunology (Baltimore, Md. : 1950)","url":"https://pubmed.ncbi.nlm.nih.gov/15585850","citation_count":164,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"11230166","id":"PMC_11230166","title":"Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs.","date":"2001","source":"Genome research","url":"https://pubmed.ncbi.nlm.nih.gov/11230166","citation_count":151,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"10662807","id":"PMC_10662807","title":"Haim-Munk syndrome and Papillon-Lefèvre syndrome are allelic mutations in cathepsin C.","date":"2000","source":"Journal of medical genetics","url":"https://pubmed.ncbi.nlm.nih.gov/10662807","citation_count":150,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"30397336","id":"PMC_30397336","title":"Identification of phagocytosis regulators using magnetic genome-wide CRISPR screens.","date":"2018","source":"Nature genetics","url":"https://pubmed.ncbi.nlm.nih.gov/30397336","citation_count":146,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"11327826","id":"PMC_11327826","title":"Human recombinant pro-dipeptidyl peptidase I (cathepsin C) can be activated by cathepsins L and S but not by autocatalytic processing.","date":"2001","source":"Biochemistry","url":"https://pubmed.ncbi.nlm.nih.gov/11327826","citation_count":142,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"23533145","id":"PMC_23533145","title":"In-depth proteomic analyses of exosomes isolated from expressed prostatic secretions in urine.","date":"2013","source":"Proteomics","url":"https://pubmed.ncbi.nlm.nih.gov/23533145","citation_count":138,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"9092576","id":"PMC_9092576","title":"Human dipeptidyl-peptidase I. Gene characterization, localization, and expression.","date":"1997","source":"The Journal of biological chemistry","url":"https://pubmed.ncbi.nlm.nih.gov/9092576","citation_count":131,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"23376485","id":"PMC_23376485","title":"Proteomic analysis of podocyte exosome-enriched fraction from normal human urine.","date":"2013","source":"Journal of proteomics","url":"https://pubmed.ncbi.nlm.nih.gov/23376485","citation_count":126,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"26631737","id":"PMC_26631737","title":"Genome-wide Association Studies Identify Genetic Loci Associated With Albuminuria in Diabetes.","date":"2015","source":"Diabetes","url":"https://pubmed.ncbi.nlm.nih.gov/26631737","citation_count":125,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"15231747","id":"PMC_15231747","title":"A protein interaction framework for human mRNA degradation.","date":"2004","source":"Genome research","url":"https://pubmed.ncbi.nlm.nih.gov/15231747","citation_count":123,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"10756096","id":"PMC_10756096","title":"Statistical analysis of the 5' untranslated region of human mRNA using \"Oligo-Capped\" cDNA libraries.","date":"2000","source":"Genomics","url":"https://pubmed.ncbi.nlm.nih.gov/10756096","citation_count":121,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"22082156","id":"PMC_22082156","title":"A whole genome screen for HIV restriction factors.","date":"2011","source":"Retrovirology","url":"https://pubmed.ncbi.nlm.nih.gov/22082156","citation_count":111,"is_preprint":false,"source_track":"gene2pubmed"},{"pmid":"33545068","id":"PMC_33545068","title":"Systematically defining selective autophagy receptor-specific cargo using autophagosome content profiling.","date":"2021","source":"Molecular cell","url":"https://pubmed.ncbi.nlm.nih.gov/33545068","citation_count":105,"is_preprint":false,"source_track":"gene2pubmed"}],"cost":{"stage1":{"model":"claude-sonnet-4-6","input_tokens":46588,"output_tokens":2542,"usd":0.088947},"stage2":{"model":"claude-opus-4-6","input_tokens":5894,"output_tokens":2533,"usd":0.139192},"total_usd":0.517071,"stage1_batch_id":"msgbatch_011kfMfE81o6CTcyhVUqumuW","stage2_batch_id":"msgbatch_01DTputS6WNkH745x1nVSoBo","note":"batch pricing = 50% of standard","round2_stage1":{"model":"claude-sonnet-4-6","input_tokens":59246,"output_tokens":5297,"usd":0.128597},"round2_rules_fired":"R3","round2_stage2":{"model":"claude-opus-4-6","input_tokens":8163,"output_tokens":2643,"usd":0.160335}},"stage1_raw":"```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2020,\n      \"finding\": \"CTSC (DPP-1/cathepsin C) is responsible for the activation of neutrophil serine proteases (including neutrophil elastase); pharmacological inhibition of CTSC with brensocatib reduces sputum neutrophil elastase activity in bronchiectasis patients.\",\n      \"method\": \"Randomized controlled clinical trial with direct measurement of sputum neutrophil elastase activity\",\n      \"journal\": \"The New England journal of medicine\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — clinical intervention with direct biochemical readout (NSP activity), replicated across two dose arms\",\n      \"pmids\": [\"32897034\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2002,\n      \"finding\": \"Mutations in the CTSC gene that cause Papillon-Lefèvre syndrome result in dramatically reduced or abolished CTSC protease enzyme activity, as measured in patient leukocytes; specific missense and nonsense mutations (e.g., p.Y32X, p.R272P, p.G139R, frameshift p.L381fsX393) were identified as causative.\",\n      \"method\": \"Biochemical enzyme activity assay in patient leukocytes; mutational analysis by sequencing\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — direct enzyme activity measurement in primary patient cells with genotype-phenotype correlation across multiple families\",\n      \"pmids\": [\"12112662\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2014,\n      \"finding\": \"CTSC mRNA is negatively regulated by miR-23a; activation of NK cells downregulates miR-23a, leading to upregulation of CTSC expression and increased granzyme B activity, thereby enhancing NK cell cytotoxicity. ATRA induces miR-23a, reduces CTSC expression and granzyme B activity, and impairs NK cell cytotoxicity in an in vivo mouse model.\",\n      \"method\": \"Integrated microRNA/mRNA expression profiling, functional validation with ATRA treatment in vitro and in vivo mouse model\",\n      \"journal\": \"The international journal of biochemistry & cell biology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — multiple orthogonal methods (expression profiling, functional ATRA experiments, in vivo model) but single lab\",\n      \"pmids\": [\"24440757\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2021,\n      \"finding\": \"In Papillon-Lefèvre syndrome neutrophils with loss-of-function CTSC mutations (503A>G substitution causing Y168C), CTSC protein is absent and CTSC and neutrophil serine protease (NSP: elastase, proteinase 3, cathepsin G) activities are abolished; NET formation upon PMA-stimulation is severely reduced, while other neutrophil functions (counts, morphology, chemotaxis, radical production, apoptosis) are largely unaffected.\",\n      \"method\": \"Functional characterization of patient neutrophils: enzyme activity assays, NET formation assay, chemotaxis, oxidative burst, apoptosis assays\",\n      \"journal\": \"PloS one\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple orthogonal functional assays in primary patient cells with defined genetic lesion\",\n      \"pmids\": [\"34932608\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2014,\n      \"finding\": \"Mutations across CTSC exons 5-7 (encoding the heavy chain) are enriched in PLS patients, and tetramerization domain mutations are associated with loss of CTSC enzymatic activity, suggesting tetramerization is important for CTSC catalytic function.\",\n      \"method\": \"Mutational analysis and review of published CTSC mutations with enzymatic activity correlations\",\n      \"journal\": \"Molecular genetics & genomic medicine\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 3 — mutational review with functional correlation across multiple published cases\",\n      \"pmids\": [\"24936511\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2007,\n      \"finding\": \"CTSC activity is undetectable in Papillon-Lefèvre syndrome patients carrying a novel intragenic deletion spanning exons 3-7 or a homozygous splice site mutation, demonstrating that these structural alterations abolish CTSC protease function.\",\n      \"method\": \"CTSC enzyme activity assay in patient samples; gene dosage analysis for intragenic deletion detection\",\n      \"journal\": \"The Journal of investigative dermatology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — direct enzyme activity measurement correlated with novel mutation types\",\n      \"pmids\": [\"17943190\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2010,\n      \"finding\": \"CTSC (DPP1) enzymatic activity can be measured in living cells (THP-1 cells) using the fluorogenic substrate H-Gly-Phe-AFC, which accesses the lysosomal compartment; this established CTSC as a lysosomal cysteinyl protease with dipeptidyl aminopeptidase activity measurable in situ.\",\n      \"method\": \"Cell-based fluorescence assay with substrate H-Gly-Phe-AFC in intact cells; comparison with isolated enzyme assays\",\n      \"journal\": \"Journal of biomolecular screening\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — direct enzymatic measurement in intact cells, establishes lysosomal localization of active enzyme\",\n      \"pmids\": [\"21088147\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2022,\n      \"finding\": \"CTSC (TroCTSC) in fish (Trachinotus ovatus) has conserved catalytic active sites (Cys251, His397, Asn419); mutagenesis of each abolished proteolytic activity in vitro. Recombinant CTSC optimal activity is at 40°C, pH 5.5; it is promoted by Zn2+ and Ca2+ but inhibited by Fe2+ and Cu2+. After bacterial infection, CTSC translocates from cytoplasm to nucleus in cell culture.\",\n      \"method\": \"In vitro enzyme activity assay of recombinant protein; site-directed mutagenesis of active-site residues; subcellular localization by immunofluorescence\",\n      \"journal\": \"Fish & shellfish immunology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 1 — in vitro reconstitution with mutagenesis of conserved catalytic residues; ortholog (teleost fish) consistent with mammalian CTSC\",\n      \"pmids\": [\"35952999\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"CTSC expression promotes RCC cell migration and invasion; its expression is downstream of the EGFR-MEK-ERK signaling pathway, and Praeruptorin B suppresses CTSC and CTSV mRNA and protein levels by inhibiting EGFR-MEK-ERK, thereby reducing cell migration and invasion.\",\n      \"method\": \"Cell migration/invasion assays; Western blot and RT-PCR for pathway components; pharmacological inhibition\",\n      \"journal\": \"International journal of molecular sciences\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 3 — single lab, pharmacological (indirect) evidence linking EGFR-MEK-ERK to CTSC expression without direct genetic epistasis\",\n      \"pmids\": [\"32331211\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2024,\n      \"finding\": \"A novel CTSC inhibitor (compound B22) inhibits CTSC activity by binding to the S2 pocket and S1 site of the enzyme, further suppressing downstream serine protease activity and reducing pro-inflammatory cytokine levels in an IBD model.\",\n      \"method\": \"In vitro CTSC enzyme activity assay; molecular docking/binding site characterization; in vivo IBD mouse model\",\n      \"journal\": \"European journal of medicinal chemistry\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 1-2 — in vitro enzyme assay with binding site characterization and in vivo functional validation\",\n      \"pmids\": [\"39383651\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2019,\n      \"finding\": \"A novel homozygous missense variant in CTSC exon 2 (c.G302C, p.Trp101Ser) causes aggressive periodontitis (AP1) without hyperkeratosis, establishing that CTSC loss-of-function can present as isolated periodontitis distinct from full PLS phenotype.\",\n      \"method\": \"Exome sequencing; co-segregation in consanguineous pedigree; in silico structural prediction\",\n      \"journal\": \"Journal of human genetics\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 3 — genetic identification with computational structural prediction but no direct enzyme activity measurement\",\n      \"pmids\": [\"31068678\"],\n      \"is_preprint\": false\n    }\n  ],\n  \"current_model\": \"CTSC (cathepsin C/DPP-1) is a lysosomal cysteine protease with dipeptidyl aminopeptidase activity that requires conserved catalytic residues (Cys, His, Asn) and tetramerization of its heavy chain for enzymatic function; its principal established role is the proteolytic activation of neutrophil serine proteases (elastase, proteinase 3, cathepsin G) and granzymes in immune cells, and loss-of-function CTSC mutations abolish NSP activity and impair NET formation, causing Papillon-Lefèvre syndrome, while its expression is post-transcriptionally regulated by miR-23a and transcriptionally by the EGFR-MEK-ERK pathway.\"\n}\n```\n\n--- ROUND 2 (R2/R3, augmented corpus) ---\n\nI'll systematically classify each paper as KEEP (about canonical CTSC/cathepsin C protein) or EXCLUDE (alias collision, alt-locus, or symbol collision), then extract mechanistic discoveries.\n\n**KEEP papers (about canonical CTSC/cathepsin C):**\n- PMID:32897034 — brensocatib inhibits DPP-1 (cathepsin C) to reduce neutrophil serine protease activation\n- PMID:24936511 — CTSC mutations in PLS, tetramerization important for activity\n- PMID:12112662 — CTSC mutations cause PLS, reduced enzyme activity\n- PMID:24440757 — miR-23a negatively regulates CTSC mRNA; CTSC activates granzymes\n- PMID:10581027 — loss-of-function CTSC mutations cause PLS\n- PMID:10593994 — CTSC mutations cause PLS\n- PMID:11726493 — crystal structure of cathepsin C (DPPI); exclusion domain mechanism\n- PMID:9092576 — gene characterization, expression, chromosomal localization\n- PMID:8428921 — DPPI (cathepsin C) required for activation of granule serine proteases\n- PMID:15585850 — PLS patients: CTSC deficiency abolishes neutrophil serine proteases but granzymes partially retained\n- PMID:10662807 — CTSC mutations cause Haim-Munk syndrome (allelic with PLS)\n- PMID:11327826 — pro-DPPI activation by cathepsins L and S; oligomeric structure; inhibition by cystatins\n- PMID:33450198 — CTSC promotes breast cancer lung metastasis via PR3-IL-1β-neutrophil/NET axis\n- PMID:35952999 — fish CTSC (TroCTSC): catalytic residues Cys251/His397/Asn419; lysosomal localization; antibacterial immunity\n- PMID:26205983 — same CTSC nonsense mutation can cause PLS or HMS phenotypes\n- PMID:34932608 — rare CTSC mutation abolishes serine protease activity; NET formation reduced\n- PMID:17943190 — intragenic deletion of CTSC causes PLS; CTSC activity undetectable\n- PMID:31068678 — novel CTSC missense variant causes aggressive periodontitis\n- PMID:21088147 — cell-based fluorescence assay for DPP1 (cathepsin C) activity\n- PMID:39383651 — novel CTSC inhibitor B22 binds S2 pocket and S1 site\n- PMID:33961781 — BioPlex 3.0 (interactome data, CTSC listed)\n- PMID:28514442 — BioPlex 2.0 interactome\n- PMID:23397598 — novel CTSC deletion mutation causes PLS\n- PMID:56490 — PMID:32331211 — Pra-B reduces CTSC/CTSV via EGFR-MEK-ERK pathway in RCC\n\n**EXCLUDE papers:** All papers about PLS (partial least squares statistics), PLS (primary lateral sclerosis), Pals (C. elegans proteins), PALS (splenic anatomy), DPP1 (yeast diacylglycerol pyrophosphate phosphatase — different gene), Pls (S. aureus protein), PPR proteins in plants, and other unrelated topics.\n\n```json\n{\n  \"discoveries\": [\n    {\n      \"year\": 2001,\n      \"finding\": \"Crystal structure of human dipeptidyl peptidase I (cathepsin C/CTSC) revealed that an exclusion domain transforms the papain-like endopeptidase framework into a tetrameric enzyme. The four active sites are exposed to solvent in a tetrahedral arrangement, the exclusion domain blocks access of polypeptide chains except at their termini, and Asp1 positions the N-terminal amino group of the substrate, explaining the strict dipeptidyl aminopeptidase specificity. Missense mutations causing Haim-Munk and Papillon-Lefèvre syndromes were mapped to positions that disrupt the fold.\",\n      \"method\": \"X-ray crystallography with functional/structural interpretation; mapping of disease mutations\",\n      \"journal\": \"The EMBO journal\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — crystal structure with mechanistic interpretation, disease mutation mapping; seminal single paper with multiple orthogonal insights\",\n      \"pmids\": [\"11726493\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 1993,\n      \"finding\": \"CTSC (dipeptidyl peptidase I, DPPI) is required for post-translational activation of granule serine proteases in immune cells. Inhibition of DPPI in CD8+ T cells, LAK cells, mast cells, and myeloid cells impaired generation of cathepsin G, granzyme, and other serine protease activities. In U-937 cells, DPPI inhibition caused accumulation of the pro-enzyme form of cathepsin G bearing its N-terminal dipeptide extension, demonstrating that DPPI cleaves the two-residue activation peptide.\",\n      \"method\": \"Pharmacological inhibition of DPPI in cell lines, pulse-chase radiolabeling, immunoblotting for pro- vs. active enzyme forms\",\n      \"journal\": \"The Journal of biological chemistry\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple cell types, direct biochemical demonstration of pro-enzyme accumulation upon DPPI inhibition; foundational mechanistic paper\",\n      \"pmids\": [\"8428921\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 1999,\n      \"finding\": \"Loss-of-function mutations in CTSC (cathepsin C gene at 11q14) cause Papillon-Lefèvre syndrome (autosomal recessive palmoplantar keratosis and periodontitis). Functional assays in patient leukocytes demonstrated near-total loss of cathepsin C enzymatic activity; obligate carriers showed reduced activity, confirming the causal relationship.\",\n      \"method\": \"Homozygosity mapping, genomic sequencing, functional enzyme activity assay in patient leukocytes\",\n      \"journal\": \"Nature genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1-2 — genetic mapping combined with direct enzymatic activity measurement in patient cells; independently confirmed in the same year by Hart et al.\",\n      \"pmids\": [\"10581027\", \"10593994\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2001,\n      \"finding\": \"Human pro-DPPI (pro-cathepsin C) expressed in insect cells is a dimer incapable of autoactivation. Cathepsin L efficiently activates pro-DPPI at pH 4.5 via two cleavage pathways: (1) initial cleavage within the pro-region followed by removal of the activation peptide and then separation into heavy/light chains, or (2) separation of the pro-region from the catalytic domain first. Cathepsin S is a less efficient activator; cathepsin B and DPPI itself cannot activate the proenzyme. Cystatin C and stefins A and B inhibit active DPPI with Ki values of 0.5–1.1 nM.\",\n      \"method\": \"Baculovirus expression, affinity purification of active and precursor forms, in vitro activation assays, kinetic inhibition measurements, CD spectroscopy, glycosylation analysis\",\n      \"journal\": \"Biochemistry\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1 — reconstituted in vitro activation system with multiple protease combinations, kinetic characterization; rigorous biochemical study\",\n      \"pmids\": [\"11327826\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2000,\n      \"finding\": \"Haim-Munk syndrome (HMS) and Papillon-Lefèvre syndrome (PLS) are allelic disorders caused by mutations in CTSC. A mutation in exon 6 of CTSC (2127A→G, changing a conserved amino acid) segregates with HMS in four nuclear families from the Cochin isolate. A mutation at the same codon (2126C→T) causes classical PLS in a Turkish family, demonstrating that different mutations at the same residue cause phenotypically distinct syndromes.\",\n      \"method\": \"Sequencing of CTSC in affected families, haplotype analysis, co-segregation analysis\",\n      \"journal\": \"Journal of medical genetics\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — multiple families, allelic mapping; confirms CTSC as causal gene for both syndromes\",\n      \"pmids\": [\"10662807\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2004,\n      \"finding\": \"In humans with Papillon-Lefèvre syndrome (CTSC/DPPI loss-of-function), neutrophil-derived serine proteases (cathepsin G, neutrophil elastase) are severely reduced in activity and stability, confirming DPPI's essential role in their activation. Surprisingly, granzyme activities in cytotoxic lymphocytes (LAK cells) are retained at significant levels, and LAK-mediated cytotoxicity against K562 is normal. Neutrophils from PLS patients do not uniformly fail to kill S. aureus and E. coli, indicating that serine proteases are not the principal bactericidal mechanism.\",\n      \"method\": \"Enzymatic activity assays in primary patient cells (neutrophils, LAK cells), bactericidal assays, immunological functional assays\",\n      \"journal\": \"Journal of immunology\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — primary patient cells, multiple orthogonal functional assays; defines the human cell-type specificity of CTSC-dependent serine protease activation\",\n      \"pmids\": [\"15585850\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 1997,\n      \"finding\": \"The human CTSC (dipeptidyl-peptidase I) gene spans ~3.5 kb and consists of two exons and one intron, a genomic organization distinct from other papain-type cysteine proteinases. By FISH, the gene maps to chromosomal region 11q14.1–q14.3. Northern analysis shows highest mRNA expression in lung, kidney, and placenta; high levels in polymorphonuclear leukocytes and alveolar macrophages. IL-2 treatment of lymphocytes significantly increases DPPI mRNA levels, indicating transcriptional regulation.\",\n      \"method\": \"Genomic cloning, FISH chromosomal mapping, Northern blot analysis, IL-2 stimulation experiment\",\n      \"journal\": \"The Journal of biological chemistry\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — direct experimental characterization of gene structure, chromosomal location, and regulated expression; foundational molecular genetics paper\",\n      \"pmids\": [\"9092576\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2014,\n      \"finding\": \"CTSC mRNA is negatively regulated by miR-23a. In NK cells, activation down-regulates miR-23a, which de-represses CTSC expression, leading to increased cathepsin C protein and granzyme B activity required for cytotoxicity. Treatment with all-trans retinoic acid (ATRA) induces miR-23a expression, decreases CTSC mRNA and protein levels, reduces granzyme B activity, and impairs NK cell cytotoxicity in a mouse model.\",\n      \"method\": \"miRNA/mRNA expression profiling, functional validation with ATRA treatment, in vivo mouse cytotoxicity model, granzyme B activity assay\",\n      \"journal\": \"The international journal of biochemistry & cell biology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2-3 — miR-23a/CTSC regulatory link validated in NK cells and in vivo, but mechanism of miRNA targeting not fully reconstituted\",\n      \"pmids\": [\"24440757\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2021,\n      \"finding\": \"Tumor-secreted CTSC promotes breast cancer lung metastasis by enzymatically activating neutrophil membrane-bound proteinase 3 (PR3). Activated PR3 processes IL-1β, which activates NF-κB, upregulating IL-6 and CCL3 for neutrophil recruitment. The CTSC–PR3–IL-1β axis also induces neutrophil ROS production and NET formation; NETs degrade thrombospondin-1 and support metastatic colonization. Pharmacological targeting of CTSC with compound AZD7986 suppresses lung metastasis in a mouse model.\",\n      \"method\": \"In vitro enzymatic activation assays, co-culture experiments, mouse metastasis models (genetic and pharmacological CTSC inhibition), NET quantification, thrombospondin-1 degradation assays, human tumor correlations\",\n      \"journal\": \"Cancer cell\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 1-2 — multiple orthogonal methods including enzymatic reconstitution, genetic knockdown, pharmacological inhibition, and in vivo models; independently consistent results\",\n      \"pmids\": [\"33450198\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"In a phase 2 clinical trial, brensocatib (INS1007), an oral reversible inhibitor of DPP-1 (cathepsin C/CTSC), reduced sputum neutrophil elastase activity in patients with bronchiectasis and prolonged time to first exacerbation, demonstrating that pharmacological inhibition of CTSC reduces downstream neutrophil serine protease activity in vivo in humans.\",\n      \"method\": \"Randomized double-blind placebo-controlled phase 2 trial; sputum neutrophil elastase activity measured as pharmacodynamic endpoint\",\n      \"journal\": \"The New England journal of medicine\",\n      \"confidence\": \"High\",\n      \"confidence_rationale\": \"Tier 2 — large randomized controlled trial directly measuring CTSC-dependent neutrophil elastase activity as pharmacodynamic readout in humans\",\n      \"pmids\": [\"32897034\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2002,\n      \"finding\": \"CTSC mutations in North American PLS families (including novel p.G139R) are associated with dramatically reduced CTSC protease enzyme activity in patient leukocytes, with almost no detectable activity. Biochemical analysis confirmed that mutations altering restriction enzyme sites in conserved regions of CTSC abolish enzymatic function.\",\n      \"method\": \"CTSC gene sequencing, restriction enzyme analysis, direct enzymatic activity measurement in patient leukocytes\",\n      \"journal\": \"Human mutation\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — direct functional assay in patient cells confirming loss of enzymatic activity for specific mutations\",\n      \"pmids\": [\"12112662\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2014,\n      \"finding\": \"Mutations in the CTSC gene cluster predominantly in exons 5–7, which encode the heavy chain of cathepsin C. This region is implicated in tetramerization, suggesting that tetramer formation is important for CTSC enzymatic activity. Genotype–phenotype analysis showed that the same CTSC mutation can produce different phenotypic severity, implicating modifier genes or environmental factors outside CTSC.\",\n      \"method\": \"Mutational database analysis, genotype-phenotype correlation across 75 published mutations\",\n      \"journal\": \"Molecular genetics & genomic medicine\",\n      \"confidence\": \"Low\",\n      \"confidence_rationale\": \"Tier 4 — database/correlative analysis without direct experimental validation of tetramerization role\",\n      \"pmids\": [\"24936511\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2021,\n      \"finding\": \"In a PLS patient with a 503A>G substitution in CTSC exon 4 (p.Tyr168Cys), neutrophil lysates lacked CTSC protein and showed no CTSC or neutrophil serine protease (NSP) activity. Neutrophil counts, morphology, priming, chemotaxis, radical production, and apoptosis regulation were normal, but NET formation upon PMA stimulation was severely depressed, identifying NSP-dependent NET formation as a specific functional deficit in CTSC-deficient neutrophils.\",\n      \"method\": \"Patient neutrophil functional assays: enzyme activity, chemotaxis, oxidative burst, NET quantification, apoptosis; comparison with another PLS mutation and healthy controls\",\n      \"journal\": \"PloS one\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2 — primary patient cells, multiple orthogonal functional readouts; demonstrates selective NET formation defect\",\n      \"pmids\": [\"34932608\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2024,\n      \"finding\": \"Novel indolinone-based CTSC inhibitor B22 was shown to inhibit CTSC enzymatic activity by binding to the S2 pocket and S1 site of cathepsin C. B22 further inhibits downstream neutrophil serine protease activity and exerts anti-inflammatory effects by modulating cytokine levels in inflammatory bowel disease models in vitro and in vivo.\",\n      \"method\": \"In vitro CTSC activity assay, molecular docking, in vivo IBD mouse model, cytokine measurement\",\n      \"journal\": \"European journal of medicinal chemistry\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 2-3 — direct enzymatic inhibition assay with binding site identification and in vivo validation; single study\",\n      \"pmids\": [\"39383651\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2022,\n      \"finding\": \"Fish CTSC (TroCTSC from Trachinotus ovatus) has three conserved catalytic residues (Cys251, His397, Asn419); mutation of any of these dramatically reduces proteolytic activity. Recombinant TroCTSC has optimal hydrolase activity at 40°C and pH 5.5, is promoted by Zn2+ and Ca2+ but inhibited by Fe2+ and Cu2+. TroCTSC localizes to the cytoplasm and partially co-localizes with lysosomes; after V. harveyi stimulation it translocates to the nucleus. Overexpression enhances bacterial clearance and pro-inflammatory cytokine expression; knockdown reduces antibacterial capacity.\",\n      \"method\": \"Site-directed mutagenesis of catalytic residues, recombinant protein activity assay, immunofluorescence/co-localization, in vivo overexpression/knockdown in goldfish\",\n      \"journal\": \"Fish & shellfish immunology\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 1-2 — catalytic residue mutagenesis with direct activity measurement; ortholog study with functional in vivo validation\",\n      \"pmids\": [\"35952999\"],\n      \"is_preprint\": false\n    },\n    {\n      \"year\": 2020,\n      \"finding\": \"Praeruptorin B reduces migration and invasion of renal carcinoma cells by suppressing the EGFR–MEK–ERK signaling pathway, which leads to downregulation of CTSC mRNA and protein expression. EGF-induced upregulation of CTSC was blocked by praeruptorin B, placing CTSC downstream of EGFR–MEK–ERK signaling in the regulation of tumor cell invasiveness.\",\n      \"method\": \"Cell migration/invasion assays, Western blot for CTSC and signaling proteins, EGF stimulation and praeruptorin B inhibition experiments\",\n      \"journal\": \"International journal of molecular sciences\",\n      \"confidence\": \"Medium\",\n      \"confidence_rationale\": \"Tier 3 — pathway placement via pharmacological inhibition in cell lines; single study\",\n      \"pmids\": [\"32331211\"],\n      \"is_preprint\": false\n    }\n  ],\n  \"current_model\": \"CTSC (cathepsin C/dipeptidyl peptidase I) is a lysosomal cysteine protease that functions as a tetramer with a unique exclusion domain that restricts substrate access to N-terminal dipeptides; it is the obligate activator of granule serine proteases (neutrophil elastase, cathepsin G, proteinase 3, granzymes) in immune cells by cleaving their two-residue activation peptides, a role confirmed by loss-of-function mutations causing Papillon-Lefèvre and Haim-Munk syndromes; active CTSC is generated by cathepsin L-mediated processing of its proenzyme, and its activity is regulated post-transcriptionally by miR-23a; in cancer contexts, secreted CTSC drives metastasis by activating neutrophil PR3 to process IL-1β, promoting NF-κB-dependent neutrophil recruitment and NET formation that degrades anti-metastatic thrombospondin-1.\"\n}\n```","stage2_raw":"```json\n{\n  \"mechanistic_narrative\": \"CTSC (cathepsin C/dipeptidyl peptidase 1) is a lysosomal cysteine protease whose principal function is the proteolytic activation of neutrophil serine proteases (elastase, proteinase 3, cathepsin G) and granzymes, thereby governing innate immune effector mechanisms including neutrophil extracellular trap (NET) formation and NK cell cytotoxicity [PMID:32897034, PMID:34932608, PMID:24440757]. The enzyme possesses dipeptidyl aminopeptidase activity dependent on a conserved Cys-His-Asn catalytic triad, requires tetramerization of its heavy chain for function, and operates optimally at acidic pH consistent with lysosomal residence [PMID:35952999, PMID:24936511, PMID:21088147]. Loss-of-function mutations in CTSC cause Papillon-Lefèvre syndrome, in which CTSC and downstream serine protease activities are abolished in patient leukocytes [PMID:12112662, PMID:34932608]. CTSC expression is post-transcriptionally repressed by miR-23a in NK cells, and pharmacological inhibition of CTSC (brensocatib) reduces neutrophil elastase activity in bronchiectasis patients, validating CTSC as a druggable upstream activator of inflammatory serine proteases [PMID:24440757, PMID:32897034].\",\n  \"teleology\": [\n    {\n      \"year\": 2002,\n      \"claim\": \"Establishing that CTSC loss-of-function mutations are the molecular cause of Papillon-Lefèvre syndrome resolved the genetic basis of this disease and linked specific missense, nonsense, and frameshift variants to abolished enzyme activity in patient leukocytes.\",\n      \"evidence\": \"Biochemical CTSC activity assays in patient leukocytes combined with mutation sequencing across multiple families\",\n      \"pmids\": [\"12112662\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Structural basis for how individual mutations abolish activity was not determined\",\n        \"Whether residual CTSC activity correlates with phenotypic severity was not quantified\"\n      ]\n    },\n    {\n      \"year\": 2007,\n      \"claim\": \"Demonstrating that large structural alterations (intragenic deletions spanning exons 3–7 and splice-site mutations) also abolish CTSC activity expanded the mutational spectrum underlying PLS beyond point mutations.\",\n      \"evidence\": \"CTSC enzyme activity assay in patient samples with gene dosage analysis for deletion detection\",\n      \"pmids\": [\"17943190\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"No complementation or rescue experiment to confirm causality of the deletion\",\n        \"Effects on CTSC protein processing and trafficking were not assessed\"\n      ]\n    },\n    {\n      \"year\": 2010,\n      \"claim\": \"Measurement of CTSC dipeptidyl aminopeptidase activity within intact living cells confirmed that the active enzyme resides in the lysosomal compartment and established a cell-based assay platform for inhibitor screening.\",\n      \"evidence\": \"Fluorogenic substrate (H-Gly-Phe-AFC) assay in intact THP-1 cells compared with isolated enzyme assays\",\n      \"pmids\": [\"21088147\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Whether CTSC has additional non-lysosomal sites of activity in primary immune cells was not addressed\",\n        \"Substrate specificity beyond the dipeptide was not explored\"\n      ]\n    },\n    {\n      \"year\": 2014,\n      \"claim\": \"Two advances clarified CTSC regulation and structure-function: (1) miR-23a was identified as a post-transcriptional repressor of CTSC mRNA in NK cells, linking CTSC upregulation to enhanced granzyme B activity and NK cytotoxicity; (2) enrichment of PLS mutations in exons 5–7 implicated tetramerization of the heavy chain as essential for catalytic function.\",\n      \"evidence\": \"miRNA/mRNA profiling with ATRA-mediated miR-23a induction in vitro and in vivo; mutational review correlating tetramerization-domain variants with enzyme activity loss\",\n      \"pmids\": [\"24440757\", \"24936511\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Direct demonstration that disrupting tetramerization alone abolishes activity (e.g., by engineering interface mutations) was not performed\",\n        \"Whether miR-23a regulates CTSC in neutrophils or other immune lineages is unknown\",\n        \"The structural basis for tetramer-dependent activation remains unresolved\"\n      ]\n    },\n    {\n      \"year\": 2019,\n      \"claim\": \"Identification of a CTSC missense variant (p.Trp101Ser) causing isolated aggressive periodontitis without hyperkeratosis demonstrated that CTSC loss-of-function can produce phenotypic spectra milder than full PLS.\",\n      \"evidence\": \"Exome sequencing with co-segregation analysis in a consanguineous pedigree; in silico structural prediction\",\n      \"pmids\": [\"31068678\"],\n      \"confidence\": \"Low\",\n      \"gaps\": [\n        \"No direct CTSC enzyme activity measurement was performed for this variant\",\n        \"Whether partial versus complete loss of CTSC activity explains the phenotypic difference is unknown\"\n      ]\n    },\n    {\n      \"year\": 2020,\n      \"claim\": \"A randomized clinical trial demonstrated that pharmacological inhibition of CTSC with brensocatib reduces neutrophil elastase activity in bronchiectasis patients, providing direct human evidence that CTSC is the rate-limiting activator of neutrophil serine proteases in vivo.\",\n      \"evidence\": \"Randomized controlled trial with sputum neutrophil elastase activity measurement across two dose arms\",\n      \"pmids\": [\"32897034\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Effects on other CTSC-dependent proteases (proteinase 3, cathepsin G) were not reported\",\n        \"Long-term consequences of sustained CTSC inhibition on immune defense are not established\"\n      ]\n    },\n    {\n      \"year\": 2021,\n      \"claim\": \"Functional profiling of PLS patient neutrophils (CTSC Y168C) showed that absent CTSC protein abolishes all three major NSP activities and severely impairs NET formation, while leaving chemotaxis, oxidative burst, and apoptosis intact, pinpointing CTSC's non-redundant role to serine protease activation and NET biology.\",\n      \"evidence\": \"Comprehensive functional assays (enzyme activity, NET formation, chemotaxis, oxidative burst, apoptosis) in primary patient neutrophils\",\n      \"pmids\": [\"34932608\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Mechanism by which loss of NSP activity leads to defective NET formation is not resolved\",\n        \"Whether NET impairment directly accounts for periodontal disease susceptibility was not tested\"\n      ]\n    },\n    {\n      \"year\": 2022,\n      \"claim\": \"Mutagenesis of the conserved catalytic triad residues (Cys251, His397, Asn419) in a fish CTSC ortholog confirmed that each is individually essential for proteolytic activity, establishing the catalytic mechanism as deeply conserved across vertebrates.\",\n      \"evidence\": \"Site-directed mutagenesis of recombinant TroCTSC with in vitro activity assays; immunofluorescence for localization\",\n      \"pmids\": [\"35952999\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Fish ortholog findings require confirmation in mammalian CTSC\",\n        \"Reported nuclear translocation upon infection lacks mechanistic explanation and mammalian validation\"\n      ]\n    },\n    {\n      \"year\": 2024,\n      \"claim\": \"Characterization of a novel CTSC inhibitor (B22) targeting the S2 pocket and S1 site confirmed the enzyme's active-site architecture as druggable and showed that CTSC inhibition suppresses downstream serine protease activity and pro-inflammatory cytokines in an IBD model.\",\n      \"evidence\": \"In vitro enzyme assay, molecular docking, and in vivo IBD mouse model\",\n      \"pmids\": [\"39383651\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Selectivity of B22 over other cysteine cathepsins was not fully established\",\n        \"Whether CTSC inhibition benefits IBD independently of neutrophil elastase suppression is unknown\"\n      ]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"The structural basis for how CTSC tetramerization enables catalysis, the precise mechanism linking CTSC/NSP loss to defective NET formation, and the full spectrum of CTSC substrates beyond serine proteases and granzymes remain unresolved.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"No high-resolution structure of disease-causing mutant CTSC tetramers exists\",\n        \"Whether CTSC has non-protease signaling roles is unexplored\",\n        \"Comprehensive substrate profiling in primary human immune cells has not been performed\"\n      ]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0140096\", \"supporting_discovery_ids\": [0, 1, 3, 7, 9]},\n      {\"term_id\": \"GO:0016787\", \"supporting_discovery_ids\": [6, 7]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005764\", \"supporting_discovery_ids\": [6]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"R-HSA-168256\", \"supporting_discovery_ids\": [0, 2, 3]},\n      {\"term_id\": \"R-HSA-392499\", \"supporting_discovery_ids\": [0, 3, 9]}\n    ],\n    \"complexes\": [],\n    \"partners\": [\n      \"ELANE\",\n      \"PRTN3\",\n      \"CTSG\",\n      \"GZMB\"\n    ],\n    \"other_free_text\": []\n  }\n}\n```\n\n--- ROUND 2 (R2/R3, augmented corpus) ---\n\n```json\n{\n  \"mechanistic_narrative\": \"CTSC (cathepsin C / dipeptidyl peptidase 1) is a lysosomal cysteine protease that functions as a tetramer whose unique exclusion domain restricts catalysis to the sequential removal of N-terminal dipeptides from polypeptide substrates [PMID:11726493]. CTSC is the obligate activator of granule serine proteases—neutrophil elastase, cathepsin G, proteinase 3, and granzymes—by cleaving their two-residue activation peptides; its proenzyme is itself processed by cathepsin L, while endogenous inhibitors (cystatins) and the microRNA miR-23a regulate its activity post-translationally and post-transcriptionally [PMID:8428921, PMID:11327826, PMID:24440757]. Loss-of-function mutations in CTSC cause the allelic Mendelian disorders Papillon-Lefèvre syndrome and Haim-Munk syndrome, characterized by palmoplantar keratosis, severe periodontitis, and impaired neutrophil serine protease activity and NET formation [PMID:10581027, PMID:10662807, PMID:34932608]. In cancer, tumor-secreted CTSC activates neutrophil PR3 to drive an IL-1β–NF-κB inflammatory cascade and NET-dependent degradation of thrombospondin-1, promoting metastatic colonization, a pathway pharmacologically targetable by DPP-1 inhibitors such as brensocatib [PMID:33450198, PMID:32897034].\",\n  \"teleology\": [\n    {\n      \"year\": 1993,\n      \"claim\": \"Establishing that CTSC is the enzyme responsible for converting pro-forms of granule serine proteases to their active forms resolved how immune effector cells generate mature cathepsin G, granzymes, and related proteases.\",\n      \"evidence\": \"Pharmacological DPPI inhibition in CD8+ T cells, LAK cells, mast cells, and myeloid lines with pulse-chase labeling showing pro-cathepsin G accumulation\",\n      \"pmids\": [\"8428921\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Structural basis of dipeptide specificity was unknown\",\n        \"Whether CTSC was essential for all granule serine proteases in all immune lineages was untested genetically\"\n      ]\n    },\n    {\n      \"year\": 1997,\n      \"claim\": \"Characterization of the CTSC gene structure and chromosomal location provided the molecular framework for subsequent disease-gene mapping and showed that CTSC expression is transcriptionally regulated by immune signals such as IL-2.\",\n      \"evidence\": \"Genomic cloning, FISH mapping to 11q14, Northern blot tissue profiling, and IL-2 stimulation of lymphocytes\",\n      \"pmids\": [\"9092576\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Promoter elements and transcription factors driving tissue-specific expression were not defined\",\n        \"Whether IL-2-induced upregulation has functional consequences for serine protease activation was not tested\"\n      ]\n    },\n    {\n      \"year\": 1999,\n      \"claim\": \"Identification of CTSC loss-of-function mutations as the cause of Papillon-Lefèvre syndrome, and subsequently Haim-Munk syndrome, proved that CTSC is essential for normal skin and periodontal homeostasis in humans and that the two syndromes are allelic.\",\n      \"evidence\": \"Homozygosity mapping, CTSC sequencing in affected families, enzymatic activity assays in patient leukocytes; allelic mutations at the same codon causing PLS vs. HMS\",\n      \"pmids\": [\"10581027\", \"10593994\", \"10662807\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"How CTSC deficiency leads specifically to periodontal destruction and keratosis remained mechanistically unclear\",\n        \"Modifier genes explaining phenotypic variability between allelic disorders were not identified\"\n      ]\n    },\n    {\n      \"year\": 2001,\n      \"claim\": \"The crystal structure explained how the exclusion domain converts the papain-fold endopeptidase into a strict dipeptidyl aminopeptidase by blocking internal polypeptide access, and showed that disease mutations disrupt this fold; concurrently, in vitro reconstitution demonstrated that cathepsin L is the physiological activator of pro-CTSC.\",\n      \"evidence\": \"X-ray crystallography of human CTSC tetramer with disease mutation mapping; baculovirus expression of pro-DPPI with in vitro activation by cathepsin L and kinetic inhibition by cystatins\",\n      \"pmids\": [\"11726493\", \"11327826\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Dynamics of tetramer assembly in vivo and whether monomeric/dimeric intermediates are functional were not addressed\",\n        \"In vivo confirmation that cathepsin L is the predominant activator in specific cell types was lacking\"\n      ]\n    },\n    {\n      \"year\": 2004,\n      \"claim\": \"Analysis of PLS patient immune cells revealed that CTSC dependence for serine protease activation is cell-type-specific: neutrophil cathepsin G and elastase are severely affected, whereas cytotoxic lymphocyte granzymes retain significant activity, resolving an apparent paradox of selective immunodeficiency.\",\n      \"evidence\": \"Enzymatic activity assays and bactericidal/cytotoxicity functional assays in primary neutrophils and LAK cells from PLS patients\",\n      \"pmids\": [\"15585850\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Alternative activation pathways for granzymes in lymphocytes were not identified\",\n        \"Whether residual granzyme activity is sufficient for all cytotoxic lymphocyte functions in vivo was not determined\"\n      ]\n    },\n    {\n      \"year\": 2014,\n      \"claim\": \"Discovery that miR-23a negatively regulates CTSC mRNA introduced a post-transcriptional control layer linking NK cell activation status to granzyme B maturation and cytotoxic potential.\",\n      \"evidence\": \"miRNA profiling in NK cells, ATRA-mediated miR-23a induction reducing CTSC protein and granzyme B activity, in vivo mouse cytotoxicity model\",\n      \"pmids\": [\"24440757\"],\n      \"confidence\": \"Medium\",\n      \"gaps\": [\n        \"Direct miR-23a binding site validation (e.g., luciferase reporter with mutation) was not shown\",\n        \"Relevance of this regulatory axis in neutrophils or other CTSC-dependent cell types was not tested\"\n      ]\n    },\n    {\n      \"year\": 2020,\n      \"claim\": \"A phase 2 clinical trial demonstrated that pharmacological inhibition of CTSC (brensocatib) reduces neutrophil elastase activity in vivo in humans and improves clinical outcomes in bronchiectasis, validating the CTSC–serine protease axis as a therapeutic target.\",\n      \"evidence\": \"Randomized double-blind placebo-controlled trial measuring sputum neutrophil elastase activity as pharmacodynamic endpoint\",\n      \"pmids\": [\"32897034\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Long-term immunological consequences of sustained CTSC inhibition in humans were not assessed\",\n        \"Whether brensocatib affects granzyme-dependent immunity was not evaluated\"\n      ]\n    },\n    {\n      \"year\": 2021,\n      \"claim\": \"Two advances clarified CTSC's roles in neutrophil biology and cancer: CTSC-deficient neutrophils were shown to have a selective defect in NET formation, and tumor-secreted CTSC was found to promote metastasis via a PR3–IL-1β–NF-κB cascade driving neutrophil recruitment and NET-mediated thrombospondin-1 degradation.\",\n      \"evidence\": \"PLS patient neutrophil functional profiling showing selective NET impairment; mouse metastasis models with genetic/pharmacological CTSC inhibition, enzymatic reconstitution of CTSC–PR3 activation, NET quantification\",\n      \"pmids\": [\"34932608\", \"33450198\"],\n      \"confidence\": \"High\",\n      \"gaps\": [\n        \"Whether tumor-secreted CTSC activates serine proteases beyond PR3 in the tumor microenvironment is unknown\",\n        \"The mechanism by which CTSC is secreted by tumor cells rather than retained in lysosomes was not elucidated\"\n      ]\n    },\n    {\n      \"year\": null,\n      \"claim\": \"Key unresolved questions include the structural basis of CTSC tetramer assembly dynamics in vivo, the identity of alternative granzyme-activating proteases in lymphocytes, and the long-term immunological safety profile of chronic CTSC inhibition.\",\n      \"evidence\": \"\",\n      \"pmids\": [],\n      \"confidence\": \"Low\",\n      \"gaps\": [\n        \"No in vivo evidence identifying the protease(s) that activate granzymes independently of CTSC\",\n        \"No structural data on conformational changes during CTSC proenzyme processing in a cellular context\",\n        \"Mechanism of CTSC secretion by tumor cells into the extracellular space is undefined\"\n      ]\n    }\n  ],\n  \"mechanism_profile\": {\n    \"molecular_activity\": [\n      {\"term_id\": \"GO:0140096\", \"supporting_discovery_ids\": [0, 1, 3, 5, 8, 9, 14]},\n      {\"term_id\": \"GO:0016787\", \"supporting_discovery_ids\": [0, 1, 3, 14]}\n    ],\n    \"localization\": [\n      {\"term_id\": \"GO:0005764\", \"supporting_discovery_ids\": [0, 3, 14]},\n      {\"term_id\": \"GO:0005576\", \"supporting_discovery_ids\": [8]}\n    ],\n    \"pathway\": [\n      {\"term_id\": \"GO:0005576\", \"supporting_discovery_ids\": [8]},\n      {\"term_id\": \"R-HSA-168256\", \"supporting_discovery_ids\": [1, 5, 8, 9, 12]},\n      {\"term_id\": \"R-HSA-392499\", \"supporting_discovery_ids\": [0, 1, 3]},\n      {\"term_id\": \"R-HSA-1643685\", \"supporting_discovery_ids\": [2, 4, 8, 10]}\n    ],\n    \"complexes\": [],\n    \"partners\": [\n      \"CTSG\",\n      \"ELANE\",\n      \"PRTN3\",\n      \"GZMB\",\n      \"CTSL\"\n    ],\n    \"other_free_text\": []\n  }\n}\n```"}