![]() | Review waiting, please be patient.
This may take 3 months or more, since drafts are reviewed in no specific order. There are 2,692 pending submissions waiting for review.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
Reviewer tools
|
Submission declined on 29 July 2024 by
Bobby Cohn (
talk). This submission is not adequately supported by
reliable sources. Reliable sources are required so that information can be
verified. If you need help with referencing, please see
Referencing for beginners and
Citing sources.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
This draft has been resubmitted and is currently awaiting re-review. | ![]() |
Coiled-coil domain containing 97 (CCDC97) is a protein encoded by the CCDC97 gene. [1] This gene is a member of the CCDC family and has 2 transcriptional variants. [2]
CCDC97, also known as FLJ40267 and MGC20255, is located at 19q13.2 on the plus strand in humans and has 6 exons. [3] Orthologs for this gene can be found in mammals, reptiles, amphibians, birds, fish, and invertebrates [4]. Transcriptional variant 1 or protein isoform 1 [5] has 3329 base pairs and encodes the longer protein isoform and contains 343 amino acids.
The CCD97 gene produces 5 different mRNAs; 3 alternatively spliced variants and 2 unsliced variants, with 2 spliced and unspliced mRNA encoding 4 good proteins resulting in 4 isoforms. [6]
The CCDC97 protein isoform 1 has a molecular mass of ~39 kDa [7] and a predicted isoelectric point of 4.5. [8] It is rich in acids such as aspartic acid (D) and glutamic acid (E) which are primarily located in the C-terminus. [9] In humans there is a protein abundance of 6.22ppm. [10] Two strong supported motifs found on the protein are DUF052 and an E-rich region. [11]
Rate of Mutation
CCDC97 has an average rate of mutation when compared to a gene known to mutate slowly ( cytochrome c) and quickly ( Fibrinogen alpha)
Paralogs
There are no paralogs for CCDC97.
Orthologs
Orthologs for CCDC97 can be found in most vertebrates as well as invertebrates [12]. Aves (Birds) have sequence identities that are lower than expected, suggesting that this gene has greatly mutated in birds. Invertebrates are the most distantly related to humans with the lowest sequence identities.
CCDC97 | Genus and Species | Common Name | Taxanomic Group | Median Date of Divergance (MYA) | Accession Number | Sequence Length (aa) | Sequence Identity (%) | Sequence Similarity (%) |
Mammals | Homo sapiens | Humans | Primates | 0 | NM_052848 | 343 | 100% | 100% |
Cavia porcellus | Domestic Guinea Pig | Rodentia | 87 | XP_003462073 | 342 | 89.80% | 94.20% | |
Physeter catodon | Sperm Whale | Cetartiodactyla | 94 | XP_007128179 | 347 | 88.80% | 91.40% | |
Artibeus jamaicensis | Jamaican Fruit Bat | Chiroptera | 94 | XP_037013554 | 361 | 84.80% | 87.30% | |
Sarcophilus harrisii | Tasmanian Devil | Dasyuromorphia | 160 | XP_031819750 | 332 | 64.40% | 75.60% | |
Tachyglossus aculeatus | Australian Echidna | Monotremata | 180 | XP_038623271 | 330 | 59.00% | 68.10% | |
Reptlia | Python bivittatus | Burmese Python | Squamata | 319 | XP_007421554 | 345 | 51.50% | 62.90% |
Alligator mississippiensis | American Alligator | Crocodilia | 319 | XP_059574710 | 309 | 51.30% | 63.00% | |
Aves | Accipiter gentilis | Northern Goshawk | Cuculiformes | 319 | XP_049652563 | 303 | 41.10% | 50.30% |
Phalacrocorax carbo | Great Cormorant | Suliformes | 319 | XP_064296149 | 317 | 37.70% | 46.30% | |
Amphibia | Xenopus tropicalis | Tropical Clawed Frog | Anura | 325 | XP_012823864 | 300 | 46.20% | 61.90% |
Microcaecilia unicolor | Microcaecilia Unicolor | Gymnophiona | 352 | XP_030075449 | 315 | 47.10% | 61.40% | |
Fish | Protopterus annectens | West African Lungfish | Lepidosireniformes | 408 | XP_043933492 | 354 | 45.20% | 60.20% |
Latimeria chalumnae | Coelacanth | Coelacanthiformes | 415 | XP_014349074 | 339 | 47.50% | 63.70% | |
Acipenser ruthenus | Sterlet | Acipenseriformes | 429 | XP_033881880 | 363 | 46.30% | 57.90% | |
Leucoraja erinacea | Little Skate | Rajiformes | 462 | XP_055519601 | 344 | 46.10% | 63.30% | |
Callorhinchus milii | Elephant Shark | Chimaeriformes | 462 | XP_007909130 | 326 | 45.00% | 61.20% | |
Petromyzon marinus | Sea Lamprey | Petromyzontiformes | 563 | XP_032821086 | 314 | 40.70% | 57.10% | |
Invertebrate | Centruroides sculpturatus | Arizona Bark Scorpion | Scorpiones | 686 | XP_023213136.1 | 284 | 31.50% | 48.10% |
Caenorhabditis elegans | Roundworm | Rhabditida | 708 | NP_506468 | 301 | 28.20% | 45.50% | |
Ylistrum balloti | Ballot's Saucer Scallop | Pectinida | 708 | XP_060071957.1 | 343 | 28.00% | 42.20% |
The promoter and gene sequence for the gene CCDC97 is located between chr19:41,309,673-41,310,813. [13]
Name | Class | Family |
KLF3 | C2H2 zinc finger factors | Three-zinc finger Kruppel-related |
ZNF454 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Thap11 | C2CH THAP-type zinc finger factors | THAP-related factors |
SOX14 | High-mobility group (HMG) domain factors | SOX-related factors |
PKNOX1 | Homeo domain factors | TALE-type homeo domain factors |
ZNF530 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Nrf1 | Basic leucine zipper factors (bZIP) | Jun-related |
ZNF213 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Name | Score | Sequence |
hsa-miR-486-3p | 99 | ctgcccca |
hsa-miR-30a-5p | 99 | tgtttaca |
hsa-miR-8085 | 98 | ctctccc |
hsa-miR-4524a-3p | 97 | ctgtctc |
hsa-miR-450a-2-3p | 92 | tccccaa |
Name | Score | Sequence |
A2BP1 | 11.1 | UGCAUG |
HNRNPA1 | 9.9 | UAGGGA |
NONO | 8.9 | AGGGA |
CCDC97 has very high ubiquitous expression in most human tissue types [17]. The highest levels of expression are found in the ovaries (RPKM 6.9), lymph node (RPKM 6.7), spleen (RPKM 6.2), appendix (RPKM 5.9), and endometrium (RPKM 5.3) when testis (RPKM 8.9) are excluded [18]
Post-translational modifications that are predicted to occur for protein isoform 1 of CCDC97 are phosphorylation [19], sumoylation [20], and O-GalNAc glycosylation [21].
ELM [22] found the most localization signals for the cytoplasm and the nucleus. PSORT II Prediction [23] predicted 43.5% of the CCDC97 protein to be located in the nucleus, 21.7% in the mitochondria, and 17.4% in the cytoplasm.
CCDC97 protein isoform 1 has been found to interact with over 50 different proteins. [25]
Top predicted protein interactants for CCDC97 are SF3B6 (Splicing factor 3b subunit 6), SF3B5 (Splicing factor 3b subunit 5), SF3B1 (Splicing factor 3b subunit 1), SF3B3 (Splicing factor 3b subunit 3), SF3A1 (splicing factor 3a, subunit 1), ZRSR2 (zinc finger (CCCH type), RNA-binding motif and serine/arginine-rich 2) and TTC33 (tetratricopeptide repeat domain 33). [26] CCDC97 has also been predicted to notably interactant with MAPK14 [27] (mitogen-activated protein kinase 14), TIGD6 [28] (tigger transposable element derived), and SRPK2 [29] (SRSF protein kinase 2).
High co-expressions of CCDC97 with Dual-Specificity Tyrosine-(Y)-Phosphorylation Regulated Kinase 1B ( DYRK1B) is associated with decreased rates of survival for triple-negative breast cancer (TNBC) patients. [30] CCDC97 has also been found to be linked to Camurati-Engelmann Disease due to its proximity to transforming growth factor beta 1 ( TGFB1). [31]
![]() | Review waiting, please be patient.
This may take 3 months or more, since drafts are reviewed in no specific order. There are 2,692 pending submissions waiting for review.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
Reviewer tools
|
Submission declined on 29 July 2024 by
Bobby Cohn (
talk). This submission is not adequately supported by
reliable sources. Reliable sources are required so that information can be
verified. If you need help with referencing, please see
Referencing for beginners and
Citing sources.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
This draft has been resubmitted and is currently awaiting re-review. | ![]() |
Coiled-coil domain containing 97 (CCDC97) is a protein encoded by the CCDC97 gene. [1] This gene is a member of the CCDC family and has 2 transcriptional variants. [2]
CCDC97, also known as FLJ40267 and MGC20255, is located at 19q13.2 on the plus strand in humans and has 6 exons. [3] Orthologs for this gene can be found in mammals, reptiles, amphibians, birds, fish, and invertebrates [4]. Transcriptional variant 1 or protein isoform 1 [5] has 3329 base pairs and encodes the longer protein isoform and contains 343 amino acids.
The CCD97 gene produces 5 different mRNAs; 3 alternatively spliced variants and 2 unsliced variants, with 2 spliced and unspliced mRNA encoding 4 good proteins resulting in 4 isoforms. [6]
The CCDC97 protein isoform 1 has a molecular mass of ~39 kDa [7] and a predicted isoelectric point of 4.5. [8] It is rich in acids such as aspartic acid (D) and glutamic acid (E) which are primarily located in the C-terminus. [9] In humans there is a protein abundance of 6.22ppm. [10] Two strong supported motifs found on the protein are DUF052 and an E-rich region. [11]
Rate of Mutation
CCDC97 has an average rate of mutation when compared to a gene known to mutate slowly ( cytochrome c) and quickly ( Fibrinogen alpha)
Paralogs
There are no paralogs for CCDC97.
Orthologs
Orthologs for CCDC97 can be found in most vertebrates as well as invertebrates [12]. Aves (Birds) have sequence identities that are lower than expected, suggesting that this gene has greatly mutated in birds. Invertebrates are the most distantly related to humans with the lowest sequence identities.
CCDC97 | Genus and Species | Common Name | Taxanomic Group | Median Date of Divergance (MYA) | Accession Number | Sequence Length (aa) | Sequence Identity (%) | Sequence Similarity (%) |
Mammals | Homo sapiens | Humans | Primates | 0 | NM_052848 | 343 | 100% | 100% |
Cavia porcellus | Domestic Guinea Pig | Rodentia | 87 | XP_003462073 | 342 | 89.80% | 94.20% | |
Physeter catodon | Sperm Whale | Cetartiodactyla | 94 | XP_007128179 | 347 | 88.80% | 91.40% | |
Artibeus jamaicensis | Jamaican Fruit Bat | Chiroptera | 94 | XP_037013554 | 361 | 84.80% | 87.30% | |
Sarcophilus harrisii | Tasmanian Devil | Dasyuromorphia | 160 | XP_031819750 | 332 | 64.40% | 75.60% | |
Tachyglossus aculeatus | Australian Echidna | Monotremata | 180 | XP_038623271 | 330 | 59.00% | 68.10% | |
Reptlia | Python bivittatus | Burmese Python | Squamata | 319 | XP_007421554 | 345 | 51.50% | 62.90% |
Alligator mississippiensis | American Alligator | Crocodilia | 319 | XP_059574710 | 309 | 51.30% | 63.00% | |
Aves | Accipiter gentilis | Northern Goshawk | Cuculiformes | 319 | XP_049652563 | 303 | 41.10% | 50.30% |
Phalacrocorax carbo | Great Cormorant | Suliformes | 319 | XP_064296149 | 317 | 37.70% | 46.30% | |
Amphibia | Xenopus tropicalis | Tropical Clawed Frog | Anura | 325 | XP_012823864 | 300 | 46.20% | 61.90% |
Microcaecilia unicolor | Microcaecilia Unicolor | Gymnophiona | 352 | XP_030075449 | 315 | 47.10% | 61.40% | |
Fish | Protopterus annectens | West African Lungfish | Lepidosireniformes | 408 | XP_043933492 | 354 | 45.20% | 60.20% |
Latimeria chalumnae | Coelacanth | Coelacanthiformes | 415 | XP_014349074 | 339 | 47.50% | 63.70% | |
Acipenser ruthenus | Sterlet | Acipenseriformes | 429 | XP_033881880 | 363 | 46.30% | 57.90% | |
Leucoraja erinacea | Little Skate | Rajiformes | 462 | XP_055519601 | 344 | 46.10% | 63.30% | |
Callorhinchus milii | Elephant Shark | Chimaeriformes | 462 | XP_007909130 | 326 | 45.00% | 61.20% | |
Petromyzon marinus | Sea Lamprey | Petromyzontiformes | 563 | XP_032821086 | 314 | 40.70% | 57.10% | |
Invertebrate | Centruroides sculpturatus | Arizona Bark Scorpion | Scorpiones | 686 | XP_023213136.1 | 284 | 31.50% | 48.10% |
Caenorhabditis elegans | Roundworm | Rhabditida | 708 | NP_506468 | 301 | 28.20% | 45.50% | |
Ylistrum balloti | Ballot's Saucer Scallop | Pectinida | 708 | XP_060071957.1 | 343 | 28.00% | 42.20% |
The promoter and gene sequence for the gene CCDC97 is located between chr19:41,309,673-41,310,813. [13]
Name | Class | Family |
KLF3 | C2H2 zinc finger factors | Three-zinc finger Kruppel-related |
ZNF454 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Thap11 | C2CH THAP-type zinc finger factors | THAP-related factors |
SOX14 | High-mobility group (HMG) domain factors | SOX-related factors |
PKNOX1 | Homeo domain factors | TALE-type homeo domain factors |
ZNF530 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Nrf1 | Basic leucine zipper factors (bZIP) | Jun-related |
ZNF213 | C2H2 zinc finger factors | More than 3 adjacent zinc fingers |
Name | Score | Sequence |
hsa-miR-486-3p | 99 | ctgcccca |
hsa-miR-30a-5p | 99 | tgtttaca |
hsa-miR-8085 | 98 | ctctccc |
hsa-miR-4524a-3p | 97 | ctgtctc |
hsa-miR-450a-2-3p | 92 | tccccaa |
Name | Score | Sequence |
A2BP1 | 11.1 | UGCAUG |
HNRNPA1 | 9.9 | UAGGGA |
NONO | 8.9 | AGGGA |
CCDC97 has very high ubiquitous expression in most human tissue types [17]. The highest levels of expression are found in the ovaries (RPKM 6.9), lymph node (RPKM 6.7), spleen (RPKM 6.2), appendix (RPKM 5.9), and endometrium (RPKM 5.3) when testis (RPKM 8.9) are excluded [18]
Post-translational modifications that are predicted to occur for protein isoform 1 of CCDC97 are phosphorylation [19], sumoylation [20], and O-GalNAc glycosylation [21].
ELM [22] found the most localization signals for the cytoplasm and the nucleus. PSORT II Prediction [23] predicted 43.5% of the CCDC97 protein to be located in the nucleus, 21.7% in the mitochondria, and 17.4% in the cytoplasm.
CCDC97 protein isoform 1 has been found to interact with over 50 different proteins. [25]
Top predicted protein interactants for CCDC97 are SF3B6 (Splicing factor 3b subunit 6), SF3B5 (Splicing factor 3b subunit 5), SF3B1 (Splicing factor 3b subunit 1), SF3B3 (Splicing factor 3b subunit 3), SF3A1 (splicing factor 3a, subunit 1), ZRSR2 (zinc finger (CCCH type), RNA-binding motif and serine/arginine-rich 2) and TTC33 (tetratricopeptide repeat domain 33). [26] CCDC97 has also been predicted to notably interactant with MAPK14 [27] (mitogen-activated protein kinase 14), TIGD6 [28] (tigger transposable element derived), and SRPK2 [29] (SRSF protein kinase 2).
High co-expressions of CCDC97 with Dual-Specificity Tyrosine-(Y)-Phosphorylation Regulated Kinase 1B ( DYRK1B) is associated with decreased rates of survival for triple-negative breast cancer (TNBC) patients. [30] CCDC97 has also been found to be linked to Camurati-Engelmann Disease due to its proximity to transforming growth factor beta 1 ( TGFB1). [31]