THAP3 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | THAP3, THAP domain containing 3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 612532; MGI: 1917126; HomoloGene: 18413; GeneCards: THAP3; OMA: THAP3 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. [7] The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain [8] and a host-cell factor 1C binding motif. [9] These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. [10] THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys. [7]
The H. sapiens THAP3 gene is a protein-encoding gene that is located on the plus strand of chromosome 1 [7] at cytogenetic location 1p36.31. [11] It is 10,727 base pairs long, spanning from genomic coordinates 6,624,868-6,635,595. [11] It contains 6 exons. [12]
In H. sapiens, THAP3 gene is expressed ubiquitously throughout different tissues, and expression is greatest in the kidneys. [13] It has also been determined that expression of THAP3 tends to be slightly higher in organs located in the abdomen and male and female sexual organs, such as the ovaries, testes, prostate, adrenal gland, spleen, liver, and colon, though expression in the kidneys is 1.4-1.5x higher than those organs. [13] THAP3 mRNA is 1.3x. more abundant in H. sapiens fetal brain tissue than in H. sapiens adult kidney tissue. [14]
Transcription of the THAP3 gene can result in 11 different mRNA variants, of which 8 are alternatively spliced and 3 are unspliced. [7] Variant 1 is the predominant variant and encodes THAP3 protein isoform 1. [7]
Variant | Sequence length (nucleotides) | Accession number [7] |
---|---|---|
1 | 1358 | NM_001195752.2 [16] |
2 | 2071 | NM_138350.4 [17] |
3 | 1361 | NM_001195753.2 [18] |
4 | 1262 | NM_001394496.1 [19] |
5 | 2050 | NM_001394497.1 [20] |
6 | 2047 | NM_001394498.1 [21] |
7 | 1123 | NM_001394499.1 [22] |
8 | 1120 | NM_001394500.1 [23] |
The H. sapiens THAP3 protein is predicted to have a molecular weight of 26.9 kilodaltons [24] and a pI of 10.26. [25] The amino acid sequence is isoleucine and tyrosine rich and arginine poor. [24] Characteristics domains of H. sapiens are the THAP domain (THAP) and the hell-cell factor 1C binding motif (HCM). [7]
Due to having 8 alternatively spliced variants, there are 8 THAP3 isoforms. [7]
Isoform | Sequence length (amino acids) | Accession number | Encoded by |
---|---|---|---|
1 | 238 | NP_001182681.1 [26] | Variant 1 |
2 | 175 | NP_612359.2 [27] | Variant 2 |
3 | 239 | NP_001182682.1 [28] | Variant 3 |
4 | 236 | NP_001381425.1 [29] | Variant 4 |
5 | 168 | NP_001381426.1 [30] | Variant 5 |
6 | 167 | NP_001381427.1 [31] | Variant 6 |
7 | 148 | NP_001381428.1 [32] | Variant 7 |
8 | 147 | NP_001381429.1 [33] | Variant 8 |
The predicted H. sapiens THAP3 tertiary structure contains a globular region and an alpha helix. [5] [6] The globular region is located near the N-terminus of the sequence and is the structure of the THAP domain. It spans amino acids 4-82. [37] The alpha helix is located from amino acids 186-230 and contains the host-cell factor 1C binding motif. [37]
THAP3 can be localized in the nucleus or mitochondria of H. sapiens cells. [38]
The H. sapiens the THAP3 protein has 30 predicted phosphorylation sites, 28 predicted O-β-glycosylation sites, and 11 predicted Yin-Yang sites. [35] [36] Many proteins involved in transcription regulation are influenced by phosphorylation and glycosylation sites, which corroborates THAP3's function. [39]
The H. sapiens THAP3 protein, along with several other proteins, is part of the THAP family of proteins. [40] All of these proteins contain the THAP domain and are, thus, paralogs of H. sapiens THAP3. [15]
Protein Name | E-Value! [15] | Percent Identity to THAP3 [15] |
---|---|---|
THAP1 [41] | 8×10−23 | 48.00 |
THAP2 [42] | 6×10−17 | 45.24 |
THAP5 [43] | 4×10−13 | 31.96 |
THAP6 [44] | 6×10−6 | 34.44 |
THAP7 [45] | 1×10−7 | 33.33 |
THAP8 [46] | 8×10−11 | 31.96 |
THAP9 [47] | 2×10−8 | 32.99 |
There are approximately 206 orthlologs of H. sapiens THAP3. [7] Orthologs can be found in a variety of taxomonic classes, including mammals, reptiles, amphibians, bony fishes, and cartilaginous fishes. [15] However, there are no orthologs in bacteria, fungi, protists, archaea, plants, invertebrates, or birds. [15] Additionally, not all orders are represented with in a class. For example, in reptiles, orthologs to H. sapiens THAP3 are found in testudines (turtles or tortoises) and not found in crocodilia (crocodiles and alligators) or squamata (lizards and snakes). [15] Similarly, there are only orthologs in apoda within amphibians. [15] There are no orthologs in anura (frogs) or urodela (salamanders). [15]
In closely related organisms, those diverged 0-160 million years ago (MYA), percent similarity of orthologs ranges from 36-82.9%. THAP3 sequences in rodents are the least conserved compared to H. sapiens. Sequences that diverged 319-353 MYA, those moderately related, have 47.2-68.9% similarity to H. sapiens THAP3, and 41.3-54.1% similarity in organisms that are distantly related, diverged 431-464 MYA.
Taxonomic Class | Scientific Name | Common Name | Taxonomic Order | Date of Divergence [50] | Accession Number [15] | Percent Identity to THAP3 | Percent Similarity to THAP3 [51] |
---|---|---|---|---|---|---|---|
Mammals | Marmota flaviventris | Yellow-bellied marmot | Rodentia | 87 | XP_027803226.1 [52] | 29.9 | 36 |
Lontra canadensis | North American river otter | Carnivora | 94 | XP_032719186.1 [53] | 59.5 | 65.8 | |
Eptesicus fuscus | Big brown bat | Chiroptera | 94 | XP_028016747.1 [54] | 65.0 | 69.6 | |
Balaenoptera musculus | Blue whale | Cetacea | 94 | XP_036686252.1 [55] | 77.5 | 82.9 | |
Dromiciops gliroides | Colocolo opossum | Microbiotheria | 160 | XP_043850206.1 [56] | 64.6 | 74.5 | |
Phascolarctos cinereus | Koala | Diprotodontia | 160 | XP_020830574.1 [57] | 65.7 | 76.4 | |
Reptiles | Caretta caretta | Loggerhead turtle | Testudines | 319 | XP_048680971.1 [58] | 36.9 | 47.2 |
Gopherus evgoodei | Goode's thornscrub tortoise | Testudines | 319 | XP_030393185.1 [59] | 48.9 | 58.1 | |
Chelonoidis abingdonii | Abingdon Island giant tortoise | Testudines | 319 | XP_032619750.1 [60] | 48.9 | 61.4 | |
Mauremys mutica | Yellow pond turtle | Testudines | 319 | XP_044852367.1 [61] | 49.0 | 60.9 | |
Amphibians | Microcaecilia unicolor | Microcaecilia unicolor | Gymnophiona | 353 | XP_030041702.1 [62] | 41.2 | 56.8 |
Geotrypetes seraphini | Gaboon caecilian | Gymnophiona | 353 | XP_033777236.1 [63] | 44.2 | 57.8 | |
Bony Fishes | Electrophorus electricus | Electric eel | Gymnotiformes | 431 | XP_026873261.2 [64] | 31.9 | 41.3 |
Coregonus clupeaformis | Lake whitefish | Salmoniformes | 431 | XP_041712304.2 [65] | 32.9 | 47.7 | |
Brienomyrus brachyistius | Baby whale | Osteoglossiformes | 431 | XP_048872538.1 [66] | 33.5 | 46.3 | |
Puntigrus tetrazona | Sumatra barb | Cypriniformes | 431 | XP_043081346.1 [67] | 34.0 | 48.1 | |
Cartilaginous Fishes | Rhincodon typus | Whale shark | Orectolobiformes | 464 | XP_020386430.1 [68] | 39.0 | 53.4 |
Chiloscyllium plagiosum | White-spotted bamboo shark | Orectolobiformes | 464 | XP_043531920.1 [69] | 39.0 | 53.0 | |
Amblyraja radiata | Thorny skate | Rajiformes | 464 | XP_032904038.1 [70] | 40.2 | 54.1 |
H. sapiens THAP3 has evolved at a rate similar to H. sapiens fibrinogen alpha, which is involved in the immune system. [15]
H. sapiens THAP3 interacts with proteins involved in various cellular processes, like transcription regulation and neuronal development. [10] It is also interacts with molecular chaperones during its translation.
Process | Protein Name | Identified By! [72] | Interaction Type |
---|---|---|---|
Transcription Regulation | CHAT | two hybrid assay | Functional |
FGFR3 | two hybrid assay | Functional | |
HCF1C [73] | affinity capture - mass spectrometry | Functional | |
OGT [73] | affinity capture - mass spectrometry | Functional | |
PKN1 | two hybrid assay | Functional | |
POLR2A | two hybrid assay | Functional | |
TARDBP | two hybrid assay | Functional | |
Neuronal Development | LSAMP | two hybrid assay | Functional |
DNAJB6 | two hybrid assay | Functional | |
Protein Folding | BAG6 | two hybrid assay | Developmental |
THAP3 contributes to the presentation of X-linked Dystonia-Parkinsonism, also known as Lubag Syndrome. [74] This disease is a neurodegenerative movement disorder that predominantly affects males of Filipino descent. [75] Symptoms include tremors, bradykinesia, rigidity, postural instability, shuffling gait and dystonia, which typically develops later in life. [75]
THAP3 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | THAP3, THAP domain containing 3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 612532; MGI: 1917126; HomoloGene: 18413; GeneCards: THAP3; OMA: THAP3 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. [7] The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain [8] and a host-cell factor 1C binding motif. [9] These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. [10] THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys. [7]
The H. sapiens THAP3 gene is a protein-encoding gene that is located on the plus strand of chromosome 1 [7] at cytogenetic location 1p36.31. [11] It is 10,727 base pairs long, spanning from genomic coordinates 6,624,868-6,635,595. [11] It contains 6 exons. [12]
In H. sapiens, THAP3 gene is expressed ubiquitously throughout different tissues, and expression is greatest in the kidneys. [13] It has also been determined that expression of THAP3 tends to be slightly higher in organs located in the abdomen and male and female sexual organs, such as the ovaries, testes, prostate, adrenal gland, spleen, liver, and colon, though expression in the kidneys is 1.4-1.5x higher than those organs. [13] THAP3 mRNA is 1.3x. more abundant in H. sapiens fetal brain tissue than in H. sapiens adult kidney tissue. [14]
Transcription of the THAP3 gene can result in 11 different mRNA variants, of which 8 are alternatively spliced and 3 are unspliced. [7] Variant 1 is the predominant variant and encodes THAP3 protein isoform 1. [7]
Variant | Sequence length (nucleotides) | Accession number [7] |
---|---|---|
1 | 1358 | NM_001195752.2 [16] |
2 | 2071 | NM_138350.4 [17] |
3 | 1361 | NM_001195753.2 [18] |
4 | 1262 | NM_001394496.1 [19] |
5 | 2050 | NM_001394497.1 [20] |
6 | 2047 | NM_001394498.1 [21] |
7 | 1123 | NM_001394499.1 [22] |
8 | 1120 | NM_001394500.1 [23] |
The H. sapiens THAP3 protein is predicted to have a molecular weight of 26.9 kilodaltons [24] and a pI of 10.26. [25] The amino acid sequence is isoleucine and tyrosine rich and arginine poor. [24] Characteristics domains of H. sapiens are the THAP domain (THAP) and the hell-cell factor 1C binding motif (HCM). [7]
Due to having 8 alternatively spliced variants, there are 8 THAP3 isoforms. [7]
Isoform | Sequence length (amino acids) | Accession number | Encoded by |
---|---|---|---|
1 | 238 | NP_001182681.1 [26] | Variant 1 |
2 | 175 | NP_612359.2 [27] | Variant 2 |
3 | 239 | NP_001182682.1 [28] | Variant 3 |
4 | 236 | NP_001381425.1 [29] | Variant 4 |
5 | 168 | NP_001381426.1 [30] | Variant 5 |
6 | 167 | NP_001381427.1 [31] | Variant 6 |
7 | 148 | NP_001381428.1 [32] | Variant 7 |
8 | 147 | NP_001381429.1 [33] | Variant 8 |
The predicted H. sapiens THAP3 tertiary structure contains a globular region and an alpha helix. [5] [6] The globular region is located near the N-terminus of the sequence and is the structure of the THAP domain. It spans amino acids 4-82. [37] The alpha helix is located from amino acids 186-230 and contains the host-cell factor 1C binding motif. [37]
THAP3 can be localized in the nucleus or mitochondria of H. sapiens cells. [38]
The H. sapiens the THAP3 protein has 30 predicted phosphorylation sites, 28 predicted O-β-glycosylation sites, and 11 predicted Yin-Yang sites. [35] [36] Many proteins involved in transcription regulation are influenced by phosphorylation and glycosylation sites, which corroborates THAP3's function. [39]
The H. sapiens THAP3 protein, along with several other proteins, is part of the THAP family of proteins. [40] All of these proteins contain the THAP domain and are, thus, paralogs of H. sapiens THAP3. [15]
Protein Name | E-Value! [15] | Percent Identity to THAP3 [15] |
---|---|---|
THAP1 [41] | 8×10−23 | 48.00 |
THAP2 [42] | 6×10−17 | 45.24 |
THAP5 [43] | 4×10−13 | 31.96 |
THAP6 [44] | 6×10−6 | 34.44 |
THAP7 [45] | 1×10−7 | 33.33 |
THAP8 [46] | 8×10−11 | 31.96 |
THAP9 [47] | 2×10−8 | 32.99 |
There are approximately 206 orthlologs of H. sapiens THAP3. [7] Orthologs can be found in a variety of taxomonic classes, including mammals, reptiles, amphibians, bony fishes, and cartilaginous fishes. [15] However, there are no orthologs in bacteria, fungi, protists, archaea, plants, invertebrates, or birds. [15] Additionally, not all orders are represented with in a class. For example, in reptiles, orthologs to H. sapiens THAP3 are found in testudines (turtles or tortoises) and not found in crocodilia (crocodiles and alligators) or squamata (lizards and snakes). [15] Similarly, there are only orthologs in apoda within amphibians. [15] There are no orthologs in anura (frogs) or urodela (salamanders). [15]
In closely related organisms, those diverged 0-160 million years ago (MYA), percent similarity of orthologs ranges from 36-82.9%. THAP3 sequences in rodents are the least conserved compared to H. sapiens. Sequences that diverged 319-353 MYA, those moderately related, have 47.2-68.9% similarity to H. sapiens THAP3, and 41.3-54.1% similarity in organisms that are distantly related, diverged 431-464 MYA.
Taxonomic Class | Scientific Name | Common Name | Taxonomic Order | Date of Divergence [50] | Accession Number [15] | Percent Identity to THAP3 | Percent Similarity to THAP3 [51] |
---|---|---|---|---|---|---|---|
Mammals | Marmota flaviventris | Yellow-bellied marmot | Rodentia | 87 | XP_027803226.1 [52] | 29.9 | 36 |
Lontra canadensis | North American river otter | Carnivora | 94 | XP_032719186.1 [53] | 59.5 | 65.8 | |
Eptesicus fuscus | Big brown bat | Chiroptera | 94 | XP_028016747.1 [54] | 65.0 | 69.6 | |
Balaenoptera musculus | Blue whale | Cetacea | 94 | XP_036686252.1 [55] | 77.5 | 82.9 | |
Dromiciops gliroides | Colocolo opossum | Microbiotheria | 160 | XP_043850206.1 [56] | 64.6 | 74.5 | |
Phascolarctos cinereus | Koala | Diprotodontia | 160 | XP_020830574.1 [57] | 65.7 | 76.4 | |
Reptiles | Caretta caretta | Loggerhead turtle | Testudines | 319 | XP_048680971.1 [58] | 36.9 | 47.2 |
Gopherus evgoodei | Goode's thornscrub tortoise | Testudines | 319 | XP_030393185.1 [59] | 48.9 | 58.1 | |
Chelonoidis abingdonii | Abingdon Island giant tortoise | Testudines | 319 | XP_032619750.1 [60] | 48.9 | 61.4 | |
Mauremys mutica | Yellow pond turtle | Testudines | 319 | XP_044852367.1 [61] | 49.0 | 60.9 | |
Amphibians | Microcaecilia unicolor | Microcaecilia unicolor | Gymnophiona | 353 | XP_030041702.1 [62] | 41.2 | 56.8 |
Geotrypetes seraphini | Gaboon caecilian | Gymnophiona | 353 | XP_033777236.1 [63] | 44.2 | 57.8 | |
Bony Fishes | Electrophorus electricus | Electric eel | Gymnotiformes | 431 | XP_026873261.2 [64] | 31.9 | 41.3 |
Coregonus clupeaformis | Lake whitefish | Salmoniformes | 431 | XP_041712304.2 [65] | 32.9 | 47.7 | |
Brienomyrus brachyistius | Baby whale | Osteoglossiformes | 431 | XP_048872538.1 [66] | 33.5 | 46.3 | |
Puntigrus tetrazona | Sumatra barb | Cypriniformes | 431 | XP_043081346.1 [67] | 34.0 | 48.1 | |
Cartilaginous Fishes | Rhincodon typus | Whale shark | Orectolobiformes | 464 | XP_020386430.1 [68] | 39.0 | 53.4 |
Chiloscyllium plagiosum | White-spotted bamboo shark | Orectolobiformes | 464 | XP_043531920.1 [69] | 39.0 | 53.0 | |
Amblyraja radiata | Thorny skate | Rajiformes | 464 | XP_032904038.1 [70] | 40.2 | 54.1 |
H. sapiens THAP3 has evolved at a rate similar to H. sapiens fibrinogen alpha, which is involved in the immune system. [15]
H. sapiens THAP3 interacts with proteins involved in various cellular processes, like transcription regulation and neuronal development. [10] It is also interacts with molecular chaperones during its translation.
Process | Protein Name | Identified By! [72] | Interaction Type |
---|---|---|---|
Transcription Regulation | CHAT | two hybrid assay | Functional |
FGFR3 | two hybrid assay | Functional | |
HCF1C [73] | affinity capture - mass spectrometry | Functional | |
OGT [73] | affinity capture - mass spectrometry | Functional | |
PKN1 | two hybrid assay | Functional | |
POLR2A | two hybrid assay | Functional | |
TARDBP | two hybrid assay | Functional | |
Neuronal Development | LSAMP | two hybrid assay | Functional |
DNAJB6 | two hybrid assay | Functional | |
Protein Folding | BAG6 | two hybrid assay | Developmental |
THAP3 contributes to the presentation of X-linked Dystonia-Parkinsonism, also known as Lubag Syndrome. [74] This disease is a neurodegenerative movement disorder that predominantly affects males of Filipino descent. [75] Symptoms include tremors, bradykinesia, rigidity, postural instability, shuffling gait and dystonia, which typically develops later in life. [75]