This article needs additional citations for
verification. (February 2024) |
Part of a series on |
Evolutionary biology |
---|
In biology, phylogenetics ( /ˌfaɪloʊdʒəˈnɛtɪks, -lə-/) [1] [2] [3] is the study of the evolutionary history and relationships among or within groups of organisms. These relationships are determined by phylogenetic inference, methods that focus on observed heritable traits, such as DNA sequences, protein amino acid sequences, or morphology. The result of such an analysis is a phylogenetic tree—a diagram containing a hypothesis of relationships that reflects the evolutionary history of a group of organisms. [4]
The tips of a phylogenetic tree can be living taxa or fossils, and represent the "end" or the present time in an evolutionary lineage. A phylogenetic diagram can be rooted or unrooted. A rooted tree diagram indicates the hypothetical common ancestor of the tree. An unrooted tree diagram (a network) makes no assumption about the ancestral line, and does not show the origin or "root" of the taxa in question or the direction of inferred evolutionary transformations. [5]
In addition to their use for inferring phylogenetic patterns among taxa, phylogenetic analyses are often employed to represent relationships among genes or individual organisms. Such uses have become central to understanding biodiversity, evolution, ecology, and genomes.
Phylogenetics is component of systematics that uses similarities and differences of the characteristics of species to interpret their evolutionary relationships and origins. Phylogenetics focuses on whether the characteristics of a species reinforce a phylogenetic inference that it diverged from the most recent common ancestor of a taxonomic group. [6]
In the field of cancer research, phylogenetics can be used to study the clonal evolution of tumors and molecular chronology, predicting and showing how cell populations vary throughout the progression of the disease and during treatment, using whole genome sequencing techniques. [7] The evolutionary processes behind cancer progression are quite different from those in species and are important to phylogenetic inference; these differences manifest in at least four areas: the types of aberrations that occur, the rates of mutation, the intensity, and high heterogeneity - variability - of tumor cell subclones. [8]
Phylogenetics can also aid in drug design and discovery. Phylogenetics allows scientists to organize species and can show which species are likely to have inherited particular traits that are medically useful, such as producing biologically active compounds - those that have effects on the human body. For example, in drug discovery, venom-producing animals are particularly useful. Venoms from these animals produce several important drugs, e.g., ACE inhibitors and Prialt ( Ziconotide). To find new venoms, scientists turn to phylogenetics to screen for closely related species that may have the same useful traits. The phylogenetic tree shows which species of fish have an origin of venom, and related fish they may contain the trait. Using this approach in studying venomous fish, biologists are able to identify the fish species that may be venomous. Biologist have used this approach in many species such as snakes and lizards. [9] In forensic science, phylogenetic tools are useful to assess DNA evidence for court cases. The simple phylogenetic tree of viruses A-E shows the relationships between viruses e.g., all viruses are descendants of Virus A.
HIV forensics uses phylogenetic analysis to track the differences in HIV genes and determine the relatedness of two samples. Phylogenetic analysis has been used in criminal trials to exonerate or hold individuals. HIV forensics does have its limitations, i.e., it cannot be the sole proof of transmission between individuals and phylogenetic analysis which shows transmission relatedness does not indicate direction of transmission. [10]
Taxonomy is the identification, naming, and classification of organisms. Compared to systemization, classification emphasizes whether a species has characteristics of a taxonomic group. [6] The Linnaean classification system developed in the 1700s by Carolus Linnaeus is the foundation for modern classification methods. Linnaean classification relies on an organism's phenotype or physical characteristics to group and organize species. [11] With the emergence of biochemistry, organism classifications are now usually based on phylogenetic data, and many systematists contend that only monophyletic taxa should be recognized as named groups. The degree to which classification depends on inferred evolutionary history differs depending on the school of taxonomy: phenetics ignores phylogenetic speculation altogether, trying to represent the similarity between organisms instead; cladistics (phylogenetic systematics) tries to reflect phylogeny in its classifications by only recognizing groups based on shared, derived characters ( synapomorphies); evolutionary taxonomy tries to take into account both the branching pattern and "degree of difference" to find a compromise between them.
Usual methods of phylogenetic inference involve computational approaches implementing the optimality criteria and methods of parsimony, maximum likelihood (ML), and MCMC-based Bayesian inference. All these depend upon an implicit or explicit mathematical model describing the evolution of characters observed. [12]
Phenetics, popular in the mid-20th century but now largely obsolete, used distance matrix-based methods to construct trees based on overall similarity in morphology or similar observable traits (i.e. in the phenotype or the overall similarity of DNA, not the DNA sequence), which was often assumed to approximate phylogenetic relationships.
Prior to 1950, phylogenetic inferences were generally presented as narrative scenarios. Such methods are often ambiguous and lack explicit criteria for evaluating alternative hypotheses. [13] [14] [15]
In phylogenetic analysis, taxon sampling selects a small group of taxa to represent the evolutionary history of its broader population. [16] This process is also known as stratified sampling or clade-based sampling. [17] The practice occurs given limited resources to compare and analyze every species within a target population. [16] Based on the representative group selected, the construction and accuracy of phylogenetic trees vary, which impacts derived phylogenetic inferences. [17]
Unavailable datasets, such as an organism's incomplete DNA and protein amino acid sequences in genomic databases, directly restrict taxonomic sampling. [17] Consequently, a significant source of error within phylogenetic analysis occurs due to inadequate taxon samples. Accuracy may be improved by increasing the number of genetic samples within its monophyletic group. Conversely, increasing sampling from outgroups extraneous to the target stratified population may decrease accuracy. Long branch attraction is an attributed theory for this occurrence, where nonrelated branches are incorrectly classified together, insinuating a shared evolutionary history. [16]
There are debates if increasing the number of taxa sampled improves phylogenetic accuracy more than increasing the number of genes sampled per taxon. Differences in each method's sampling impact the number of nucleotide sites utilized in a sequence alignment, which may contribute to disagreements. For example, phylogenetic trees constructed utilizing a more significant number of total nucleotides are generally more accurate, as supported by phylogenetic trees' bootstrapping replicability from random sampling.
The graphic presented in Taxon Sampling, Bioinformatics, and Phylogenomics, compares the correctness of phylogenetic trees generated using fewer taxa and more sites per taxon on the x-axis to more taxa and fewer sites per taxon on the y-axis. With fewer taxa, more genes are sampled amongst the taxonomic group; in comparison, with more taxa added to the taxonomic sampling group, fewer genes are sampled. Each method has the same total number of nucleotide sites sampled. Furthermore, the dotted line represents a 1:1 accuracy between the two sampling methods. As seen in the graphic, most of the plotted points are located below the dotted line, which indicates gravitation toward increased accuracy when sampling fewer taxa with more sites per taxon. The research performed utilizes four different phylogenetic tree construction models to verify the theory; neighbor-joining (NJ), minimum evolution (ME), unweighted maximum parsimony (MP), and maximum likelihood (ML). In the majority of models, sampling fewer taxon with more sites per taxon demonstrated higher accuracy.
Generally, with the alignment of a relatively equal number of total nucleotide sites, sampling more genes per taxon has higher bootstrapping replicability than sampling more taxa. However, unbalanced datasets within genomic databases make increasing the gene comparison per taxon in uncommonly sampled organisms increasingly difficult. [17]
The term "phylogeny" derives from the German Phylogenie, introduced by Haeckel in 1866, [18] and the Darwinian approach to classification became known as the "phyletic" approach. [19] It can be traced back to Aristotle, who wrote in his Posterior Analytics, "We may assume the superiority ceteris paribus [other things being equal] of the demonstration which derives from fewer postulates or hypotheses."
The modern concept of phylogenetics evolved primarily as a disproof of a previously widely accepted theory. During the late 19th century, Ernst Haeckel's recapitulation theory, or "biogenetic fundamental law", was widely accepted. It was often expressed as " ontogeny recapitulates phylogeny", i.e. the development of a single organism during its lifetime, from germ to adult, successively mirrors the adult stages of successive ancestors of the species to which it belongs. But this theory has long been rejected. [20] [21] Instead, ontogeny evolves – the phylogenetic history of a species cannot be read directly from its ontogeny, as Haeckel thought would be possible, but characters from ontogeny can be (and have been) used as data for phylogenetic analyses; the more closely related two species are, the more apomorphies their embryos share.
Phylogenetic tools and representations (trees and networks) can also be applied to philology, the study of the evolution of oral languages and written text and manuscripts, such as in the field of quantitative comparative linguistics. [79]
Computational phylogenetics can be used to investigate a language as an evolutionary system. The evolution of human language closely corresponds with human's biological evolution which allows phylogenetic methods to be applied. The concept of a "tree" serves as an efficient way to represent relationships between languages and language splits. It also serves as a way of testing hypotheses about the connections and ages of language families. For example, relationships among languages can be shown by using cognates as characters. [80] [81] The phylogenetic tree of Indo-European languages shows the relationships between several of the languages in a timeline, as well as the similarity between words and word order.
There are three types of criticisms about using phylogenetics in philology, the first arguing that languages and species are different entities, therefore you can not use the same methods to study both. The second being how phylogenetic methods are being applied to linguistic data. And the third, discusses the types of data that is being used to construct the trees. [80]
Bayesian phylogenetic methods, which are sensitive to how treelike the data is, allow for the reconstruction of relationships among languages, locally and globally. The main two reasons for the use of Bayesian phylogenetics are that (1) diverse scenarios can be included in calculations and (2) the output is a sample of trees and not a single tree with true claim. [82]
The same process can be applied to texts and manuscripts. In Paleography, the study of historical writings and manuscripts, texts were replicated by scribes who copied from their source and alterations - i.e., 'mutations' - occurred when the scribe did not precisely copy the source. [83]
This Needs rewriting and citing so as to be helpful, clear, and encyclopedic; the section's tone or style may not reflect the
encyclopedic tone used on Wikipedia. (February 2024) |
Phylogenetic screens involve the pharmacological examination of closely related groups of organisms. Advances in cladistics analysis through faster computer programs and improved molecular techniques have increased the precision of phylogenetic determination, allowing for the identification of species with pharmacological potential.
Phylogenetic screens have been used in a rudimentary manner in the past, such as studying the Apocynaceae family of plants known for their alkaloid-producing species like Catharanthus, which produces vincristine, an antileukemia drug. However, modern techniques now enable researchers to study close relatives of a species to uncover either (1) higher abundance of important bioactive compounds (e.g., species of Taxus for taxol) or (2) natural variants of known pharmaceuticals (e.g., species of Catharanthus for different forms of vincristine or vinblastine.[ citation needed]
The figure below contains the phylogenetic screen of biodiversity within the fungi family. Inside the circle there are subtrees present that were done via phylogenetic analysis. These relations help understand the evolutionary history of various groups of organisms, identifying relationships between different species, and predicting future evolutionary changes. If we were to take biodiversity information from existing knowledge there might be relations between species or subgroups that we didn't know. But with emerging imagery systems and new analysis techniques more genetic relation can be found in biodiverse fields. The image below can help with conservation efforts as there are rare species of fungi involved, that could be beneficial to ecosystems all around. [84]
This article may be
confusing or unclear to readers. In particular, this section includes dense text that might be hard to understand by general audiences. (February 2024) |
Whole-genome sequence data of pathogens obtained from outbreaks or epidemics of infectious diseases can provide important insights into transmission dynamics and inform public health strategies. Previous studies have relied on integrating genomic and epidemiological data to reconstruct transmission events. However, recent research has explored the possibility of deducing transmission patterns solely from genomic data using phylodynamics, which involves analyzing the properties of pathogen phylogenies. Phylodynamics uses theoretical models to compare predicted branch lengths with actual branch lengths in phylogenies to infer transmission patterns. Additionally, coalescent theory, which describes probability distributions on trees based on population size, has been adapted for epidemiological purposes. Another potential source of information within phylogenies that has been explored is "tree shape". These approaches are computationally intensive but have the potential to provide valuable insights into pathogen transmission dynamics. [85]
The structure of the host contact network has a profound impact on the dynamics of outbreaks or epidemics, and outbreak management strategies rely on the type of transmission patterns driving the outbreak. One can expect that pathogen genomes spreading through different contact network structures, such as chains, homogenous networks, or networks with super-spreaders, would accumulate mutations in distinct patterns, resulting in noticeable differences in the shape of phylogenetic trees, as illustrated in Fig. 1. Analyzation of the structural characteristics of phylogenetic trees generated from simulated bacterial genome evolution across multiple types of contact networks was conducted. Simple topological properties of phylogenetic trees that, when combined, can be used to classify trees into chain-like, homogenous, or super-spreading dynamics, revealing transmission dynamics. These properties form the basis of a computational classifier are used to classify real-world outbreaks. Remarkably, the computational predictions of overall transmission dynamics for each outbreak align with known epidemiology [86]
Different transmission networks result in quantitatively different tree shapes to determine whether tree shapes captured information about the underlying disease transmission patterns within an outbreak, we simulated evolution of a bacterial genome over three types of outbreak contact network—homogenous, super-spreading and chain—and summarized the resulting phylogenies with five metrics describing tree shape. Figures 2 and 3 illustrate the distributions of these metrics across the three types of outbreaks, revealing clear differences in tree topology depending on the underlying host contact network. Super-spreader networks gave rise to phylogenies with higher Colless imbalance, longer ladder patterns, lower Δw and deeper trees than transmission networks with a homogeneous distribution of contacts. Trees derived from chain-like networks were less variable, deeper, more imbalanced and narrower than the other trees. Other topological summary metrics considered did not resolve the three outbreak types as fully (Supplementary Information). Scatter plots can be used for pathogen transmission analysis to visualize the relationship between two variables, such as the number of infected individuals and the time since infection. For example, a scatter plot can be used to examine the relationship between the number of cases of a pathogen and the amount of time since the first case was reported. This can help to identify trends and patterns in the data, such as whether the spread of the pathogen is increasing or decreasing over time. Scatter plots can also be used to identify any outliers or clusters of data points, which can provide insight into potential transmission routes or super-spreader events. Overall, scatter plots can be a useful tool in pathogen transmission analysis to identify patterns and trends in the data, and to inform public health interventions and control measures. [86]
The box plot imagery on the right displays the pathogen transformation data. Box plots are often used in statistical analysis to compare different groups or to visualize changes in a single group over time. They are particularly useful when dealing with large datasets or when comparing several groups, as they can quickly highlight differences or similarities in the data. Box plots, also known as box-and-whisker plots, are useful in statistical analysis to provide a summary of the distribution of a dataset. They display the range, median, quartiles, and potential outliers of the data in a visual manner. Box plots are commonly used to compare different groups or to analyze changes in a single group over time. They are especially helpful when working with large datasets or when comparing multiple groups, as they can easily identify any differences or similarities in the data. This makes box plots a valuable tool for analyzing pathogen transmission data, as they can help to identify important features in the distribution of the data. [86]
{{
cite book}}
: CS1 maint: numeric names: authors list (
link)
{{
cite book}}
: |website=
ignored (
help)
{{
cite journal}}
: CS1 maint: DOI inactive as of February 2024 (
link)
{{
cite journal}}
: CS1 maint: multiple names: authors list (
link)
This article needs additional citations for
verification. (February 2024) |
Part of a series on |
Evolutionary biology |
---|
In biology, phylogenetics ( /ˌfaɪloʊdʒəˈnɛtɪks, -lə-/) [1] [2] [3] is the study of the evolutionary history and relationships among or within groups of organisms. These relationships are determined by phylogenetic inference, methods that focus on observed heritable traits, such as DNA sequences, protein amino acid sequences, or morphology. The result of such an analysis is a phylogenetic tree—a diagram containing a hypothesis of relationships that reflects the evolutionary history of a group of organisms. [4]
The tips of a phylogenetic tree can be living taxa or fossils, and represent the "end" or the present time in an evolutionary lineage. A phylogenetic diagram can be rooted or unrooted. A rooted tree diagram indicates the hypothetical common ancestor of the tree. An unrooted tree diagram (a network) makes no assumption about the ancestral line, and does not show the origin or "root" of the taxa in question or the direction of inferred evolutionary transformations. [5]
In addition to their use for inferring phylogenetic patterns among taxa, phylogenetic analyses are often employed to represent relationships among genes or individual organisms. Such uses have become central to understanding biodiversity, evolution, ecology, and genomes.
Phylogenetics is component of systematics that uses similarities and differences of the characteristics of species to interpret their evolutionary relationships and origins. Phylogenetics focuses on whether the characteristics of a species reinforce a phylogenetic inference that it diverged from the most recent common ancestor of a taxonomic group. [6]
In the field of cancer research, phylogenetics can be used to study the clonal evolution of tumors and molecular chronology, predicting and showing how cell populations vary throughout the progression of the disease and during treatment, using whole genome sequencing techniques. [7] The evolutionary processes behind cancer progression are quite different from those in species and are important to phylogenetic inference; these differences manifest in at least four areas: the types of aberrations that occur, the rates of mutation, the intensity, and high heterogeneity - variability - of tumor cell subclones. [8]
Phylogenetics can also aid in drug design and discovery. Phylogenetics allows scientists to organize species and can show which species are likely to have inherited particular traits that are medically useful, such as producing biologically active compounds - those that have effects on the human body. For example, in drug discovery, venom-producing animals are particularly useful. Venoms from these animals produce several important drugs, e.g., ACE inhibitors and Prialt ( Ziconotide). To find new venoms, scientists turn to phylogenetics to screen for closely related species that may have the same useful traits. The phylogenetic tree shows which species of fish have an origin of venom, and related fish they may contain the trait. Using this approach in studying venomous fish, biologists are able to identify the fish species that may be venomous. Biologist have used this approach in many species such as snakes and lizards. [9] In forensic science, phylogenetic tools are useful to assess DNA evidence for court cases. The simple phylogenetic tree of viruses A-E shows the relationships between viruses e.g., all viruses are descendants of Virus A.
HIV forensics uses phylogenetic analysis to track the differences in HIV genes and determine the relatedness of two samples. Phylogenetic analysis has been used in criminal trials to exonerate or hold individuals. HIV forensics does have its limitations, i.e., it cannot be the sole proof of transmission between individuals and phylogenetic analysis which shows transmission relatedness does not indicate direction of transmission. [10]
Taxonomy is the identification, naming, and classification of organisms. Compared to systemization, classification emphasizes whether a species has characteristics of a taxonomic group. [6] The Linnaean classification system developed in the 1700s by Carolus Linnaeus is the foundation for modern classification methods. Linnaean classification relies on an organism's phenotype or physical characteristics to group and organize species. [11] With the emergence of biochemistry, organism classifications are now usually based on phylogenetic data, and many systematists contend that only monophyletic taxa should be recognized as named groups. The degree to which classification depends on inferred evolutionary history differs depending on the school of taxonomy: phenetics ignores phylogenetic speculation altogether, trying to represent the similarity between organisms instead; cladistics (phylogenetic systematics) tries to reflect phylogeny in its classifications by only recognizing groups based on shared, derived characters ( synapomorphies); evolutionary taxonomy tries to take into account both the branching pattern and "degree of difference" to find a compromise between them.
Usual methods of phylogenetic inference involve computational approaches implementing the optimality criteria and methods of parsimony, maximum likelihood (ML), and MCMC-based Bayesian inference. All these depend upon an implicit or explicit mathematical model describing the evolution of characters observed. [12]
Phenetics, popular in the mid-20th century but now largely obsolete, used distance matrix-based methods to construct trees based on overall similarity in morphology or similar observable traits (i.e. in the phenotype or the overall similarity of DNA, not the DNA sequence), which was often assumed to approximate phylogenetic relationships.
Prior to 1950, phylogenetic inferences were generally presented as narrative scenarios. Such methods are often ambiguous and lack explicit criteria for evaluating alternative hypotheses. [13] [14] [15]
In phylogenetic analysis, taxon sampling selects a small group of taxa to represent the evolutionary history of its broader population. [16] This process is also known as stratified sampling or clade-based sampling. [17] The practice occurs given limited resources to compare and analyze every species within a target population. [16] Based on the representative group selected, the construction and accuracy of phylogenetic trees vary, which impacts derived phylogenetic inferences. [17]
Unavailable datasets, such as an organism's incomplete DNA and protein amino acid sequences in genomic databases, directly restrict taxonomic sampling. [17] Consequently, a significant source of error within phylogenetic analysis occurs due to inadequate taxon samples. Accuracy may be improved by increasing the number of genetic samples within its monophyletic group. Conversely, increasing sampling from outgroups extraneous to the target stratified population may decrease accuracy. Long branch attraction is an attributed theory for this occurrence, where nonrelated branches are incorrectly classified together, insinuating a shared evolutionary history. [16]
There are debates if increasing the number of taxa sampled improves phylogenetic accuracy more than increasing the number of genes sampled per taxon. Differences in each method's sampling impact the number of nucleotide sites utilized in a sequence alignment, which may contribute to disagreements. For example, phylogenetic trees constructed utilizing a more significant number of total nucleotides are generally more accurate, as supported by phylogenetic trees' bootstrapping replicability from random sampling.
The graphic presented in Taxon Sampling, Bioinformatics, and Phylogenomics, compares the correctness of phylogenetic trees generated using fewer taxa and more sites per taxon on the x-axis to more taxa and fewer sites per taxon on the y-axis. With fewer taxa, more genes are sampled amongst the taxonomic group; in comparison, with more taxa added to the taxonomic sampling group, fewer genes are sampled. Each method has the same total number of nucleotide sites sampled. Furthermore, the dotted line represents a 1:1 accuracy between the two sampling methods. As seen in the graphic, most of the plotted points are located below the dotted line, which indicates gravitation toward increased accuracy when sampling fewer taxa with more sites per taxon. The research performed utilizes four different phylogenetic tree construction models to verify the theory; neighbor-joining (NJ), minimum evolution (ME), unweighted maximum parsimony (MP), and maximum likelihood (ML). In the majority of models, sampling fewer taxon with more sites per taxon demonstrated higher accuracy.
Generally, with the alignment of a relatively equal number of total nucleotide sites, sampling more genes per taxon has higher bootstrapping replicability than sampling more taxa. However, unbalanced datasets within genomic databases make increasing the gene comparison per taxon in uncommonly sampled organisms increasingly difficult. [17]
The term "phylogeny" derives from the German Phylogenie, introduced by Haeckel in 1866, [18] and the Darwinian approach to classification became known as the "phyletic" approach. [19] It can be traced back to Aristotle, who wrote in his Posterior Analytics, "We may assume the superiority ceteris paribus [other things being equal] of the demonstration which derives from fewer postulates or hypotheses."
The modern concept of phylogenetics evolved primarily as a disproof of a previously widely accepted theory. During the late 19th century, Ernst Haeckel's recapitulation theory, or "biogenetic fundamental law", was widely accepted. It was often expressed as " ontogeny recapitulates phylogeny", i.e. the development of a single organism during its lifetime, from germ to adult, successively mirrors the adult stages of successive ancestors of the species to which it belongs. But this theory has long been rejected. [20] [21] Instead, ontogeny evolves – the phylogenetic history of a species cannot be read directly from its ontogeny, as Haeckel thought would be possible, but characters from ontogeny can be (and have been) used as data for phylogenetic analyses; the more closely related two species are, the more apomorphies their embryos share.
Phylogenetic tools and representations (trees and networks) can also be applied to philology, the study of the evolution of oral languages and written text and manuscripts, such as in the field of quantitative comparative linguistics. [79]
Computational phylogenetics can be used to investigate a language as an evolutionary system. The evolution of human language closely corresponds with human's biological evolution which allows phylogenetic methods to be applied. The concept of a "tree" serves as an efficient way to represent relationships between languages and language splits. It also serves as a way of testing hypotheses about the connections and ages of language families. For example, relationships among languages can be shown by using cognates as characters. [80] [81] The phylogenetic tree of Indo-European languages shows the relationships between several of the languages in a timeline, as well as the similarity between words and word order.
There are three types of criticisms about using phylogenetics in philology, the first arguing that languages and species are different entities, therefore you can not use the same methods to study both. The second being how phylogenetic methods are being applied to linguistic data. And the third, discusses the types of data that is being used to construct the trees. [80]
Bayesian phylogenetic methods, which are sensitive to how treelike the data is, allow for the reconstruction of relationships among languages, locally and globally. The main two reasons for the use of Bayesian phylogenetics are that (1) diverse scenarios can be included in calculations and (2) the output is a sample of trees and not a single tree with true claim. [82]
The same process can be applied to texts and manuscripts. In Paleography, the study of historical writings and manuscripts, texts were replicated by scribes who copied from their source and alterations - i.e., 'mutations' - occurred when the scribe did not precisely copy the source. [83]
This Needs rewriting and citing so as to be helpful, clear, and encyclopedic; the section's tone or style may not reflect the
encyclopedic tone used on Wikipedia. (February 2024) |
Phylogenetic screens involve the pharmacological examination of closely related groups of organisms. Advances in cladistics analysis through faster computer programs and improved molecular techniques have increased the precision of phylogenetic determination, allowing for the identification of species with pharmacological potential.
Phylogenetic screens have been used in a rudimentary manner in the past, such as studying the Apocynaceae family of plants known for their alkaloid-producing species like Catharanthus, which produces vincristine, an antileukemia drug. However, modern techniques now enable researchers to study close relatives of a species to uncover either (1) higher abundance of important bioactive compounds (e.g., species of Taxus for taxol) or (2) natural variants of known pharmaceuticals (e.g., species of Catharanthus for different forms of vincristine or vinblastine.[ citation needed]
The figure below contains the phylogenetic screen of biodiversity within the fungi family. Inside the circle there are subtrees present that were done via phylogenetic analysis. These relations help understand the evolutionary history of various groups of organisms, identifying relationships between different species, and predicting future evolutionary changes. If we were to take biodiversity information from existing knowledge there might be relations between species or subgroups that we didn't know. But with emerging imagery systems and new analysis techniques more genetic relation can be found in biodiverse fields. The image below can help with conservation efforts as there are rare species of fungi involved, that could be beneficial to ecosystems all around. [84]
This article may be
confusing or unclear to readers. In particular, this section includes dense text that might be hard to understand by general audiences. (February 2024) |
Whole-genome sequence data of pathogens obtained from outbreaks or epidemics of infectious diseases can provide important insights into transmission dynamics and inform public health strategies. Previous studies have relied on integrating genomic and epidemiological data to reconstruct transmission events. However, recent research has explored the possibility of deducing transmission patterns solely from genomic data using phylodynamics, which involves analyzing the properties of pathogen phylogenies. Phylodynamics uses theoretical models to compare predicted branch lengths with actual branch lengths in phylogenies to infer transmission patterns. Additionally, coalescent theory, which describes probability distributions on trees based on population size, has been adapted for epidemiological purposes. Another potential source of information within phylogenies that has been explored is "tree shape". These approaches are computationally intensive but have the potential to provide valuable insights into pathogen transmission dynamics. [85]
The structure of the host contact network has a profound impact on the dynamics of outbreaks or epidemics, and outbreak management strategies rely on the type of transmission patterns driving the outbreak. One can expect that pathogen genomes spreading through different contact network structures, such as chains, homogenous networks, or networks with super-spreaders, would accumulate mutations in distinct patterns, resulting in noticeable differences in the shape of phylogenetic trees, as illustrated in Fig. 1. Analyzation of the structural characteristics of phylogenetic trees generated from simulated bacterial genome evolution across multiple types of contact networks was conducted. Simple topological properties of phylogenetic trees that, when combined, can be used to classify trees into chain-like, homogenous, or super-spreading dynamics, revealing transmission dynamics. These properties form the basis of a computational classifier are used to classify real-world outbreaks. Remarkably, the computational predictions of overall transmission dynamics for each outbreak align with known epidemiology [86]
Different transmission networks result in quantitatively different tree shapes to determine whether tree shapes captured information about the underlying disease transmission patterns within an outbreak, we simulated evolution of a bacterial genome over three types of outbreak contact network—homogenous, super-spreading and chain—and summarized the resulting phylogenies with five metrics describing tree shape. Figures 2 and 3 illustrate the distributions of these metrics across the three types of outbreaks, revealing clear differences in tree topology depending on the underlying host contact network. Super-spreader networks gave rise to phylogenies with higher Colless imbalance, longer ladder patterns, lower Δw and deeper trees than transmission networks with a homogeneous distribution of contacts. Trees derived from chain-like networks were less variable, deeper, more imbalanced and narrower than the other trees. Other topological summary metrics considered did not resolve the three outbreak types as fully (Supplementary Information). Scatter plots can be used for pathogen transmission analysis to visualize the relationship between two variables, such as the number of infected individuals and the time since infection. For example, a scatter plot can be used to examine the relationship between the number of cases of a pathogen and the amount of time since the first case was reported. This can help to identify trends and patterns in the data, such as whether the spread of the pathogen is increasing or decreasing over time. Scatter plots can also be used to identify any outliers or clusters of data points, which can provide insight into potential transmission routes or super-spreader events. Overall, scatter plots can be a useful tool in pathogen transmission analysis to identify patterns and trends in the data, and to inform public health interventions and control measures. [86]
The box plot imagery on the right displays the pathogen transformation data. Box plots are often used in statistical analysis to compare different groups or to visualize changes in a single group over time. They are particularly useful when dealing with large datasets or when comparing several groups, as they can quickly highlight differences or similarities in the data. Box plots, also known as box-and-whisker plots, are useful in statistical analysis to provide a summary of the distribution of a dataset. They display the range, median, quartiles, and potential outliers of the data in a visual manner. Box plots are commonly used to compare different groups or to analyze changes in a single group over time. They are especially helpful when working with large datasets or when comparing multiple groups, as they can easily identify any differences or similarities in the data. This makes box plots a valuable tool for analyzing pathogen transmission data, as they can help to identify important features in the distribution of the data. [86]
{{
cite book}}
: CS1 maint: numeric names: authors list (
link)
{{
cite book}}
: |website=
ignored (
help)
{{
cite journal}}
: CS1 maint: DOI inactive as of February 2024 (
link)
{{
cite journal}}
: CS1 maint: multiple names: authors list (
link)