Chimera (molecular biology)


In genetics and molecular biology, a chimera is a single DNA sequence originating from multiple transcripts or parent sequences. It can occur in various contexts. Chimeras are generally considered a contaminant, as a chimera can be interpreted as a novel sequence while it is in fact an artifact. However, the formation of artificial chimeras can also be a useful tool in the molecular biology. For example, in protein engineering, "chimeragenesis " is one of the "two major techniques used to manipulate cDNA sequences".

Description

Transcript chimera

A chimera can occur as a single cDNA sequence originating from two transcripts. It is usually considered to be a contaminant in transcript and expressed sequence tag databases. It is estimated that approximately 1% of all transcripts in the National Center for Biotechnology Information's Unigene database contain a "chimeric sequence".

PCR chimera

A chimera can also be an artifact of PCR amplification. It occurs when the extension of an amplicon is aborted, and the aborted product functions as a primer in the next PCR cycle. The aborted product anneals to the wrong template and continues to extend, thereby synthesizing a single sequence sourced from two different templates.
PCR chimeras are an important issue to take into account during metabarcoding, where DNA sequences from environmental samples are used to determine biodiversity. A chimera is a novel sequence that will most probably not match to any known organism. Hence, it might be interpreted as a new species thereby inflating over diversity.

Chimeric read

A chimeric read is a digital DNA sequence that originates from an actual chimera or produced due to misreading the sample. The latter is known to occur with sequencing of electrophoresis gels.
Some methods have been devised to detect chimeras, like: