Skip to content

Commit 6b06097

Browse files
committed
Added basic ambiguity codes to base modification tags
1 parent 873b592 commit 6b06097

File tree

1 file changed

+19
-0
lines changed

1 file changed

+19
-0
lines changed

SAMtags.tex

+19
Original file line numberDiff line numberDiff line change
@@ -503,6 +503,10 @@ \subsection{Base modifications}
503503
below, taken from \emph{Modeling methyl-sensitive transcription factor
504504
motifs with an expanded epigenetic alphabet}, Coby Viner
505505
et.al. \url{https://www.biorxiv.org/content/10.1101/043794v1}.
506+
Additionally ambiguity codes {\tt A}, {\tt C}, {\tt G} and {\tt T}
507+
exist to represent unspecified modifications bases of their respective
508+
canonical base types, plus code {\tt N} to represent an unspecified
509+
modification of any base type.
506510

507511
\begin{center}
508512
\begin{tabular}{lllll}
@@ -512,16 +516,21 @@ \subsection{Base modifications}
512516
C & h & 5hmC & 5-Hydroxymethylcytosine & 76792 \\
513517
C & f & 5fC & 5-Formylcytosine & 76794 \\
514518
C & c & 5caC & 5-Carboxylcytosine & 76793 \\
519+
C & C & & Ambiguity code; any C mod & \\
515520
\hline
516521
T & g & 5hmU & 5-Hydroxymethyluracil & 16964 \\
517522
T & e & 5fU & 5-Formyluracil & 80961 \\
518523
T & b & 5caU & 5-Carboxyluracil & 17477 \\
524+
T & T & & Ambiguity code; any T mod & \\
519525
\hline
520526
A & a & 6mA & 6-Methyladenine & 28871 \\
527+
A & A & & Ambiguity code; any A mod & \\
521528
\hline
522529
G & o & 8oxoG & 8-Oxoguanine & 44605 \\
530+
G & G & & Ambiguity code; any G mod & \\
523531
\hline
524532
N & n & Xao & Xanthosine & 18107 \\
533+
N & N & & Ambiguity code; any mod & \\
525534
\end{tabular}
526535
\end{center}
527536

@@ -536,6 +545,13 @@ \subsection{Base modifications}
536545
For example {\tt MM:Z:C+m,5,12,3;C+h,57;} may have an associated
537546
quality tag of {\tt MP:Z:5EB/}.
538547

548+
Quality values for ambiguity codes give the likelihood that the
549+
modification is one of the possible codes compatible with that
550+
ambiguity code. For example {\tt MM:Z:C+C,10 MP:Z:+} indicates a C
551+
call with an unspecified modification and the phred score of 10 (ASCII
552+
value {\tt +}). This corresponds to a 90\% chance of the base being
553+
modified.
554+
539555

540556
\end{description}
541557

@@ -560,6 +576,9 @@ \section{Tag History}
560576
\setlength{\parindent}{0pt}
561577
\newcommand*{\gap}{\vspace*{2ex}}
562578

579+
\subsubsection*{June 2019}
580+
Added the MM and MP tags describing base modifications.
581+
563582
\subsubsection*{January 2019}
564583
Added the OA tag for recording original/previous alignment information.
565584

0 commit comments

Comments
 (0)