package biocaml
Install
Dune Dependency
Authors
Maintainers
Sources
md5=486aeb3e552dabae85839e2af30d6c52
sha512=4ed2df0b7cbd80bd6e29bd8fee9d2dacd9379ad0f4ff142bd8e16ade3f1507f6cc7cbe4c614943b8feb8fa4705935695cb458606b0da813dbf255b1e566a43cf
doc/biocaml.unix/Biocaml_unix/Chr/index.html
Module Biocaml_unix.Chr
Source
Chromosome names. A chromosome name, as defined by this module, consists of two parts. An optional prefix "chr" (case-insensitive), followed by a suffix identifying the chromosome. The possible suffixes (case-insensitive) are:
- "X" representing the maternal chromosome
- "Y" representing the paternal chromosome
- "M", "Mt", or "MtDNA" representing mitochondrial DNA
- "N" where N is a positive number in either arabic or roman numeral form
Some suffixes have ambiguous interpretations, e.g. "X" could be a Roman numeral or the maternal chromosome. Preference is given to the non-numeric interpretation. Thus "chrX" is treated as the maternal chromosome, not chromosome 10. Strings not in the above form are left unaltered by the functions of this module.
Functions for converting to an Arabic or Roman format are provided. The benefit is that these functions convert to a canonical form, and thus the resulting strings can be compared reliably. For example, to_arabic "Chr4"
and to_arabic "IV"
both return "4". The Arabic format is recommended over the Roman form because the Roman form is incomplete; e.g. it cannot represent chromosome number 10 because there would be an ambiguity with the maternal chromosome "chrX".
to_arabic s
returns the canonical Arabic representation of s
.
to_roman s
returns the canonical Roman representation of s
.