About


H2M Output Data Description


H2M generates a series of outputs that correspond to the human and mouse nucleotide and amino acid changes associated with a given mutation. The full description of this output is provided below:

ColumnDescription
gene_name_hHuman gene name
gene_id_hHuman gene ID
tx_id_hHuman transcript ID
chr_hHuman chromosome number
exon_num_hTotal number of exons of the human transcript
strand_hPositive or Negative strand of the human transcript on the chromosome
matchThe computed reference sequence by given coordinate is matched with the input reference sequence or not
start_h | end_hStart and end position of the human variant on the chromosome in MAF format
ref_seq_h | alt_seq_hReference and alternate sequence of the human variant on the chromosome in MAF format
HGVSc_h | HGVSp_hHGVSc and HGVSp expression of the human variant
classification_hHuman variant effect classification, including missense/nonsense/in-frame indel/fram-shift indel/intron, etc.
exon_hExon/Intron location of the given human mutation, for example, E_7/I_5
type_hHuman variant type in MAF format, including SNP/DNP/TNP/ONP/INS/DEL
statusThis mutation can be modeled in the given target transcript or not, True or False
classH2M modeling result class, 0-5
statementStatement of the H2M result class
flank_size_left | flank_size_rightLength of the identical sequences between human and mouse on the left/right side of the mutation
gene_name_mMouse gene name
gene_id_mMouse gene ID
tx_id_mMouse transcript ID
chr_mMouse chromosome number
exon_num_mTotal number of exons of the mouse transcript
strand_mPositive or negative strand of the mouse transcript on the chromosome
type_mMouse variant type in MAF format, including SNP/DNP/TNO/ONP/INS/DEL
classification_mMouse variant effect classification
exon_mExon/Intron location of the murine mutation
start_m_ori | end_m_oriStart and end position of the mouse variant (with exactly the same DNA change) on the chromosome in MAF format
ref_seq_m_ori | alt_seq_m_oriReference and alternate sequence of the mouse variant (with exactly the same DNA change) on the chromosome in MAF format
HGVSc_m_ori | HGVSp_m_oriHGVSc and HGVSp expression of the mouse variant (with exactly the same DNA change)
start_m | end_mStart and end position of the mouse variant (with the same amino acid change) on the chromosome in MAF format
ref_seq_m | alt_seq_mReference and alternate sequence of the mouse variant (with the same amino acid change) on the chromosome in MAF format
HGVSc_m | HGVSp_mHGVSc and HGVSp expression of the mouse variant (with the same amino acid change)

H2M Modeling Class Description


ClassStatement
Class 0This mutation can be originally modeled (i.e. same DNA change & and same amino acid change).
Class 1This mutation can be alternatively modeled (i.e. different DNA change to generate the same amino acid change).
Class 2This mutation can be modeled, but the effect may not be consistent (i.e. same DNA change, but of uncertain effect–usually limited to noncoding regions).
Class 3This mutation cannot be originally modeled and no alternative is found.
Class 4Mutated sequences are not identical.
Class 5Coordinate error. This mutation is not in the query gene.
Class 6This mutation cannot be originally modeled.