Zika virusNovel alignment-free sequence descriptors in Zika virus characterization

Published 11 April 2016

The recent epidemic of Zika virus infections in South and Latin America has raised serious concerns on its ramifications for the population in the Americas and spread of the virus worldwide. Researchers explain their research on computer-assisted approaches toward surveillance and consequent design of drugs and vaccines to combat the growth and spread of the Zika virus.

The recent epidemic of Zika virus infections in South and Latin America has raised serious concerns on its ramifications for the population in the Americas and spread of the virus worldwide. The Zika virus disease is a relatively new phenomenon for which sufficient and comprehensive data and investigative reports have not been available to date. Although first recognized as a new virus in 1947 in Uganda’s Zika forest animals, its debilitating effect on human fetuses leading to babies being born with smaller braincases (microcephaly) was not known or well investigated until its epidemic form in Yap Island, Micronesia, in 2013 and now in the South American countries in 2015-16. The concern is so high that public health authorities in some countries such as Colombia, Ecuador, El Salvador, and Jamaica have asked their women to avoid pregnancies until the virus is better understood. The World Health Organization (WHO) has labeled the outbreak of such abnormalities as a Public Health Emergency of International Concern.

The Zika virus is spread by the Aedes aegypti mosquito which is prevalent in tropical climate regions, but is spreading northwards following global warming trends. Bentham Science Publishers reports that to understand the nature of the virus, researchers undertook a special bioinformatics study of its complete RNA sequence from African and non-African, that is, Asian, Pacific, and American, sources. The researchers note that viral sequences mutate much more rapidly than, say, mammalian sequences. They found that the African Zika viral sequences were distinctly separate compared to the non-African sequences.

This difference is evident dramatically from two graphs of the Zika genome, one from Central African Republic and another from Brazil. The graphs are representations of the Zika RNA sequences through our model where the nucleotides of a sequence are plotted sequentially on a 2-dimensional grid by a set algorithm. Such a representation allows the researchers to project a visual rendering of the distribution of nucleotides in a sequence and also to quantitatively index a sequence through numerical sequence descriptors and measure similarities and dissimilarities among sequences. While the normal practice is to use various alignment techniques like BLAST, CLUSTALW, etc. to measure such differences, the researchers say their approach allows for a novel method of genome characterization called alignment-free measures and is therefore independent of several assumptions. Results such as theirs provide more focus to surveillance of the spread and mutations of the Zika virus and hopefully can contribute to the eventual development of suitable drugs and vaccines that target the expression of specific Zika virus genes or the products expressed by these genes.

— Read more in Ashesh Nandy et al., “Characterizing the Zika virus genome – A bioinformatics study. Current Computer Aided-Drug Design 12 (2016) (DOI: 10.2174/1573409912666160401115812)