Antell Gregory C, Dampier Will, Aiamkitsumrit Benjamas, Nonnemacher Michael R, Jacobson Jeffrey M, Pirrone Vanessa, Zhong Wen, Kercher Katherine, Passic Shendra, Williams Jean W, Schwartz Gregory, Hershberg Uri, Krebs Fred C, Wigdahl Brian
Department of Microbiology and Immunology, Drexel University College of Medicine, Philadelphia, PA, USA.
Center for Molecular Virology and Translational Neuroscience, Institute for Molecular Medicine and Infectious Disease, Drexel University College of Medicine, Philadelphia, PA, USA.
Retrovirology. 2016 May 3;13(1):32. doi: 10.1186/s12977-016-0266-9.
HIV-1 entry is a receptor-mediated process directed by the interaction of the viral envelope with the host cell CD4 molecule and one of two co-receptors, CCR5 or CXCR4. The amino acid sequence of the third variable (V3) loop of the HIV-1 envelope is highly predictive of co-receptor utilization preference during entry, and machine learning predictive algorithms have been developed to characterize sequences as CCR5-utilizing (R5) or CXCR4-utilizing (X4). It was hypothesized that while the V3 loop is predominantly responsible for determining co-receptor binding, additional components of the HIV-1 genome may contribute to overall viral tropism and display sequence signatures associated with co-receptor utilization.
The accessory protein Tat and the HlV-1 long terminal repeat (LTR) were analyzed with respect to genetic diversity and compared by Jensen-Shannon divergence which resulted in a correlation with both mean genetic diversity as well as the absolute difference in genetic diversity between R5- and X4-genome specific trends. As expected, the V3 domain of the gp120 protein was enriched with statistically divergent positions. Statistically divergent positions were also identified in Tat amino acid sequences within the transactivation and TAR-binding domains, and in nucleotide positions throughout the LTR. We further analyzed LTR sequences for putative transcription factor binding sites using the JASPAR transcription factor binding profile database and found several putative differences in transcription factor binding sites between R5 and X4 HIV-1 genomes, specifically identifying the C/EBP sites I and II, and Sp site III to differ with respect to sequence configuration for R5 and X4 LTRs.
These observations support the hypothesis that co-receptor utilization coincides with specific genetic signatures in HIV-1 Tat and the LTR, likely due to differing transcriptional regulatory mechanisms and selective pressures applied within specific cellular targets during the course of productive HIV-1 infection.
HIV-1进入是一个受体介导的过程,由病毒包膜与宿主细胞CD4分子以及两种共受体之一CCR5或CXCR4的相互作用所引导。HIV-1包膜的第三个可变(V3)环的氨基酸序列高度预测进入过程中共受体利用偏好,并且已经开发了机器学习预测算法来将序列表征为利用CCR5(R5)或利用CXCR4(X4)。据推测,虽然V3环主要负责确定共受体结合,但HIV-1基因组的其他成分可能有助于整体病毒嗜性并显示与共受体利用相关的序列特征。
分析了辅助蛋白Tat和HIV-1长末端重复序列(LTR)的遗传多样性,并通过詹森-香农散度进行比较,这导致与平均遗传多样性以及R5和X4基因组特异性趋势之间的遗传多样性绝对差异均存在相关性。如预期的那样,gp120蛋白的V3结构域富含统计学上有差异的位置。在反式激活和TAR结合结构域内的Tat氨基酸序列以及整个LTR的核苷酸位置中也鉴定出统计学上有差异的位置。我们使用JASPAR转录因子结合谱数据库进一步分析了LTR序列中的假定转录因子结合位点,发现R5和X4 HIV-1基因组之间在转录因子结合位点上存在几个假定差异,具体确定C/EBP位点I和II以及Sp位点III在R5和X4 LTR的序列构型方面存在差异。
这些观察结果支持这样的假设,即共受体利用与HIV-1 Tat和LTR中的特定遗传特征相吻合,这可能是由于在高效HIV-1感染过程中特定细胞靶标内应用的不同转录调控机制和选择压力所致。