This article has Open Peer Review reports available.
A simple structure-based model for the prediction of HIV-1 co-receptor tropism
© Heider et al.; licensee BioMed Central Ltd. 2014
Received: 10 March 2014
Accepted: 28 July 2014
Published: 1 August 2014
Human Immunodeficiency Virus 1 enters host cells through interaction of its V3 loop (which is part of the gp120 protein) with the host cell receptor CD4 and one of two co-receptors, namely CCR5 or CXCR4. Entry inhibitors binding the CCR5 co-receptor can prevent viral entry. As these drugs are only available for CCR5-using viruses, accurate prediction of this so-called co-receptor tropism is important in order to ensure an effective personalized therapy. With the development of next-generation sequencing technologies, it is now possible to sequence representative subpopulations of the viral quasispecies.
Here we present T-CUP 2.0, a model for predicting co-receptor tropism. Based on our recently published T-CUP model, we developed a more accurate and even faster solution. Similarly to its predecessor, T-CUP 2.0 models co-receptor tropism using information of the electrostatic potential and hydrophobicity of V3-loops. However, extracting this information from a simplified structural vacuum-model leads to more accurate and faster predictions. The area-under-the-ROC-curve (AUC) achieved with T-CUP 2.0 on the training set is 0.968±0.005 in a leave-one-patient-out cross-validation. When applied to an independent dataset, T-CUP 2.0 has an improved prediction accuracy of around 3% when compared to the original T-CUP.
We found that it is possible to model co-receptor tropism in HIV-1 based on a simplified structure-based model of the V3 loop. In this way, genotypic prediction of co-receptor tropism is very accurate, fast and can be applied to large datasets derived from next-generation sequencing technologies. The reduced complexity of the electrostatic modeling makes T-CUP 2.0 independent from third-party software, making it easy to install and use.
The Human Immunodeficiency Virus 1 (HIV-1) enters host cells by binding to the CD4 receptor and one of the chemokine co-receptors CCR5 and CXCR4,. The so-called co-receptor tropism of an HIV-1 virus describes the type of co-receptor that is being used: those viruses binding specifically to the CCR5 receptor are called "R5"-, and those binding to CXCR4 are called "X4"-tropic. Some viruses are able to bind either of the co-receptors and are called "dual"- or "R5X4"-tropic. It has been shown that patients harboring X4-tropic viruses tend to progess faster towards the Aquired Immunodeficiency Syndrome (AIDS) when compared to patients harboring only R5-tropic viruses. Recently developed drugs, such as Maraviroc and Vicriviroc, specificially bind to the CCR5 receptor, effectively inhibiting viral entry of R5-tropic viruses. Unfortunately, these drugs are of course ineffective against X4-tropic viruses. Therefore reliable determination of co-receptor tropism is crucial for an effective antiviral treatment of patients. Research has focused on the development of both in vitro tests, such as cell-based assays on the one hand, and in silico methods, on the other to develop a reliable tool for co-receptor tropism determination. The main drawbacks of the former are rather high costs and long turn-around time.
Most of the computational methods focus on the third variable loop (V3), a variable region of the glycoprotein 120 (gp120) of HIV-1. V3 is around 35 amino acids in length, variable in its sequence composition but also in length, and has been shown to be the main determinant for co-receptor tropism. Electrostatic interactions have been implicated to play a decisive role in co-receptor tropism. The most simple and best-known model of co-receptor usage is the 11/25 rule predicting a virus to be R5-tropic unless one of the amino acids sidechains at position 11 or 25 is positively charged. Although having a high specificity (about 90%), this rule lacks sensitivity (40-60%). In order to improve prediction accuracy, several more sophisticated prediction models, ranging from artificial neural networks, position specific scoring matrices to support vector machines have been developed. In our recent studies, we have taken up the implication made by charge-based rules (11/25) and developed an electrostatic hull approach to predict co-receptor tropism. In our approach, V3 sequences (from the training set as well as sequences from new patients) are first modeled onto the V3 X-ray structure by Huang et al. (PDB:2b4c). Second, the electrostatic potential ϕ(r) around the V3 loops is estimated by calculating the electrostatic potential on a constant hull, a discretized surface of n hull = 642 points in an approximate distance of 0.6 nm around the solvent accessible surface of the V3 structure. The electrostatic hulls of the V3 loops are then used as input for a random forest model. In addition, we complemented the structural electrostatic descriptor with a sequence-based classifier, the hydrophobicity scale by Kyte and Doolittle, which encodes amino acid sequences with numerical values representing their hydrophobicity. The hydrophobicity descriptor has been used in several other studies and a wide range of applications leading to accurate predictions[14–19]. The electrostatic and hydrophobicity classifiers, are then combined by a second-level-learning approach, i.e. the outputs of the two classifiers are used as an input for a third random forest, making the final prediction. With this scheme, we outperformed the prediction accuracy of other state-of-the-art methods, such as geno2pheno and wetcat, as demonstrated on an independent test set. Introducing structural information into classification models seem to improve overall prediction performance[10, 22–24]. The combination of structural and sequence information has not only been demonstrated in HIV-1 co-receptor usage prediction, but also in other related studies dealing with HIV-1 drug resistance prediction[25, 26]. For instance, Hou et al. developed an SVM-based method that modeled HIV-1 protease inhibitor resistance using structural information of the HIV-1 protease. We proposed a classification model for Bevirimat resistance in HIV-1 that combines sequence-derived and structural information of the viral p2 protein. These studies suggest that the combination of sequence and structural information can improve prediction performance, compared to classifiers based on either sequences or structures. This is in line with theoretical findings that ensemble learning can lead to better prediction results and that classifier diversity is highly important. The rather complex modeling and prediction scheme of our initial co-receptor prediction method (called T-CUP) leads to disadvantages in computation speed, and involves a handful of external programs. Thus, to date T-CUP has not been available. The aim of this study was the development of T-CUP 2.0, a less complex and faster method, that yields better or comparable predictive power and is easy to install and to use.
For this study, we used the data gathered by Dybowski et al.. It consists of 1351 clonal amino acid sequences of the V3 loop of HIV-1 from 899 patients. 200 sequences are derived from X4-tropic (34 R5X4, 166 X4) viruses and 1151 from R5-tropic viruses. Most of the sequences are from subtype B (~52%), subtype C (~17%) and subtype D (~9%). However, 22% of the sequences spread over many different subtypes. Sequences, tropism and subtype information was extracted from the Los Alamos HIV database (http://www.hiv.lanl.gov/).
All sequences were transformed to a uniform length of 35 and encoded by the numerical hydropathy descriptor using the Interpol package of R. Interpol uses a numerical representation of the amino acids (here: hydropathy) and concatenates these data points. From the resulting curve samples are taken by equal interval. These samples are then used as features for classification.
Linear interpolation on a set of data points (x 0,y 0),(x 1,y 1),…,(x n ,y n ) is defined as the concatenation of linear interpolants between each pair of successive data points. The normalization factor is defined as the number of samples taken (by equal interval) from the aforementioned curves to generate an input for the subsequent classification. Here we used a normalization factor of 35.
In the V3 model, the datapoints were clustered using the k-Means algorithm in R. The number of centers was set to 35.
Random forests were used for developing a classification model, as implemented in the randomForest package of R. Receiver operating characteristics (ROC) curves were calculated and analyzed using the ROCR package. Cross-validation was performed based on a leave-one-patient-out scheme according to Dybowski et al., where the random forest was trained on all sequences of all patients except one patient, and the tropisms of the sequences of the remaining patient were predicted. This was repeated for each patient. This patient-wise-cross-validation was repeated for 10 times to average prediction performance. Area under the curve (AUC) values are shown as a ± δ, marking the average and the 95% confidence interval estimated with a t-distribution. Feature importance was assessed using the built-in function of the randomForest package and estimated by the sum of all decreases in Gini impurity, which has been shown to be more robust compared to the mean decrease in accuracy. For statistical comparison we used Wilcoxon signed-rank test on the AUC distributions.
Comparison with other methods
For comparison with other prediction methods, we used the independent testset of Dybowski et al.. This testset consists of 74 sequences from different HIV-1 subtypes. We compared T-CUP 2.0 with geno2pheno, wetcat and the recently developed method of Bozek et al..
Results and discussion
The motivation for a new prediction model for HIV-1 co-receptor usage is that T-CUP has a very high accuracy, but is rather slow due to the modeling process of the V3 loops with Modeller and the calculation of the electrostatic hull with APBS. This is especially important for new sequencing technologies, i.e. next-generation sequencing, where millions of sequences are generated per sample and fast predictions are needed to be applicable in routine diagnostics. Albeit T-CUP has been demonstrated to be applicable to next-generation sequencing data in principle, it lacks computational efficiency. Therefore, we decided to develop a novel structural descriptor that is both, highly accurate with regard to subsequent classification, but also very efficient with regard to computing time.
The original T-CUP electrostatic classifier was based on an ESP hull above the V3 loop surface. Discretized values of the potential were used to predict co-receptor usage. However the calculation of these electrostatic potential values required solving the Poisson-Boltzmann equation (PBE), which is computationally expensive. The discretized ESP values are the result of charges found on the V3 loop and change when different values for the permittivity of these charges are assumed. In our original publication, we tested different values for permittivity for the solvent and protein. We found that the T-CUP electrostatic model has a good accuracy for a dielectric constant of 5 inside and outside the protein and an ionic strength of zero. Although these values do not reflect physiological conditions, the prediction accuracy was best. For T-CUP 2.0, we exploit the fact that under these conditions (equal permittivity of solvent and protein) the Poisson-Boltzmann equation can be reduced to a potential based on Coulomb’s law.
In our new model, in the following referred to as T-CUP 2.0, coordinates of the C α atoms of the V3 structure of Huang et al. were used as the basis for model building. First, V3 sequences were encoded with Interpol using the net charge descriptor and transformed to 35 values. For each sequence, the interpolated charge values were assigned to the template C α coordinates. In this way, we can simplify the solution for the NP-hard side-chain packing problem faced during the modeling step of the original T-CUP. Next, the V3 structure was placed into a three-dimensional grid with a spacing of 1. Grid points that lay within a distance of[5, 6] to any of the C α were extracted. The resulting 8372 points were reduced by k-Means clustering (k = 35). Here, k was set to match the number of C α atoms present in the model and thus serving as cluster centers. In the last step, the electrostatic potential (ϕ(x i )) for each of the cluster centers x i was calculated by a vacuum model.
The resulting ϕ(x i ) values were then used as a descriptor for the classifier, similar to the ESP values in the former model.
Besides the electrostatic vacuum model, we also trained a classifier using the hydropathy scale according to Dybowski et al.. The V3 sequences were encoded and interpolated to a length of 35 using Interpol. The outputs of the vacuum model and the hydropathy model were then combined via stacking according to Dybowski et al..
Comparison of different prediction methods
Bozek et al.*
Bozek et al.
Bozek et al.
We took V3 loops from two X-ray structures as templates, both with V3 in the context of CD4 bound gp120 and an antibody, to test whether the template structure of the V3 loop has an impact on the classification performance. First the V3 structure from PDB entry 2b4c was used, based on the R5-tropic strain JR-FL and bound to antibody X5. For the second round we used V3 from PDB entry 2qad, based on R5-tropic strain YU2 and complexed with the sulfated antibody 412d. The latter V3 structure is less open in the central bulge region of V3 as it there binds one of the sulfate groups. Both V3 sequences have a length of 35 amino acids and carry only three conservative mutations. Surprisingly, using a more recent V3 structure, we found no significant differences in AUC distributions according to the Wilcoxon signed-rank test (p = 0.25). Moreover, we also tested a model that, besides hydrophobicity and charge, employs the length of the V3 sequences. This is motivated by the notion that the sequence length in the training set is rather narrowly distributed in R5-tropic (83% have length of 35 amino acids), while being broader in X4-tropic viruses (only 52.5% have length of 35 amino acids). However, incorporating length information into the classification system did not improve classification performance (Wilcoxon signed-rank test p = 0.44).
Evaluation on NGS data
Both algorithms, T-CUP and T-CUP 2.0 have a linear computational complexity (in O(n)). T-CUP 2.0 is completely written in R and up to 14-times faster than the original T-CUP. For data resulting from a 454/Roche GS FLX sequencing run and consisting of around 1 million reads, T-CUP 2.0 needs 3.1 days on a single CPU, while T-CUP requires 43.7 days. Both systems can be run in parallel, leading to around 9 hours for T-CUP 2.0 (two Quadcore server with IntelXeon(R) CPU E5462 @ 2.80GHz and 32GB RAM), and five days for T-CUP still.
T-CUP 2.0 performs superior compared to T-CUP in both, the patient-wise cross-validation and on the independent test set. When comparing prediction results between T-CUP 2.0 and T-CUP the correlation is relatively high r = 0.702 ± 0.001. The disagreement D = 0.1001 ± 0.0014, indicates that classifiers can be combined into a final, more accurate prediction. Also Chueca et al. have suggested to combine different prediction models to improve overall prediction performance. Additionally, we have demonstrated the use of our method with next-generation sequencing data. The predicted X4-fraction is in agreement with results obtained from phenotypic assays. However, computational method such as T-CUP 2.0 are able to detect smaller fractions of X4-viruses compared to current phenotypic assays and thus should be considered as more sensitive for diagnostics issues with next-generation sequencing data. Finally, we are convinced that making the T-CUP 2.0 R-package available to other researchers is a valuable contribution to the field.
Availability and requirements
Project name: T-CUP 2.0
Project home page: http://www.uni-due.de/~hy0546/TCUP2.zip
Operating system(s): cross-platform (64bit architecture)
Programming language: R
Other requirements: R 3.0 or higher
License: GNU LGPL
Any restrictions: none
This work was supported by Deutsche Forschungsgemeinschaft, project TRR60/A6.
- D’Souza MP, Harden VA: Chemokines and HIV-1 second receptors Confluence of two fields generates optimism in AIDS research. Nat Med. 1996, 2 (12): 1293-1300. 10.1038/nm1296-1293.View ArticlePubMedGoogle Scholar
- Koot M, Keet IP, Vos AH, de Goede RE, Roos MT, Coutinho RA, Miedema F, Schellekens PT, Tersmette M: Prognostic value of HIV-1 syncytium-inducing phenotype for rate of CD4+ cell depletion and progression to AIDS. Ann Intern Med. 1993, 118 (9): 681-688. 10.7326/0003-4819-118-9-199305010-00004.View ArticlePubMedGoogle Scholar
- Dorr P, Westby M, Dobbs S, Griffin P, Irvine B, Macartney M, Mori J, Rickett G, Smith-Burchnell C, Napier C, Webster R, Armour D, Price D, Stammen B, Wood A, Perros M: Maraviroc (UK-427,857), a potent, orally bioavailable, and selective small-molecule inhibitor of chemokine receptor CCR5 with broad-spectrum anti-human immunodeficiency virus type 1 activity. Antimicrob Agents Chemother. 2005, 49 (11): 4721-4732. 10.1128/AAC.49.11.4721-4732.2005.View ArticlePubMedPubMed CentralGoogle Scholar
- Tagat JR, McCombie SW, Nazareno D, Labroli MA, Xiao Y, Steensma RW, Strizki JM, Baroudy BM, Cox K, Lachowicz J, Varty G, Watkins R: Piperazine-based CCR5 antagonists as HIV-1 inhibitors. IV. Discovery of 1-[(4,6-dimethyl-5-pyrimidinyl)carbonyl]- 4-[4-[2-methoxy-1(R)-4-(trifluoromethyl)phenyl]ethyl-3(S)-methyl-1-piperazinyl]- 4-methylpiperidine (Sch-417690/Sch-D), a potent, highly selective, and orally bioavailable CCR5 antagonist. J Med Chem. 2004, 47 (10): 2405-2408. 10.1021/jm0304515.View ArticlePubMedGoogle Scholar
- Hwang SS, Boyle TJ, Lyerly HK, Cullen BR: Identification of the envelope V3 loop as the primary determinant of cell tropism in HIV-1. Science. 1991, 253 (5015): 71-74. 10.1126/science.1905842.View ArticlePubMedGoogle Scholar
- Fouchier RA, Groenink M, Kootstra NA, Tersmette M, Huisman HG, Miedema F, Schuitemaker H: Phenotype-associated sequence variation in the third variable domain of the human immunodeficiency virus type 1 gp120 molecule. J Virol. 1992, 66 (5): 3183-3187.PubMedPubMed CentralGoogle Scholar
- Resch W, Hoffman N, Swanstrom R: Improved success of phenotype prediction of the human immunodeficiency virus type 1 from envelope variable loop 3 sequence using neural networks. Virology. 2001, 288: 51-62. 10.1006/viro.2001.1087.View ArticlePubMedGoogle Scholar
- Jensen MA, Li FS, van Wout AB, Nickle DC, Shriner D, He HX, McLaughlin S, Shankarappa R Margolick: Improved Coreceptor usage prediction and genotypic monitoring of R5-to-X4 transition by motif analysis of human immunodeficiency virus type 1 env V3 loop sequences. J Virol. 2003, 77: 13376-13388. 10.1128/JVI.77.24.13376-13388.2003.View ArticlePubMedPubMed CentralGoogle Scholar
- Pillai S, Good B, Richman D, Corbeil J: A new perspective on V3 phenotype prediction. AIDS Res Hum Retroviruses. 2003, 19 (2): 145-149. 10.1089/088922203762688658.View ArticlePubMedGoogle Scholar
- Dybowski JN, Heider D, Hoffmann D: Prediction of co-receptor usage of HIV-1 from genotype. PLoS Comput Biol. 2010, 6 (4): e1000743-10.1371/journal.pcbi.1000743.View ArticlePubMedPubMed CentralGoogle Scholar
- Huang CC, Tang M, Zhang MY, Majeed S, Montabana E, Stanfield RL, Dimitrov DS, Korber B, Sodroski J, Wilson IA, Wyatt R, Kwong PD: Structure of a V3-containing HIV-1 gp120 core. Science. 2005, 310 (5750): 1025-1028. 10.1126/science.1118398.View ArticlePubMedPubMed CentralGoogle Scholar
- Breiman L: Random forests. Mach Learn. 2001, 45: 5-32. 10.1023/A:1010933404324.View ArticleGoogle Scholar
- Kyte J, Doolittle R: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982, 157: 105-132. 10.1016/0022-2836(82)90515-0.View ArticlePubMedGoogle Scholar
- Thompson TB, Chuo KC, Zheng C: Neural network prediction of the HIV-1 protease cleavage sites. J Theor Biol. 1995, 177: 369-379. 10.1006/jtbi.1995.0254.View ArticlePubMedGoogle Scholar
- Pánek J, Eidhammer I, Aasland R: Using hydropathy features for function prediction of membrane proteins. Mol Membrane Biol. 2007, 24 (4): 304-12. 10.1080/09687860701213884.View ArticleGoogle Scholar
- Chowriappa P, Dua S, Kanno J, Thompson HW: Protein structure classification based on conserved hydrophobic residues. IEEE/ACM Trans Comput Biol Bioinformatics / IEEE, ACM. 2008, 6 (4): 639-51.View ArticleGoogle Scholar
- Heider D, Hauke S, Pyka M, Kessler D: Insights into the classification of small GTPases. Adv Appl Bioinformatics Chem. 2010, 3: 15-24.View ArticleGoogle Scholar
- Heider D, Verheyen J, Hoffmann D: Predicting Bevirimat resistance of HIV-1 from genotype. BMC Bioinformatics. 2010, 11: 37-10.1186/1471-2105-11-37.View ArticlePubMedPubMed CentralGoogle Scholar
- van den Boom J, Heider D, Martin SR, Pastore A, Mueller JW: 3’-Phosphoadenosine 5’-phosphosulfate (PAPS) synthases, naturally fragile enzymes specifically stabilized by nucleotide binding. J Biol Chem. 2012, 287 (21): 17645-17655. 10.1074/jbc.M111.325498.View ArticlePubMedPubMed CentralGoogle Scholar
- Wolpert D: Stacked generalization. Neural Netw. 1992, 5: 241-260. 10.1016/S0893-6080(05)80023-1.View ArticleGoogle Scholar
- Lengauer T, Sander O, Sierra S, Thielen A, Kaiser R: Bioinformatics prediction of HIV coreceptor usage. Nat Biotechnol. 2007, 25 (12): 1407-1410. 10.1038/nbt1371.View ArticlePubMedGoogle Scholar
- Sander O, Sing T, Sommer I, Low AJ, Cheung PK, Harrigan PR, Lengauer T, Domingues FS: Structural descriptors of gp120 V3 loop for the prediction of HIV-1 coreceptor usage. PLoS Comput Biol. 2007, 3 (3): e58-10.1371/journal.pcbi.0030058.View ArticlePubMedPubMed CentralGoogle Scholar
- Bozek K, Lengauer T, Sierra S, Kaiser R, Domingues FS: Analysis of physicochemical and structural properties determining HIV-1 coreceptor usage. PLoS Comput Biol. 2013, 9 (3): e1002977-10.1371/journal.pcbi.1002977.View ArticlePubMedPubMed CentralGoogle Scholar
- Díez-Fuertes F, Delgado E, Vega Y, Fernández-García A, Cuevas MT, Pinilla M, García V, Pérez-Álvarez L, Thomson MM: Improvement of HIV-1 coreceptor tropism prediction by employing selected nucleotide positions of the env gene in a Bayesian network classifier. J Antimicrob Chemother. 2013, 68 (7): 1471-1485. 10.1093/jac/dkt077.View ArticlePubMedGoogle Scholar
- Hou T, Zhang W, Wang J, Wang W: Predicting drug resistance of the HIV-1 protease using molecular interaction energy components. Proteins. 2009, 74 (4): 837-846. 10.1002/prot.22192.View ArticlePubMedPubMed CentralGoogle Scholar
- Dybowski JN, Riemenschneider M, Hauke S, Pyka M, Verheyen J, Hoffmann D, Heider D: Improved Bevirimat resistance prediction by combination of structural and sequence-based classifiers. BioData Min. 2011, 4: 26-10.1186/1756-0381-4-26.View ArticlePubMedPubMed CentralGoogle Scholar
- Kuncheva LI, Whitaker CJ: Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn. 2003, 51 (2): 181-207. 10.1023/A:1022859003006.View ArticleGoogle Scholar
- Heider D, Hoffmann D: Interpol: An R package for preprocessing of protein sequences. BioData Min. 2011, 4: 16-10.1186/1756-0381-4-16.View ArticlePubMedPubMed CentralGoogle Scholar
- R Development Core Team: R: A Language and Environment for Statistical Computing. 2006, Vienna, Austria: R Foundation for Statistical Computing, [ISBN 3-900051-07-0,http://www.R-project.org],Google Scholar
- Liaw A, Wiener M: Classification and regression by randomForest. R News. 2002, 2 (3): 18-22.Google Scholar
- Sing T, Sander O, Beerenwinkel N, Lengauer T: ROCR: visualizing classifier performance in R. Bioinformatics. 2005, 21 (20): 3940-3941. 10.1093/bioinformatics/bti623.View ArticlePubMedGoogle Scholar
- Calle ML, Urrea V: Letter to the editor: stability of random forest importance measures. Brief Bioinform. 2010, 12: 86-89.View ArticlePubMedGoogle Scholar
- Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234 (3): 779-815. 10.1006/jmbi.1993.1626.View ArticlePubMedGoogle Scholar
- Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA: Electrostatics of nanosystems: application to microtubules and the ribosome. Proc Natl Acad Sci U S A. 2001, 98 (18): 10037-10041. 10.1073/pnas.181342398.View ArticlePubMedPubMed CentralGoogle Scholar
- Dybowski JN, Heider D, Hoffmann D: Structure of HIV-1 quasi-species as early indicator for switches of co-receptor tropism. AIDS Res Ther. 2010, 7: 41-10.1186/1742-6405-7-41.View ArticlePubMedPubMed CentralGoogle Scholar
- Klein P, Kanehisa M, DeLisi C: Prediction of protein function from sequence properties: Discriminant analysis of a data base. Biochim Biophys Acta. 1984, 787: 221-226. 10.1016/0167-4838(84)90312-1.View ArticlePubMedGoogle Scholar
- Akutsu T: NP-hardness results for protein side-chain packing. Genome Inform. 1997, 8: 180-186.Google Scholar
- Heider D, Verheyen J, Hoffmann D: Machine learning on normalized protein sequences. BMC Res Notes. 2011, 4: 94-10.1186/1756-0500-4-94.View ArticlePubMedPubMed CentralGoogle Scholar
- Huang CC, Lam SN, Acharya P, Tang M, Xiang SH, Hussan SSU, Stanfield RL, Robinson J, Sodroski J, Wilson IA, Wyatt R, Bewley CA, Kwong PD: Structures of the CCR5 N terminus and of a tyrosine-sulfated antibody with HIV-1 gp120 and CD4. Science. 2007, 317 (5846): 1930-1934. 10.1126/science.1145373.View ArticlePubMedPubMed CentralGoogle Scholar
- Tsibris AMN, Korber B, Arnaout R, Russ C, Lo CC, Leitner T, Gaschen B, Theiler J, Paredes R, Su Z, Hughes MD, Gulick RM, Greaves W, Coakley E, Flexner C, Nusbaum C, Kuritzkes DR: Quantitative deep sequencing reveals dynamic HIV-1 escape and large population shifts during CCR5 antagonist therapy in vivo. PLoS One. 2009, 4 (5): e5683-10.1371/journal.pone.0005683.View ArticlePubMedPubMed CentralGoogle Scholar
- Chueca N, Garrido C, Alvarez M, Poveda E, de Dios Luna J, Zahonero N, Hernández-Quero J, Soriano V, Maroto C, de Mendoza C, García F: Improvement in the determination of HIV-1 tropism using the V3 gene sequence and a combination of bioinformatic tools. J Med Virol. 2009, 81 (5): 763-767. 10.1002/jmv.21425.View ArticlePubMedGoogle Scholar
- Archer J, Weber J, Henry K, Winner D, Gibson R, Lee L, Paxinos E, Arts EJ, Robertson DL, Mimms L, Quiñones-Mateu ME: Use of four next-generation sequencing platforms to determine HIV-1 coreceptor tropism. PLoS One. 2012, 7 (11): e49602-10.1371/journal.pone.0049602.View ArticlePubMedPubMed CentralGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.