AlphaFold: Highly accurate protein structure prediction for WHO priority pathogens

I learned today from our colleagues at WHO that AlphaFold has recently added the proteomes of nearly all the WHO priority pathogens to its library of highly accurate protein structure predictions. AlphaFold was new to me, and it is amazing! Per our colleagues at WHO:

  • AlphaFold is an AI system developed by DeepMind that predicts a protein’s 3D structure from its amino acid sequence.
  • It regularly achieves accuracy competitive with experimental results.
  • DeepMind and EMBL’s European Bioinformatics Institute (EMBL-EBI) have partnered to make these predictions freely available to the scientific community.
  • The database will be expanded during 2022 to cover a large proportion of all catalogued proteins (the more than 100 million in UniRef90).

I’ve listed all current proteomes in the AlphaFold database below my signature. The list is very broad, so I’ll name just a few to give you the idea: mammals (human, mouse, rat), plants (maize, soybean), fungi (Candida albicans, Fonsecaea pedrosoi), parasites (Brugia malayi, Trypanosoma brucei), and bacteria (S. aureus, M. tuberculosis).

Truly science at its best … this is a stunning resource! I’ve known about DeepMind and chess (and other games), but being able to predict protein structures without the slow, expensive process of crystallography (which is not even possible for some proteins) would seem to open many, many research avenues. Wow! Many thanks to the teams at DeepMind and EMBL!

Current list (alphabetical order)

  1. Ajellomyces capsulatus
  2. Arabidopsis thaliana
  3. Brugia malayi
  4. Caenorhabditis elegans (Nematode worm)
  5. Campylobacter jejuni
  6. Candida albicans
  7. Cladophialophora carrionii
  8. Danio rerio (Zebrafish)
  9. Dictyostelium discoideum
  10. Dracunculus medinensis
  11. Drosophila melanogaster (Fruit fly)
  12. Enterococcus faecium
  13. Escherichia coli
  14. Fonsecaea pedrosoi
  15. Glycine max (Soybean)
  16. Haemophilus influenzae
  17. Helicobacter pylori
  18. Homo sapiens (Human)
  19. Klebsiella pneumoniae
  20. Leishmania infantum
  21. Madurella mycetomatis
  22. Methanocaldococcus jannaschii
  23. Mus musculus (Mouse)
  24. Mycobacterium leprae
  25. Mycobacterium tuberculosis
  26. Mycobacterium ulcerans
  27. Neisseria gonorrhoeae
  28. Nocardia brasiliensis
  29. Onchocerca volvulus
  30. Oryza sativa (Asian rice)
  31. Paracoccidioides lutzii
  32. Plasmodium falciparum
  33. Pseudomonas aeruginosa
  34. Rattus norvegicus (Rat)
  35. Saccharomyces cerevisiae (Budding yeast)
  36. Salmonella typhimurium
  37. Schistosoma mansoni
  38. Schizosaccharomyces pombe (Fission yeast)
  39. Shigella dysenteriae
  40. Sporothrix schenckii
  41. Staphylococcus aureus
  42. Streptococcus pneumoniae
  43. Strongyloides stercoralis
  44. Trichuris trichiura
  45. Trypanosoma brucei
  46. Trypanosoma cruzi
  47. Wuchereria bancrofti
  48. Zea mays (Maize)

