It's just a model! Understanding x-ray crystallography & cryo-EM maps, models, and resolution

Поделиться
HTML-код
  • Опубликовано: 12 сен 2024

Комментарии • 5

  • @thebumblingbiochemist
    @thebumblingbiochemist  Год назад +3

    Note that I talked about “right” but by “right” here I just mean that it matches the molecule’s structure in the sample the data was collected from. Molecules like proteins are often quite dynamic and we can only see “snapshots” of particular “poses” (conformations) they take. In crystallography we usually just see a single conformation because, as I will get into, the technique requires the copies of the protein to all arrange themselves in the same way to form a crystal. In cryo-EM, you freeze the molecules in place in a thin layer of water so the molecules can be in different conformations but in order to actually “solve structures” of a conformation, you need a large enough number of copies of the molecule to be in that conformation to get enough info for a high-res map (because you are going to be averaging data from the different copies).  

    Building off the concept of proteins being flexible and dynamic, let’s go back to the idea of resolution and see how this comes into play…  What is typically reported as “the” resolution is really just the *overall* resolution. But it’s really important to appreciate that local resolution may be worse (sometimes way worse!) than the overall resolution that is reported. Imagine taking a picture of a track meet. Even if you had a great camera, the runner might look blurry even though the spectators look crisp. Similarly, although the physics behind it are different, regions of a map corresponding to parts of the molecule that are more flexible or dynamic (often loops and things on the surface) might look more sausagey. And thus harder (or even impossible) to build a model into. So often those atoms are left unmodeled and you can see which are missing if you look at the PDB file (more on this later). Sometimes scientists will show this as dashed lines in their models.  

    For example, sometimes whole amino acids in proteins (especially the termini (chain ends) are left unmodeled. And sometimes you can just make out the backbone (less flexible) but can’t make out the side chain so it’s only partially modeled or an alanine is modeled on instead of the amino acid that’s actually there since alanine is pretty “generic” structure-wise. More on that in my alanine post. bit.ly/alaninechirality 

    Bottom line thing to keep in mind is that scientists can make out details better in some areas of the map than in others, and thus can model those regions more confidently. These regions have a low B-factor (aka temperature or displacement factor). On the other hand, regions with a high B-factor have more "iffy" data - it's less clear how to model in the underlying atoms. If you view a structure in something like PyMol, Chimera, or even the PDB web browser now, you can color the model by B-factor. Typically it’s displayed as a heat map with red corresponding to high B factor. You tend to see a lot of red on the periphery of the protein a blue inside.  

    No matter where in a molecule some feature is located, even if it has a low B factor, you want to make sure you actually view the maps yourself before relying on the model if you are trying to interpret some claim about it, or make some hypothesis or something. So I made a video explaining how to do this. Hope it helps.  

    Before you judge a structure based on its resolution, know that resolution isn’t the only thing that matters and even a low resolution structure can provide valuable information, especially if used in concert with high res info. Such as, for example, placing models from high res crystal structures of individual complex components into a low res cryo map of the whole complex.  

    As for the biophysics going on behind the scene leading to the resolution that is seen? Well, it’s different for x-ray crystallography and cryo-EM. And it’s also different how they determine it. I’m more familiar with how it’s done in crystallography so I will describe in detail below (adapted from older post). But if you want to learn more about the cryo-EM stuff, I talk about it some in the video and here are some links to papers I recommend.  

    Beckers, M., Mann, D., & Sachse, C. (2021). Structural interpretation of cryo-EM image reconstructions. Progress in biophysics and molecular biology, 160, 26-36. doi.org/10.1016/j.pbiomolbio.2020.07.004 

    Marques, M. A., Purdy, M. D., & Yeager, M. (2019). CryoEM maps are full of potential. Current opinion in structural biology, 58, 214-223. doi.org/10.1016/j.sbi.2019.04.006  

    D'Imprima, E., & Kühlbrandt, W. (2021). Current limitations to high-resolution structure determination by single-particle cryoEM. Quarterly reviews of biophysics, 54, e4. doi.org/10.1017/S0033583521000020  

    Here’s that crystallography info though… 

    And more here: blog form: bit.ly/resolvingresolution ; blog form with lots more on crystallography: bit.ly/xraycrystallography2 ; RUclips: ruclips.net/video/_G7AKFz0cgc/видео.html

    • @thebumblingbiochemist
      @thebumblingbiochemist  Год назад +2

      In crystallography, we get proteins (or other molecules) to give up their watery coats (undissolve) and organize themselves into crystals made up of lots of individual protein molecules organized in a repeating 3D pattern. And then we use x-rays to try to figure out where the atoms in a molecules are. Atoms themselves are made up of smaller pieces (subatomic particles) - positively charged protons & neutral neutrons clump together in a central nucleus & negatively-charged electrons (e⁻) whizz around them   
        
      When we beam x-rays at crystals, the e⁻ scatter the x-rays. We record where those scattered rays hit a detector and this gives us a pattern of spots called the DIFFRACTION PATTERN. We work backwards from the spots to figure out where they scattered from to get an ELECTRON DENSITY MAP (a blobby thing showing the location of the e⁻). Then we build an ATOMIC MODEL of the protein (sticky or ribbony thing) into the map to “solve” the protein’s structure  
        
      e⁻ are what scatter the rays (they’re able to interact with the electric field of X-rays (a form of Electromagnetic radiation), but it’s the nuclei we’re really interested in. Thankfully the nuclei are the central hubs around which e⁻ concentrate, so if we can find where e⁻ are, we can “stick in” the nuclei they correspond to. How confidently we can do this depends on the resolution of our data. In HIGH RESOLUTION data, you can distinctly see the signal coming from individual atoms/ BUT at LOWER RESOLUTION, their signals blend together so you see something more sausagey. So you know the atoms are in that general vicinity, but you can’t make out where exactly.  
        
      So what determines the resolution? Sorry if I get too technical, but…  
        
      Pictures often look like X-rays are “bouncing off” of atoms in a protein - but at the really tiny level it’s not like billiard balls bouncing off the sides of a pool table. Instead it’s more like tossing a billiard ball into a pool and then watching the ripples. The ball isn’t bouncing off the water, but it is perturbing it. The electric field of X-rays perturbs the electron clouds surrounding the nuclei of atoms, causing them to vibrate and give off their own waves (in all directions, but it’s convenient to draw them in just 1).   
        
      But as these individual waves travel they join up w/other waves to give us a resultant overall 3D wave. We call this adding together of waves a FOURIER TRANSFORM. Waves add together through something called superposition. If the waves are completely “in phase” (peaking at the same time) they’ll add CONSTRUCTIVELY and the resultant wave’ll be amplified. But if they’re out of phase - some peaking while others trough-ing this can dampen the wave - DESTRUCTIVE INTERFERENCE.   
        
      Due to the orderly, repetitive spacing of crystals, for each mini wave there’s almost always a wave completely out of phase and thus totally destructing that wave. But if the spacing’s just right and the waves are in sync, they add together constructively and produce a stronger wave that we can capture on our detector as a spot. Now gets a little tricky theory-wise but makes things a lot easier to work with and think about model-wise, because although this mini-waving is what’s happening on the single atom level, we can *think* of “slicing” crystals into families of evenly-spaced planes that *do* act like pool table walls, bouncing off incoming x-rays and giving you a dot on your detector. If you just take my word for it for now, keep reading to learn about how this relates to resolution. But if you want more deets skip ahead or hold off and I will get into it in more detail for those interested.   
        
      Each dot on the diffraction pattern corresponds to a family of planes with spacing that meet “Bragg’s conditions” (more below) so they’re sometimes called Bragg’s planes. And they’re kinda like pixels in that they’re crystallography’s “smallest units of displayed info” & tell you about what’s contained between the planes - so the closer the slicing, the higher the resolution. BUT unlike pixels, which are physical, separate, entities, these planes can “overlap” because they’re not physical things. They don’t really exist, it’s just a mathematical way to represent crystals - like virtually slicing a banana. We get data from slices from different “angles” so we get information about all directions (like slicing a banana hotdog *and* hamburger style but a lot more ways than just 2).   
        
      The thinner the slices, the higher the resolution - so you should be able to just keep slicing thinner and thinner - but we don’t see an infinite number of dots?! Well, kinda like how it was easier to tell dots apart when they were physically further apart, the closer the plane spacing (thus higher resolution), the further apart their dots.  
        
      So the spots from close together planes (thinner slicing, finer detail) are further from the center of the diffraction pattern. it’s like a reverse dart board where the high-res info is further from the bulls-eye. There are a lot fewer of these high-res spots & their signal’s less intense, so in order to capture them, our detector needs to be big & sensitive enough. BUT if it’s too sensitive, it can pick up “artifacts” that aren’t really coming from the protein. At some point, the noise (σ) starts to outweigh the true signal (I), so if we were to include more points we’d be adding noise instead of information. We have to decide were to make this cut-off & there are different statistical measures (including I/σ) we use to decide what “ring on the dart board” (resolution shell) to cut off at . 
        
      The reported resolution corresponds to where we make this cut-off in terms of what data to use - unlike with digital screens where the resolution is objective (can the screen display 2 dots or not?) we have to use some standardized subjectivity when figuring out resolution with this data - we have the data but when can we stop trusting it?  
        
      Often data quality measurements are reported for all the used data combined as well as for “highest resolution shell” (just outer ring) so readers can get a sense of whether they “cut” appropriately & didn’t try to use higher resolution “data” that was really just noise.  
        
      The cut-off’s really important because it limits what we even have a chance of seeing.  
        
      At low resolutions (higher numbers), we can make out things like the protein backbone. But it’s harder to be confident about the position of things like side chains (the unique parts that stick off the backbone). Once you get to higher resolutions, you start to be able to make out the side chains & their orientations. We call structures w/resolution at or better than 1.2 Å “atomic resolution” because we can confidently make out the location of all the atoms. BUT this is rare in protein crystallography - the average published structure is ~2 but some of the most exciting structures are lower resolution but of more “complex” things (which tend to be more “dynamic” and unwilling to sit nicely still in the precisely ordered arrangements required for crystallography). If even one of the thousands of individual protein molecules in a crystal is slightly different, it jumbles everything up  
        
      This is because scattered x-rays from different points on the protein interfere w/one another (either cancelling each other out or making the signal stronger) so that each diffraction spot contains some information about EVERY point in the protein, so the overall resolution corresponds to the whole protein BUT some regions can still be “fuzzier” than others - like places where atoms move around more. We can describe this local fuzziness with that B-factor (aka displacement factor or temperature factor) we talked about before. Areas with a high B-factor are more “sausagey” and thus harder to model accurately.  
        
      As I mentioned before, sometimes regions of a protein (often places like flexible loops and “intrinsically disordered regions” move around so much we can’t even tell there are atoms there so we have to leave those “unresolved regions “of the protein out of the structure we build even though we know there are actually protein parts there.

    • @thebumblingbiochemist
      @thebumblingbiochemist  Год назад +3

      Back to the detail on the planes as promised  
        
      You know how in CT scans they take images in slices? Well. we can *imagine* slicing up a a protein similarly. You can imagine slicing a crystal into families of evenly-spaced planes that waves *do* bounce off of. All the beams would hit these planes at the same angle, so they’d bounce off at the same angle. But they’d hit the detector with their phase shifted (peaking earlier or later in relation to the other waves coming off of other planes in family. So the bounced off waves would then almost always cancel each other out. But families of planes with certain spacings will add together constructively. This happens when the conditions of “Bragg’s Law” are fulfilled.   
        
      Their phase shift depends on wavelength (λ), angle incoming wave hits (which is also angle it bounces off) (θ) & distance between planes (d). Bragg’s Law says that in order for constructive interference to occur,  
        
      nλ = 2dsinθ  
        
      Why? Incoming x-ray wave hits one plane before the under it. It has to travel further to reach lower plane & reflected ray will has to travel further to reach detector. Bragg’s law tells you the condition needed for combined length of “detour” to be an integer # (n) of λ so waves gets back in sync  
        
      And we call the families of planes that fit these conditions “Bragg’s planes” or “hkl planes” where hkl is a notation used to describe how many slices are made in each dimension. Each dot in a diffraction pattern corresponds to a family of planes.   

      Finally, a note on a potential source of confusion… going back to the starring idea of 1 dot vs 2 dots, - with digital screens, whether or not you see 2 dots or one (assuming they’re far enough apart that your eyes can tell too) depends on whether the display shows 2 dots or 1. It’s kinda like one of those “LiteBright” toys where you have different color pegs you can light up - the smallest thing you can represent is a single peg. So if you want to display 2 dots that are smaller/closer together than the size of a single peg, you’re out of luck.  
        
      Instead of “light pegs,” digital screens use pixels, which are separately controllable “picture elements” that can display different colors and intensities of light. If pixels are smaller than the thing you want to show, multiple pixels can work together to show it, but you can’t show things smaller than a pixel (though some displays use subpixels which I”m not going to get into). So, even if your eyes would be able to tell that there were 2 dots in the area covered by a pixel, the computer can’t display it as 2 dots - it can only show you a single dot.   
        
      Screen resolution is usually reported as the pixel dimensions - like 10 pixels wide by 5 pixels tall. Though that would make a pretty lame screen! Instead common screen resolutions are 1280 x 720 (720p); 1920 x 1080 (1080p); 2560 x 1440 (1440p); 3840 x 2160 (4K or 2160p); 7680 x 4320 (8K or 4320p). If you want to display finer detail, you need to use a high pixel density - pack more pixels into the same area. This is how you can get better crispness and it’s usually reported as pixels per inch (psi).  
        
      So, for screens, high resolution corresponds to high numbers, but for structural biology, high resolution refers to low numbers. And this can really trip people up (I know I found it confusing when I started out, being used to hearing about high resolution monitors, with companies bragging about their “megapixel” displays (displays with millions of pixels).  

      More on digital resolution here: bit.ly/resolvingresolution 

      more on understanding crystal structures: bit.ly/crystalstructuremodels & ruclips.net/video/YK3VkqD2o2s/видео.html 

      more on cryo-EM here: bit.ly/cryoemxray & bit.ly/cryoEMbumblyintro & ruclips.net/video/qcS6oZlEtH0/видео.html 

      intro to PDB, crystal structure entries, crystal contents, resolution: bit.ly/pdbstructures & ruclips.net/video/Re2gwi-_OEw/видео.html & ruclips.net/video/IZtHsUFbyes/видео.html   

      links to more about the GFP example structure: 

      www.rcsb.org/structure/2b3p  

      Pédelacq, J. D., Cabantous, S., Tran, T., Terwilliger, T. C., & Waldo, G. S. (2006). Engineering and characterization of a superfolder green fluorescent protein. Nature biotechnology, 24(1), 79-88. doi.org/10.1038/nbt1172  

      pdb101.rcsb.org/motm/42  
             
      more about all sorts of things: #365DaysOfScience All (with topics listed) 👉 bit.ly/2OllAB0 or search blog: thebumblingbiochemist.com

  • @samuelmensahbaffoe8303
    @samuelmensahbaffoe8303 Год назад

    I have been following your graphics for a while but never knew you had a RUclips channel
    Great work !!