|
| 1 | +.. _FlexEFP_fmo.rst: |
| 2 | + |
| 3 | +************************* |
| 4 | +FMO: Applied flexible EFP |
| 5 | +************************* |
| 6 | + |
| 7 | +Overview |
| 8 | +-------- |
| 9 | + |
| 10 | +As is the case with many photoactive proteins,computational methods struggle to reproduce |
| 11 | +experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by |
| 12 | +`Kim et al <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that |
| 13 | +flexible QM/EFP can be applied to FMO to correctly generate computational results in |
| 14 | +quantitative agreement to experimental spectra. |
| 15 | + |
| 16 | +The key to applying EFP to your system is to carefully define the active site and EFP region. |
| 17 | +FMO is a trimeric protein with eight bacteriochloropyll a (BChl) pigments in each monomer. |
| 18 | +FMO completes energy transfer via excitonic couplings across these eight BChls. A summary |
| 19 | +of the complete workflow that was performed is the following: 1) molecular dynamics (MD) |
| 20 | +simiulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry |
| 21 | +optimization of *each* active site (active sites consist of one BChl pigment and |
| 22 | +typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of |
| 23 | +each pigment. |
| 24 | + |
| 25 | +In the case of FMO, these steps must be repeated on several snapshots from MD to account |
| 26 | +for variation in the resting state of the structure, and the QM region must be defined |
| 27 | +carefully in both the QM/MM and flex-EFP stages. It might not be universally true that one |
| 28 | +must perform QM/MM geometry optimization. This page is a walkthrough for the flex-EFP procedure |
| 29 | +only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your |
| 30 | +system prior to these steps. |
| 31 | + |
| 32 | +.. image:: images/FMO_trimer_BCLs.bmp |
| 33 | + :width: 350 |
| 34 | + |
| 35 | +.. image:: images/FMO_mon_pigs.bmp |
| 36 | + :width: 400 |
| 37 | + |
| 38 | +You will need a structure file (.g96) and topology information (.top, for atom charges). In this specific case, |
| 39 | +a structure file is extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from |
| 40 | +the protein's surface have been removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry |
| 41 | +of each active chlorophyl molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches |
| 42 | +before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359, |
| 43 | +has been optimized and will be the QM region for the EFP calcuation. |
| 44 | + |
| 45 | +.. image:: images/fmo_waters15a.bmp |
| 46 | + :width: 400 |
| 47 | + |
| 48 | +First, an EFP region must be defined. Every amino acid, (non QM) BChl, and water molecule containing an |
| 49 | +atom within 15 angstroms of the QM BChl headring. |
| 50 | + |
| 51 | +The headring is defined by atomnames: MG CHA CHB HB CHC HC CHD HD NA C1A |
| 52 | +C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C |
| 53 | +H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D |
| 54 | +CED HED1 HED2 HED3 |
| 55 | + |
| 56 | +The headring surrounded by EFP region looks like this: |
| 57 | + |
| 58 | +.. image:: images/tester.bmp |
| 59 | + :width: 400 |
| 60 | + |
| 61 | +EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will be expressed individually. |
| 62 | +Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like |
| 63 | +to divide each residue by the C-C backbone bond, however, standard PDB listing convention divides residues by the C-N |
| 64 | +bond. To correct this, 'C' and 'O' atom names should be included in the following aminoc acid. This way the 'C' and 'CA' |
| 65 | +(carbonyl carbon and alpha carbon respectively) bond is the division between bonded fragments. |
| 66 | +See the example below: |
| 67 | + |
| 68 | +.. image:: images/pdb_67_col.bmp |
| 69 | + :width: 400 |
| 70 | + |
| 71 | +For EFP, we would like these two fragments to look like this: |
| 72 | + |
| 73 | +.. image:: images/efp_67_col.bmp |
| 74 | + :width: 400 |
| 75 | + |
| 76 | +The desired atoms are contained in the structure file, but they do not completely 'agree' with the amino acid numbering. |
| 77 | +Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O' |
| 78 | +have to be included in the following fragment. |
| 79 | + |
| 80 | +.. literalinclude:: ./examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96 |
| 81 | + :linenos: |
| 82 | + :lines: 79-101 |
| 83 | + :emphasize-lines: 10-21 |
| 84 | + |
| 85 | +Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient |
| 86 | +to treat BChl fragments as separate head and tail groups as is shown below: |
| 87 | + |
| 88 | +.. images/efp_headtail.bmp |
| 89 | + :width: 400 |
| 90 | + |
| 91 | +In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is |
| 92 | +missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non terminal |
| 93 | +amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the |
| 94 | +carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon |
| 95 | +of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA.' One virtual atom will be added to both |
| 96 | +head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed |
| 97 | +to the C-H equilibrium bond distance, 1.09 angstroms. |
| 98 | + |
| 99 | +.. images/efp_bothvirt.bmp |
| 100 | +
|
| 101 | +Once the system is properly fragmented, we can finally run EFP calculations in the precense of the polarizable, solvatochromic environment. |
| 102 | + |
| 103 | +EFP Workflow |
| 104 | +------------ |
| 105 | + |
0 commit comments