@@ -7,99 +7,98 @@ FMO: Applied flexible EFP
  Overview
  --------

- As is the case with many photoactive proteins, computational methods struggle to reproduce
- experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by
- `Kim et al. <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that
- flexible QM/EFP can be applied to FMO to generate computational results in
- quantitative agreement with experimental spectra.
-
- The key to applying EFP to your system is to carefully define the active site and EFP region.
- FMO is a trimeric protein with eight bacteriochlorophyll a (BChl) pigments in each monomer.
- FMO completes energy transfer via excitonic couplings across these eight BChls. In summary,
- the complete workflow was: 1) molecular dynamics (MD)
- simulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry
- optimization of *each* active site (active sites consist of one BChl pigment and
- typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of
+ As is the case with many photoactive proteins, computational methods struggle to reproduce
+ experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by
+ `Kim et al. <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that
+ flexible QM/EFP can be applied to FMO to generate computational results in
+ quantitative agreement with experimental spectra.
+
+ The key to applying EFP to your system is to carefully define the active site and EFP region.
+ FMO is a trimeric protein with eight bacteriochlorophyll a (BChl) pigments in each monomer.
+ FMO completes energy transfer via excitonic couplings across these eight BChls. In summary,
+ the complete workflow was: 1) molecular dynamics (MD)
+ simulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry
+ optimization of *each* active site (active sites consist of one BChl pigment and
+ typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of
  each pigment.

- In the case of FMO, these steps must be repeated on several snapshots from MD to account
- for variation in the resting state of the structure, and the QM region must be defined
- carefully in both the QM/MM and flex-EFP stages. A QM/MM geometry optimization may not be
- required for every system. This page is a walkthrough for the flex-EFP procedure
- only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your
- system prior to these steps.
+ In the case of FMO, these steps must be repeated on several snapshots from MD to account
+ for variation in the resting state of the structure, and the QM region must be defined
+ carefully in both the QM/MM and flex-EFP stages. A QM/MM geometry optimization may not be
+ required for every system. This page is a walkthrough for the flex-EFP procedure
+ only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your
+ system prior to these steps.

- .. image:: images/FMO_trimer_BCLs.bmp
+ .. image:: ../images/FMO_trimer_BCLs.bmp
     :width: 350
-
- .. image:: images/FMO_mon_pigs.bmp
+
+ .. image:: ../images/FMO_mon_pigs.bmp
     :width: 400

  You will need a structure file (.g96) and topology information (.top, for atom charges). In this specific case,
- the structure file was extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from
- the protein's surface were removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry
- of each active chlorophyll molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches
- before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359,
+ the structure file was extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from
+ the protein's surface were removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry
+ of each active chlorophyll molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches
+ before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359,
  has been optimized and will be the QM region for the EFP calculation.

- .. image:: images/fmo_waters15a.bmp
+ .. image:: ../images/fmo_waters15a.bmp
     :width: 400

- First, an EFP region must be defined. It consists of every amino acid, (non-QM) BChl, and water molecule containing an
- atom within 15 angstroms of the QM BChl headring.
+ First, an EFP region must be defined. It consists of every amino acid, (non-QM) BChl, and water molecule containing an
+ atom within 15 angstroms of the QM BChl headring.

- The headring is defined by atom names: MG CHA CHB HB CHC HC CHD HD NA C1A
- C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C
- H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D
+ The headring is defined by atom names: MG CHA CHB HB CHC HC CHD HD NA C1A
+ C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C
+ H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D
  CED HED1 HED2 HED3
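
The distance-based selection described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the script used in the original work: the function name ``select_efp_residues`` and the tuple layout ``(resid, resname, atomname, x, y, z)`` are hypothetical, coordinates are assumed to be in angstroms already (a .g96 file stores positions in nanometres, so they would need converting first), and periodic boundary conditions are ignored.

```python
import math


def select_efp_residues(atoms, qm_resid, headring_names, cutoff=15.0):
    """Return sorted ids of residues with any atom within `cutoff` of the QM headring.

    atoms: iterable of (resid, resname, atomname, x, y, z), coordinates in angstroms
           (hypothetical layout, chosen only for this sketch).
    """
    atoms = list(atoms)
    # Reference points: headring atoms of the QM pigment only (the tail is excluded).
    headring = [a[3:] for a in atoms if a[0] == qm_resid and a[2] in headring_names]
    selected = set()
    for resid, _resname, _name, x, y, z in atoms:
        if resid == qm_resid or resid in selected:
            continue
        # One atom inside the cutoff pulls the whole residue into the EFP region.
        if any(math.dist((x, y, z), ref) <= cutoff for ref in headring):
            selected.add(resid)
    return sorted(selected)


# Toy data: residue 359 is the QM pigment; only "MG" is used as a headring atom here.
toy_atoms = [
    (359, "BCL", "MG", 0.0, 0.0, 0.0),
    (359, "BCL", "C1", 30.0, 0.0, 0.0),   # tail atom, not part of the headring
    (10, "ALA", "CA", 5.0, 0.0, 0.0),
    (11, "GLY", "CA", 20.0, 0.0, 0.0),
    (11, "GLY", "N", 14.0, 0.0, 0.0),     # a single atom inside the cutoff is enough
    (500, "SOL", "OW", 40.0, 0.0, 0.0),   # near the tail atom only, so excluded
]
efp_residues = select_efp_residues(toy_atoms, 359, {"MG"})
```

In a real run, ``headring_names`` would be the full atom-name list given above, and the selection would be repeated for each QM pigment and each MD snapshot.
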

  The headring surrounded by EFP region looks like this:

- .. image:: images/tester.bmp
+ .. image:: ../images/tester.bmp
     :width: 400

- EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will each be treated as an individual fragment.
- Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like
- to divide each residue by the C-C backbone bond; however, standard PDB listing convention divides residues by the C-N
- bond. To correct this, 'C' and 'O' atom names should be included in the following amino acid. This way the bond between 'C' and 'CA'
+ EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will each be treated as an individual fragment.
+ Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like
+ to divide each residue by the C-C backbone bond; however, standard PDB listing convention divides residues by the C-N
+ bond. To correct this, 'C' and 'O' atom names should be included in the following amino acid. This way the bond between 'C' and 'CA'
  (carbonyl carbon and alpha carbon, respectively) is the division between bonded fragments.
  See the example below:

- .. image:: images/pdb_67_col.bmp
+ .. image:: ../images/pdb_67_col.bmp
     :width: 400

  For EFP, we would like these two fragments to look like this:

- .. image:: images/efp_67_col.bmp
+ .. image:: ../images/efp_67_col.bmp
     :width: 400

  The desired atoms are contained in the structure file, but they do not completely 'agree' with the amino acid numbering.
- Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O'
+ Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O'
  have to be included in the following fragment.

- .. literalinclude:: ./examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96
+ .. literalinclude:: ../examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96
     :linenos:
     :lines: 79-101
     :emphasize-lines: 10-21
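
The regrouping rule (backbone 'C' and 'O' move into the following fragment) can be expressed as a short Python sketch. The function name ``assign_fragments`` and the ``(resid, atomname)`` input layout are assumptions made for illustration. The sketch also assumes consecutive residue numbering within one chain and does not treat chain termini, where the last residue's 'C' and 'O' have no following residue to join.

```python
def assign_fragments(atoms):
    """Group atoms into EFP fragments cut at the CA-C bond instead of the C-N bond.

    atoms: list of (resid, atomname) in file order (hypothetical layout).
    Returns {fragment_id: [(resid, atomname), ...]}.
    """
    fragments = {}
    for resid, name in atoms:
        # Carbonyl 'C' and 'O' are listed with residue i but belong to fragment i + 1.
        frag_id = resid + 1 if name in ("C", "O") else resid
        fragments.setdefault(frag_id, []).append((resid, name))
    return fragments


# Toy example with two made-up residues, heavy atoms only.
toy_chain = [
    (6, "N"), (6, "CA"), (6, "CB"), (6, "C"), (6, "O"),
    (7, "N"), (7, "CA"), (7, "C"), (7, "O"),
]
toy_fragments = assign_fragments(toy_chain)
```

Here fragment 7 collects the carbonyl group listed under residue 6 together with the 'N' and 'CA' atoms of residue 7, which mirrors the highlighted block in the structure file.
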

- Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient
+ Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient
  to treat BChl fragments as separate head and tail groups as is shown below:

- .. images/efp_headtail.bmp
+ .. ../images/efp_headtail.bmp
     :width: 400
-
- In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is
- missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non-terminal
- amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the
- carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon
- of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA'. One virtual atom will be added to each of the
- head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed
+
+ In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is
+ missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non-terminal
+ amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the
+ carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon
+ of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA'. One virtual atom will be added to each of the
+ head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed
  to the C-H equilibrium bond distance, 1.09 angstroms.
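
Placing a capping hydrogen amounts to rescaling the broken-bond vector to 1.09 angstroms. The following is a minimal sketch of that geometry only; the function name ``cap_position`` and the coordinate tuples are hypothetical, and coordinates are assumed to be in angstroms.

```python
import math

CH_BOND = 1.09  # C-H equilibrium bond distance in angstroms, as stated above


def cap_position(kept_atom, removed_atom, bond_length=CH_BOND):
    """Return coordinates of a virtual hydrogen capping a broken bond.

    kept_atom:    (x, y, z) of the atom that stays in the fragment.
    removed_atom: (x, y, z) of its former bonding partner in the other fragment.
    The hydrogen sits on the line from kept_atom toward removed_atom,
    at `bond_length` from kept_atom.
    """
    vec = [r - k for k, r in zip(kept_atom, removed_atom)]
    norm = math.sqrt(sum(c * c for c in vec))
    if norm == 0.0:
        raise ValueError("kept_atom and removed_atom coincide")
    return tuple(k + bond_length * c / norm for k, c in zip(kept_atom, vec))


# Toy example: a made-up C-C bond of 1.53 angstroms along the z axis.
ca = (1.0, 2.0, 3.0)
c_carbonyl = (1.0, 2.0, 4.53)
h_cap = cap_position(ca, c_carbonyl)
```

Each broken bond gets two such caps, one per fragment, obtained by swapping the roles of the two atoms. For a BChl pigment this means one call with 'C2A' as the kept atom (head fragment) and one with 'CAA' as the kept atom (tail fragment).
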

- .. images/efp_bothvirt.bmp
+ .. ../images/efp_bothvirt.bmp

- Once the system is properly fragmented, we can finally run EFP calculations in the presence of the polarizable, solvatochromic environment.
+ Once the system is properly fragmented, we can finally run EFP calculations in the presence of the polarizable, solvatochromic environment.

  EFP Workflow
  ------------
-