Skip to content

Commit 5bd961f

Browse files
authored
Merge pull request #7 from libefp2/new_jack
fixing my conflicting files
2 parents 4e81745 + d69ea23 commit 5bd961f

12 files changed

Lines changed: 106 additions & 0 deletions

applied_efp.rst

Lines changed: 105 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,105 @@
1+
.. _FlexEFP_fmo.rst:
2+
3+
*************************
4+
FMO: Applied flexible EFP
5+
*************************
6+
7+
Overview
8+
--------
9+
10+
As is the case with many photoactive proteins,computational methods struggle to reproduce
11+
experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by
12+
`Kim et al <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that
13+
flexible QM/EFP can be applied to FMO to correctly generate computational results in
14+
quantitative agreement to experimental spectra.
15+
16+
The key to applying EFP to your system is to carefully define the active site and EFP region.
17+
FMO is a trimeric protein with eight bacteriochloropyll a (BChl) pigments in each monomer.
18+
FMO completes energy transfer via excitonic couplings across these eight BChls. A summary
19+
of the complete workflow that was performed is the following: 1) molecular dynamics (MD)
20+
simiulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry
21+
optimization of *each* active site (active sites consist of one BChl pigment and
22+
typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of
23+
each pigment.
24+
25+
In the case of FMO, these steps must be repeated on several snapshots from MD to account
26+
for variation in the resting state of the structure, and the QM region must be defined
27+
carefully in both the QM/MM and flex-EFP stages. It might not be universally true that one
28+
must perform QM/MM geometry optimization. This page is a walkthrough for the flex-EFP procedure
29+
only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your
30+
system prior to these steps.
31+
32+
.. image:: images/FMO_trimer_BCLs.bmp
33+
:width: 350
34+
35+
.. image:: images/FMO_mon_pigs.bmp
36+
:width: 400
37+
38+
You will need a structure file (.g96) and topology information (.top, for atom charges). In this specific case,
39+
a structure file is extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from
40+
the protein's surface have been removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry
41+
of each active chlorophyl molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches
42+
before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359,
43+
has been optimized and will be the QM region for the EFP calcuation.
44+
45+
.. image:: images/fmo_waters15a.bmp
46+
:width: 400
47+
48+
First, an EFP region must be defined. Every amino acid, (non QM) BChl, and water molecule containing an
49+
atom within 15 angstroms of the QM BChl headring.
50+
51+
The headring is defined by atomnames: MG CHA CHB HB CHC HC CHD HD NA C1A
52+
C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C
53+
H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D
54+
CED HED1 HED2 HED3
55+
56+
The headring surrounded by EFP region looks like this:
57+
58+
.. image:: images/tester.bmp
59+
:width: 400
60+
61+
EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will be expressed individually.
62+
Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like
63+
to divide each residue by the C-C backbone bond, however, standard PDB listing convention divides residues by the C-N
64+
bond. To correct this, 'C' and 'O' atom names should be included in the following aminoc acid. This way the 'C' and 'CA'
65+
(carbonyl carbon and alpha carbon respectively) bond is the division between bonded fragments.
66+
See the example below:
67+
68+
.. image:: images/pdb_67_col.bmp
69+
:width: 400
70+
71+
For EFP, we would like these two fragments to look like this:
72+
73+
.. image:: images/efp_67_col.bmp
74+
:width: 400
75+
76+
The desired atoms are contained in the structure file, but they do not completely 'agree' with the amino acid numbering.
77+
Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O'
78+
have to be included in the following fragment.
79+
80+
.. literalinclude:: ./examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96
81+
:linenos:
82+
:lines: 79-101
83+
:emphasize-lines: 10-21
84+
85+
Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient
86+
to treat BChl fragments as separate head and tail groups as is shown below:
87+
88+
.. images/efp_headtail.bmp
89+
:width: 400
90+
91+
In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is
92+
missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non terminal
93+
amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the
94+
carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon
95+
of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA.' One virtual atom will be added to both
96+
head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed
97+
to the C-H equilibrium bond distance, 1.09 angstroms.
98+
99+
.. images/efp_bothvirt.bmp
100+
101+
Once the system is properly fragmented, we can finally run EFP calculations in the precense of the polarizable, solvatochromic environment.
102+
103+
EFP Workflow
104+
------------
105+

images/FMO_mon_pigs.bmp

4.74 MB
Binary file not shown.

images/FMO_trimer_BCLs.bmp

3.22 MB
Binary file not shown.

images/efp_67_col.bmp

1.1 MB
Binary file not shown.

images/efp_6and7.bmp

414 KB
Binary file not shown.

images/efp_bothvirt.bmp

2.97 MB
Binary file not shown.

images/efp_headtail.bmp

2.1 MB
Binary file not shown.

images/fmo_waters15a.bmp

2.7 MB
Binary file not shown.

images/pdb_67_col.bmp

1.23 MB
Binary file not shown.

images/pdb_6and7.bmp

1.21 MB
Binary file not shown.

0 commit comments

Comments
 (0)