Skip to content

Commit 128bfb9

Browse files
rarhin711rarhin711
authored andcommitted
Merge branch 'main' of https://github.com/libefp2/libefp2.github.io into richy
adding new changes to gallery
2 parents 127adda + d36ff75 commit 128bfb9

1,049 files changed

Lines changed: 1723199 additions & 37154 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.DS_Store

2 KB
Binary file not shown.
Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
name: documentation
2+
3+
on: [push, pull_request, workflow_dispatch]
4+
5+
permissions:
6+
contents: write
7+
8+
jobs:
9+
docs:
10+
runs-on: ubuntu-latest
11+
steps:
12+
- uses: actions/checkout@v4
13+
- uses: actions/setup-python@v5
14+
- name: Install dependencies
15+
run: |
16+
pip install sphinx sphinx_rtd_theme myst_parser
17+
- name: Sphinx build
18+
run: |
19+
sphinx-build doc _build
20+
- name: Deploy to GitHub Pages
21+
uses: peaceiris/actions-gh-pages@v3
22+
if: ${{ github.event_name == 'push' && github.ref == 'refs/heads/main' }}
23+
with:
24+
publish_branch: gh-pages
25+
github_token: ${{ secrets.GITHUB_TOKEN }}
26+
publish_dir: _build/
27+
force_orphan: true

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
_build

Untitled

Lines changed: 0 additions & 9 deletions
This file was deleted.
File renamed without changes.

doc/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+

applied_efp.rst renamed to doc/applied_efp.rst

Lines changed: 52 additions & 53 deletions
Original file line numberDiff line numberDiff line change
@@ -7,99 +7,98 @@ FMO: Applied flexible EFP
77
Overview
88
--------
99

10-
As is the case with many photoactive proteins,computational methods struggle to reproduce
11-
experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by
12-
`Kim et al <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that
13-
flexible QM/EFP can be applied to FMO to correctly generate computational results in
14-
quantitative agreement to experimental spectra.
15-
16-
The key to applying EFP to your system is to carefully define the active site and EFP region.
17-
FMO is a trimeric protein with eight bacteriochloropyll a (BChl) pigments in each monomer.
18-
FMO completes energy transfer via excitonic couplings across these eight BChls. A summary
19-
of the complete workflow that was performed is the following: 1) molecular dynamics (MD)
20-
simiulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry
21-
optimization of *each* active site (active sites consist of one BChl pigment and
22-
typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of
10+
As is the case with many photoactive proteins,computational methods struggle to reproduce
11+
experimental spectra for the Fenna-Matthews-Olson complex (FMO). Work by
12+
`Kim et al <https://pubs.acs.org/doi/full/10.1021/acs.jpclett.9b03486>`_ shows that
13+
flexible QM/EFP can be applied to FMO to correctly generate computational results in
14+
quantitative agreement to experimental spectra.
15+
16+
The key to applying EFP to your system is to carefully define the active site and EFP region.
17+
FMO is a trimeric protein with eight bacteriochloropyll a (BChl) pigments in each monomer.
18+
FMO completes energy transfer via excitonic couplings across these eight BChls. A summary
19+
of the complete workflow that was performed is the following: 1) molecular dynamics (MD)
20+
simiulations of the FMO protein in water and counter ions, 2) QM/MM (not EFP) geometry
21+
optimization of *each* active site (active sites consist of one BChl pigment and
22+
typically 3 H-bonding amino acids), and 3) flex-EFP excited state energy calculations of
2323
each pigment.
2424

25-
In the case of FMO, these steps must be repeated on several snapshots from MD to account
26-
for variation in the resting state of the structure, and the QM region must be defined
27-
carefully in both the QM/MM and flex-EFP stages. It might not be universally true that one
28-
must perform QM/MM geometry optimization. This page is a walkthrough for the flex-EFP procedure
29-
only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your
30-
system prior to these steps.
25+
In the case of FMO, these steps must be repeated on several snapshots from MD to account
26+
for variation in the resting state of the structure, and the QM region must be defined
27+
carefully in both the QM/MM and flex-EFP stages. It might not be universally true that one
28+
must perform QM/MM geometry optimization. This page is a walkthrough for the flex-EFP procedure
29+
only. Molecular dynamics and QM/MM optimizations are assumed to be complete for your
30+
system prior to these steps.
3131

32-
.. image:: images/FMO_trimer_BCLs.bmp
32+
.. image:: ../images/FMO_trimer_BCLs.bmp
3333
:width: 350
34-
35-
.. image:: images/FMO_mon_pigs.bmp
34+
35+
.. image:: ../images/FMO_mon_pigs.bmp
3636
:width: 400
3737

3838
You will need a structure file (.g96) and topology information (.top, for atom charges). In this specific case,
39-
a structure file is extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from
40-
the protein's surface have been removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry
41-
of each active chlorophyl molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches
42-
before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359,
39+
a structure file is extracted from a GROMACS molecular dynamics trajectory and all water molecules more than 15 angstroms from
40+
the protein's surface have been removed. For a chlorophyll-containing protein, you will likely want to optimize the geometry
41+
of each active chlorophyl molecule (with very close amino acids/water molecules) separately with more standard QM/MM approaches
42+
before proceeding with EFP calculations on the optimized geometry. For this example, the first BChl, residue number 359,
4343
has been optimized and will be the QM region for the EFP calcuation.
4444

45-
.. image:: images/fmo_waters15a.bmp
45+
.. image:: ../images/fmo_waters15a.bmp
4646
:width: 400
4747

48-
First, an EFP region must be defined. Every amino acid, (non QM) BChl, and water molecule containing an
49-
atom within 15 angstroms of the QM BChl headring.
48+
First, an EFP region must be defined. Every amino acid, (non QM) BChl, and water molecule containing an
49+
atom within 15 angstroms of the QM BChl headring.
5050

51-
The headring is defined by atomnames: MG CHA CHB HB CHC HC CHD HD NA C1A
52-
C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C
53-
H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D
51+
The headring is defined by atomnames: MG CHA CHB HB CHC HC CHD HD NA C1A
52+
C2A H2A C3A H3A C4A CMA HMA1 HMA2 HMA3 NB C1B C2B C3B C4B CMB HMB1 HMB2 HMB3 CAB OBB CBB HBB1 HBB2 HBB3 NC C1C C2C H2C C3C
53+
H3C C4C CMC HMC1 HMC2 HMC3 CAC HAC1 HAC2 CBC HBC1 HBC2 HBC3 ND C1D C2D C3D C4D CMD HMD1 HMD2 HMD3 CAD OBD CBD HBD CGD O1D O2D
5454
CED HED1 HED2 HED3
5555

5656
The headring surrounded by EFP region looks like this:
5757

58-
.. image:: images/tester.bmp
58+
.. image:: ../images/tester.bmp
5959
:width: 400
6060

61-
EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will be expressed individually.
62-
Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like
63-
to divide each residue by the C-C backbone bond, however, standard PDB listing convention divides residues by the C-N
64-
bond. To correct this, 'C' and 'O' atom names should be included in the following aminoc acid. This way the 'C' and 'CA'
61+
EFP is, of course, a fragmentation method. The protein residues within the 15 angstrom cutoff will be expressed individually.
62+
Because amino acids are a continuous chain, we will need to break each residue into its own fragment. Chemically, we would like
63+
to divide each residue by the C-C backbone bond, however, standard PDB listing convention divides residues by the C-N
64+
bond. To correct this, 'C' and 'O' atom names should be included in the following aminoc acid. This way the 'C' and 'CA'
6565
(carbonyl carbon and alpha carbon respectively) bond is the division between bonded fragments.
6666
See the example below:
6767

68-
.. image:: images/pdb_67_col.bmp
68+
.. image:: ../images/pdb_67_col.bmp
6969
:width: 400
7070

7171
For EFP, we would like these two fragments to look like this:
7272

73-
.. image:: images/efp_67_col.bmp
73+
.. image:: ../images/efp_67_col.bmp
7474
:width: 400
7575

7676
The desired atoms are contained in the structure file, but they do not completely 'agree' with the amino acid numbering.
77-
Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O'
77+
Below is a snippet from the structure file with the desired EFP fragment 8 highlighted. Note that atom names 'C' and 'O'
7878
have to be included in the following fragment.
7979

80-
.. literalinclude:: ./examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96
80+
.. literalinclude:: ../examples/flex-EFP/1.Prepare_Structure/bchl359-50028.g96
8181
:linenos:
8282
:lines: 79-101
8383
:emphasize-lines: 10-21
8484

85-
Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient
85+
Next, the BChl molecules are closer than the 15 angstrom cutoff, so they also appear in the EFP region. It is more cost efficient
8686
to treat BChl fragments as separate head and tail groups as is shown below:
8787

88-
.. images/efp_headtail.bmp
88+
.. ../images/efp_headtail.bmp
8989
:width: 400
90-
91-
In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is
92-
missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non terminal
93-
amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the
94-
carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon
95-
of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA.' One virtual atom will be added to both
96-
head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed
90+
91+
In the case of both amino acid and BChl fragments, we have at least one broken bond; we cannot simply compute a fragment that is
92+
missing an atomic bond. To solve this, we will introduce virtual hydrogen atoms to 'cap' the broken bonds. Non terminal
93+
amino acid fragments will have two virtual atoms. The first is between the alpha carbon of the previous residue and the
94+
carbonyl carbon of the current residue; the other is similarly between the alpha carbon of the current residue and the carbonyl carbon
95+
of the following residue. The BChl fragments are split between atoms 'C2A' and 'CAA.' One virtual atom will be added to both
96+
head and tail fragments between these atoms. Virtual atoms are added along the vector of the broken bonds with the distance changed
9797
to the C-H equilibrium bond distance, 1.09 angstroms.
9898

99-
.. images/efp_bothvirt.bmp
99+
.. ../images/efp_bothvirt.bmp
100100
101-
Once the system is properly fragmented, we can finally run EFP calculations in the precense of the polarizable, solvatochromic environment.
101+
Once the system is properly fragmented, we can finally run EFP calculations in the precense of the polarizable, solvatochromic environment.
102102

103103
EFP Workflow
104104
------------
105-

bioefp.rst renamed to doc/bioefp.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ BioEFP
77
BioEFP is a scheme to split a macromolecule into fragments.
88
Proteins can be split along C-Ca bonds, as shown below.
99

10-
.. image:: images/BioEFP.jpg
10+
.. image:: ../images/BioEFP.jpg
1111
:width: 200
1212

1313
The model is described `here <https://pubs.acs.org/doi/abs/10.1021/acs.jpcb.6b04166>`_.
File renamed without changes.

efp_calcs.rst renamed to doc/efp_calcs.rst

Lines changed: 36 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -42,12 +42,12 @@ Two options exist for accounting for charge-penetration contribution to electros
4242
energy:
4343

4444
- exponential screening ("smearing") of charges. This is achieved by invoking screening parameters
45-
defined in ``SCREEN2`` section of the `.efp` potential (see :ref:`SCREEN`). The charge-penetration energy
46-
is not printed separately but included in the electrostatic energy.
45+
defined in ``SCREEN2`` section of the `.efp` potential (see :ref:`SCREEN`). The charge-penetration energy
46+
is not printed separately but included in the electrostatic energy.
4747
- overlap-based screening. This is a separate energy term derived assuming that localized orbitals
48-
can be modeled as spherical gaussions (the same approximation is used in the exchange-repulsion term).
49-
This calculation will utilize exchange-repulsion parameters (:ref:`FOCK`, :ref:`WF`, :ref:`BASIS`, :ref:`LMOC`).
50-
This overlap-based charge-penetration energy is printed as a separate energy term (see examples in :ref:`libefp`).
48+
can be modeled as spherical gaussions (the same approximation is used in the exchange-repulsion term).
49+
This calculation will utilize exchange-repulsion parameters (:ref:`FOCK`, :ref:`WF`, :ref:`BASIS`, :ref:`LMOC`).
50+
This overlap-based charge-penetration energy is printed as a separate energy term (see examples in :ref:`libefp`).
5151

5252
Detailed description of damping functions and their benchmarks are published in
5353
`Damping functions for electrostatic term <http://dx.doi.org/10.1002/jcc.20520>`_
@@ -85,12 +85,39 @@ the `short-range damping functions paper <http://dx.doi.org/10.1080/002689708027
8585
by an optional :ref:`POLAB` keyword; smaller values provide stronger screening of polarization energies which might be necessary for fragments
8686
with large multiple moments (charged or strongly polar species) or large polarizabilities (e.g., large conjugated/aromatic molecules).
8787

88+
.. _disp_energy:
89+
8890
Dispersion term
8991
^^^^^^^^^^^^^^^
9092

9193
Dispersion energy term captures the London interaction between the molecules. Formally, it can be expanded in
92-
series of (1/R) operator as $$E_{disp} = \frac{C_6}{R^6} + \frac{C_8}{R^8} + \frac{C_{10}}{R^{10}} + ...$$.
94+
series of (1/R) operator as :math:`E_{disp} = \frac{C_6}{R^6} + \frac{C_8}{R^8} + \frac{C_{10}}{R^{10}} + ....`
9395
In the case of distributed approach where dispersin contributions are computed as a sum of contributions due to
94-
individual parts of a molecules, the odd terms $\frac{C_7}{R^7}$, $\frac{C_9}{R^9}$ etc are also non-zero.
95-
The dispersion energy in LibEFP is computed as the first $\frac{C_6}{R^6}$ in the dispersion expansion, using
96-
distributed dynamic polarizabilities for evaluating $C_6$ coefficient, i.e.,
96+
individual parts of a molecules, the odd terms :math:`\frac{C_7}{R^7}, \frac{C_9}{R^9}` etc are also non-zero.
97+
98+
The relevant sections of the EFP potential are:
99+
100+
- :ref:`DYN_POINT`
101+
102+
:ref:`DYN_POINT` group section provides coordinates and values of anisotropic dynamic polarizability tensors for computing dispersion energy.
103+
104+
.. _ex_rep:
105+
106+
Exchange Repulsion
107+
^^^^^^^^^^^^^^^^^^
108+
109+
Exchange repulsion accounts for the antisymmetry of the wave function of the fragments.It is modelled using inter-fragment kinetic and
110+
overlap integrals, and the Fock matrices of the fragment.
111+
112+
The relevant sections of the EFP potential are:
113+
114+
- :ref:`BASIS`
115+
- :ref:`MULTIPLICITY`
116+
- :ref:`WF`
117+
- :ref:`FOCK`
118+
- :ref:`LMOC`
119+
120+
:ref:`BASIS` provides details of the basis set used for calculation of the exchange repulsion energy, :ref:`MULTIPLICITY` contains information
121+
on the multiplicity of the fragment (LibEFP works only on fragments with multiplicity 1), :ref:`WF` provides the localized wave function of the
122+
fragment, while :ref:`FOCK` and :ref:`LMOC` contain information regarding the elements of the Fock matrix of the fragment in the localized basis, and
123+
the coordinates of the localized molecular orbital, respectively.

0 commit comments

Comments
 (0)