Offline Search Workflow Handling of Bank Directories #5313
kkacanja merged 9 commits into gwastro:master
@kkacanja Make sure you look through the failed small search especially. If it's a transient issue, just rerun it, but if there are failures there you'll need to fix them before we can approve this. Also fix the flake8 issues.
There were issues in how the bank was being passed to hdf_trigger_merge: it was using the original pre-generated bank. I'm looking at how to correct it to use the master bank from pycbc_coinc_bank2hdf, which adds the template hashes.
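A minimal sketch of the kind of consistency check this comment implies: every template in the split banks should be identifiable in the master bank by its hash. The hashing scheme and the names `template_hash` and `missing_from_master` are hypothetical and are not PyCBC's implementation; templates are assumed to be plain tuples of parameters.

```python
import hashlib


def template_hash(params):
    """Hash a template's parameters (hypothetical scheme, not PyCBC's)."""
    # Normalise to floats so that 0 and 0.0 hash identically.
    text = ",".join(repr(float(p)) for p in params)
    return hashlib.sha256(text.encode()).hexdigest()


def missing_from_master(master, split_banks):
    """Return the templates in the split banks that the master bank lacks."""
    master_hashes = {template_hash(t) for t in master}
    return [
        t
        for bank in split_banks
        for t in bank
        if template_hash(t) not in master_hashes
    ]
```

An empty return value means every split-bank template can be matched to the master bank, which is what a merge step keyed on template hashes would need.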
|
@kkacanja This is a great feature to add (especially if using compressed template banks) ... It's much cleaner than doing this via the cache files, and changing file names a lot, as we have been doing for recent LVK runs. A request though: Can this be done without changes to If I also treat Then one still calls Does that make sense? Is there a reason why this doesn't work? |
|
Yea this should be doable, I'll make the changes according to Ian's suggestions. I went the template bank route in hindsight, since it seemed to be the easiest to implement, but I think the split bank route is now the better choice. Thank you for the suggestions! |
a6de9aa to
43a8dc0
|
These changes shouldn't affect any of the current tests. I have tested that the method works for the offline search. Do we want to implement an additional test for using this method? |
|
@kkacanja It's not a bad idea to add a test. We can leave it as an optional pass for now though. It's also the test we can update with the other search improvements we are working on that aren't merged yet, so it should run in parallel to the existing search anyway. It makes sense to start now, so that when those other changes get put in, we have the test updated along with them without removing the standard search configuration.
|
@kkacanja I think this code is now much more straightforward. Can you add a test? Then this should be ready to go. The test can essentially reuse the files from the existing search, and just have a script that calls the code to presplit the bank it generates. |
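As a rough illustration of what "presplitting" a bank means, the sketch below divides a list of templates into contiguous, near-equal chunks. It is an assumption-laden toy: `split_bank` is a hypothetical helper, the templates are plain Python objects rather than HDF bank files, and the test script in this PR may split the bank quite differently.

```python
def split_bank(templates, num_banks):
    """Split templates into num_banks contiguous, near-equal chunks."""
    if num_banks < 1:
        raise ValueError("num_banks must be at least 1")
    base, extra = divmod(len(templates), num_banks)
    chunks = []
    start = 0
    for i in range(num_banks):
        # The first `extra` chunks take one additional template each.
        size = base + (1 if i < extra else 0)
        chunks.append(templates[start:start + size])
        start += size
    return chunks
```

For example, `split_bank(list(range(10)), 3)` yields chunks of sizes 4, 3 and 3, and concatenating the chunks reproduces the original ordering.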
|
Thanks @kkacanja .. I think this looks much more straightforward now as well, and easier to use in other applications. Thanks! |
@@ -0,0 +1,51 @@
import subprocess
import time
@kkacanja Does this need to be a new file? Why not just link to the existing script?
@@ -0,0 +1,44 @@
#!/bin/bash
@kkacanja It might be worth checking which scripts you actually need to modify, and simply linking to the existing ones where you can reuse them directly. We can also duplicate later if something needs to be changed, but it would be good to avoid duplication where possible.
Can you double check? I stripped down the folder to only the bank generation and the new analysis file. Do you want to add in the code to generate the workflow?
This PR allows the offline search workflow to handle pre-split template banks. The user can specify the directories for the injections and the full data path in the config file as
The workflow then uses the banks in that directory directly in the inspiral jobs.
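The sketch below shows one plausible way a workflow could collect pre-split banks from a configured directory before handing them to inspiral jobs. It is not the code merged in this PR: `find_split_banks` and the `*.hdf` pattern are assumptions made for illustration, and the real workflow reads its directories from the config file.

```python
from pathlib import Path


def find_split_banks(bank_dir, pattern="*.hdf"):
    """Return the sorted bank files found directly inside bank_dir."""
    directory = Path(bank_dir)
    if not directory.is_dir():
        raise NotADirectoryError(f"{bank_dir} is not a directory")
    # Sorting gives a deterministic order, so job numbering is reproducible.
    banks = sorted(p for p in directory.glob(pattern) if p.is_file())
    if not banks:
        raise FileNotFoundError(f"no files matching {pattern} in {bank_dir}")
    return banks
```

Failing early on an empty or missing directory keeps a misconfigured path from silently producing a workflow with no inspiral jobs.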