Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Np multimapper param starsolo #1172

Merged
merged 35 commits into from
Jan 16, 2024
Merged

Conversation

nikellepetrillo
Copy link
Contributor

Description

Give your PR a concise yet descriptive title.
Please explain the changes you made here.
Explain the motivation for making this change. What existing problem does the pull request solve?
Mention any issues fixed, addressed, or otherwise related to this pull request, including issue numbers or hard links for issues in other repos.
You can delete these instructions once you have written your PR description.


Checklist

If you can answer "yes" to the following items, please add a checkmark next to the appropriate checklist item(s) and notify our WARP documentation team by tagging either @ekiernan or @kayleemathews in a comment on this PR.

  • Did you add inputs, outputs, or tasks to a workflow?
  • Did you modify, delete or move: file paths, file names, input names, output names, or task names?
  • If you made a changelog update, did you update the pipeline version number?

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

@nikellepetrillo nikellepetrillo marked this pull request as ready for review January 11, 2024 15:10
Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

@@ -237,6 +239,12 @@ workflow Optimus {
File gene_metrics = GeneMetrics.gene_metrics
File? cell_calls = RunEmptyDrops.empty_drops_result
File? aligner_metrics = MergeStarOutputs.cell_reads_out
Array[File] soloMultiMappers = STARsoloFastq.matrix_sn_rna
Copy link
Contributor

@ekiernan ekiernan Jan 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this output supposed to be optional?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I dont think so, because we do a touch matrix_sn_rna.mtx in the starsolofastq command, so we will always capture this as an output, even if it is empty

Copy link
Contributor

@kayleemathews kayleemathews left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a couple of comments - if you have time to add some descriptions for the new outputs that would be great. Don't worry too much about the wording, though. I can help with that. Thanks!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link
Contributor

@kayleemathews kayleemathews left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for helping out with those docs Nikelle!

Comment on lines +118 to +122
| multimappers_EM_matrix | UniqueAndMult-EM.mtx | Optional output produced when `soloMultiMappers` is "EM"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Uniform_matrix | UniqueAndMult-Uniform.mtx | Optional output produced when `soloMultiMappers` is "Uniform"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Rescue_matrix | UniqueAndMult-Rescue.mtx | Optional output produced when `soloMultiMappers` is "Rescue"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information. |
| multimappers_PropUnique_matrix | UniqueAndMult-PropUnique.mtx | Optional output produced when `soloMultiMappers` is "PropUnique"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| cell_barcodes_csv | `<cell_csv>` | Optional output produced when `run_cellbender` is "true"; see CellBender [documentation](https://cellbender.readthedocs.io/en/latest/usage/index.html) and [GitHub repository](https://github.com/broadinstitute/CellBender/tree/master) for more information.|
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
| multimappers_EM_matrix | UniqueAndMult-EM.mtx | Optional output produced when `soloMultiMappers` is "EM"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Uniform_matrix | UniqueAndMult-Uniform.mtx | Optional output produced when `soloMultiMappers` is "Uniform"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Rescue_matrix | UniqueAndMult-Rescue.mtx | Optional output produced when `soloMultiMappers` is "Rescue"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information. |
| multimappers_PropUnique_matrix | UniqueAndMult-PropUnique.mtx | Optional output produced when `soloMultiMappers` is "PropUnique"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| cell_barcodes_csv | `<cell_csv>` | Optional output produced when `run_cellbender` is "true"; see CellBender [documentation](https://cellbender.readthedocs.io/en/latest/usage/index.html) and [GitHub repository](https://github.com/broadinstitute/CellBender/tree/master) for more information.|
| multimappers_EM_matrix | `UniqueAndMult-EM.mtx` | Optional output produced when `soloMultiMappers` is "EM"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Uniform_matrix | `UniqueAndMult-Uniform.mtx` | Optional output produced when `soloMultiMappers` is "Uniform"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| multimappers_Rescue_matrix | `UniqueAndMult-Rescue.mtx` | Optional output produced when `soloMultiMappers` is "Rescue"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information. |
| multimappers_PropUnique_matrix | `UniqueAndMult-PropUnique.mtx` | Optional output produced when `soloMultiMappers` is "PropUnique"; see STARsolo [documentation](https://github.com/alexdobin/STAR/blob/master/docs/STARsolo.md#multi-gene-reads) for more information.|
| cell_barcodes_csv | `<cell_csv>` | Optional output produced when `run_cellbender` is "true"; see CellBender [documentation](https://cellbender.readthedocs.io/en/latest/usage/index.html) and [GitHub repository](https://github.com/broadinstitute/CellBender/tree/master) for more information.|

These look great. Thanks Nikelle!

Co-authored-by: Kaylee Mathews <95316074+kayleemathews@users.noreply.github.com>
Copy link

Remember to squash merge!

Copy link

Remember to squash merge!

Copy link
Contributor

@ekiernan ekiernan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

@nikellepetrillo nikellepetrillo merged commit f3e8d6e into develop Jan 16, 2024
3 checks passed
nikellepetrillo added a commit that referenced this pull request Feb 5, 2024
* update paths to input files

* update jg sample map

* Km buildindices docs (#1158)

* add buildindices overview doc and diagram

* Km rnawithumis and ss2 doc updates (#1157)

* update rnawithumis overview

* Update rna-with-umis.methods.md

* Update rna-with-umis.methods.md

* update multi-snSS2 readme

* Update multi_snss2.methods.md

* Update multi_snss2.methods.md

* update multi-snSS2 docs

* update SS2 overview doc

* fix python script link

* Lk pd2448 upstools (#1150)

Added paired-tag wrapper and demultiplexing task

* PD-2435 Test bwa-mem2 step and run Intel distributed BWA-MEM2  (#1147)

* Lk pd2453 add bb tag (#1161)

Added option to incorporate BB tag in BAM and use it in SnapATAC2 software.

* km paired-tag docs (#1165)

* update overview docs

- update pipeline version numbers in Multiome and Optimus Overview docs

* update multi-SS2 overview doc

* Update smart-seq2.methods.md

* update multi-SS2 methods doc

* Update doc_style.md

* add paired-tag overview doc

* Update website/docs/Pipelines/PairedTag_Pipeline/README.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Apply suggestions from LK doc review

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

---------

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Np jprb pd 2353 multiple star rsolo align (#1164)

* integrate multiple soloFeatures

* updating counting_mode definition

* logic

* count exons is true

* count exons is true

* fix the logic

* count exons false

* count exons false

* echos

* echos

* echos

* rearrange logic

* rearrange

* testing

* testing

* testing

* count exons true

* switch counting mode order

* try running in scrna

* clean up

* snrna countexons is true

* snrna countexons is true

* snrna countexons is false

* snrna countexons is true

* snrna countexons is false

* cleaning up

* changelogs

* changelogs

* change cpu_platform to Intel Cascade Lake for sci test input

* change cpu_platform to Intel Cascade Lake for sci test input

* Update pipelines/skylab/smartseq2_single_nucleus_multisample/MultiSampleSmartSeq2SingleNucleus.changelog.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Update pipelines/skylab/multiome/Multiome.changelog.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Update pipelines/skylab/optimus/Optimus.changelog.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Update pipelines/skylab/paired_tag/PairedTag.changelog.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Update pipelines/skylab/slideseq/SlideSeq.changelog.md

Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

---------

Co-authored-by: Juan Pablo Ramos Barroso <jpramosbarroso@gmail.com>
Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* Np update multiome sci test (#1167)

* add summary task

* change cpu_platform to Intel Cascade Lake for sci test input

* change cpu_platform to Intel Cascade Lake for sci test input

* change cpu_platform to Intel Cascade Lake for sci test input

* Update VerifyTasks.wdl

* Update VerifyTasks.wdl

* PD-2422 BICAN_Optimus_2nymxis_Oct_2023 (#1152)

* Np multimapper param starsolo (#1172)

* add summary task

* add multimapper option

* update optimus plumbing for ease of testing

* add echos

* add to test

* remove some echoes

* make mouse snrna json go back to what is in dev

* make mouse snrna json go back to what is in dev

* add as outputs

* typo

* changelogs

* changelogs

* changelogs

* update pipeline docs

* optional output

* optional output

* optional output

* optional output

* docs

* docs

* docs

* Update website/docs/Pipelines/Optimus_Pipeline/README.md

Co-authored-by: Kaylee Mathews <95316074+kayleemathews@users.noreply.github.com>

* remove optional input to tests

---------

Co-authored-by: kayleemathews <kmathews@broadinstitute.org>
Co-authored-by: Kaylee Mathews <95316074+kayleemathews@users.noreply.github.com>

* added exit code to CompareTabix (#1174)

Updated the CompareTabix task in the Verify tasks

* Lk pd 2452 add read length check (#1171)

* adding read2 length and barcode orientation check task

* Lk pd 2464 batch methylome (#1181)

Added scatter and preemptibles to snM3C

* Np edit resources needed for bwa task / add logic to compareBams (#1183)

* add summary task

* get a bigger machine

* more memory

* trying out east1

* go back to central and decrease mem and threads

* try different zones

* make machine smaller

* smaller cpu

* smaller cpu

* more mem

* more mem

* 2000 disk

* more mem compare bams

* more mem compare bams

* more mem compare bams

* no zones

* more mem in comparebams

* more mem in comparebams

* 725000

* 825000

* 725

* max records 3000000

* max records 3000000

* add logic to fail fast if bams differ in size by 200 mb

* Update VerifyTasks.wdl

* PD-2483 (#1182)

* rc-2483

* update changelog

* Update README.md

---------

Co-authored-by: kayleemathews <kmathews@broadinstitute.org>
Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>

* PD-2476: Add task before fastqprocess to find number of splits (#1178)

* removed space (#1184)

Removed space from Verify Tabix task

* Lk pd 2455 pairedtag parsebarcodes (#1186)

Added a task to parse cell barcodes from sample barcodes

* Np move snm3c from beta pipelines  (#1185)

* add summary task

* move CondensedSnm3C.wdl out of beta

* update batch numbers

* add sorting to the compare compressed text files

* changelogs and versions

* revert batch change

* batch change

* batch change

* Km snm3c overview doc (#1179)

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update website/docs/Pipelines/snM3C/README.md

Co-authored-by: Nikelle Petrillo <38223776+nikellepetrillo@users.noreply.github.com>

* Update README.md

* fix num_downstr_bases description

* Update README.md

---------

Co-authored-by: Nikelle Petrillo <38223776+nikellepetrillo@users.noreply.github.com>

* Np fix snm3c test (#1190)

* add summary task

* batch change in test wdl

---------

Co-authored-by: Kaylee Mathews <95316074+kayleemathews@users.noreply.github.com>
Co-authored-by: ekiernan <55763654+ekiernan@users.noreply.github.com>
Co-authored-by: aawdeh <awdeh@broadinstitute.org>
Co-authored-by: Nikelle Petrillo <38223776+nikellepetrillo@users.noreply.github.com>
Co-authored-by: Juan Pablo Ramos Barroso <jpramosbarroso@gmail.com>
Co-authored-by: kayleemathews <kmathews@broadinstitute.org>
Co-authored-by: Robert Sidney Cox III <sidcox@broadinstitute.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants