feat: add tags related to optical duplicates. #813

Status: Open. 4 commits to be merged into base branch master.


yfarjoun
Contributor

feat: add tags related to optical duplicates.

Picard has long been able to add SAM tags that provide information about duplicate templates: their type (Library or Sequencing), the size of the set they belong to, and the query name of the representative template within the set that is not a duplicate. As I look to use these tags in another repository (fgbio), I want to ensure they are generally accepted and well defined.
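
For readers unfamiliar with how such tags appear in a record, here is a minimal Python sketch that pulls duplicate-related optional fields out of one SAM alignment line. It assumes the tag names DT, DS, and DI discussed in this thread. The value types used here (DT:Z, DS:i, DI:Z), the helper names, and the example record are illustrative assumptions, not the definitions proposed in this PR.

```python
def parse_optional_fields(sam_line):
    """Return a dict of the optional TAG:TYPE:VALUE fields of one SAM line."""
    fields = sam_line.rstrip("\n").split("\t")
    tags = {}
    # The first 11 columns of an alignment line are mandatory;
    # everything after them is an optional field.
    for field in fields[11:]:
        tag, typ, value = field.split(":", 2)
        if typ == "i":
            value = int(value)
        elif typ == "f":
            value = float(value)
        tags[tag] = value
    return tags


def duplicate_info(sam_line):
    """Collect the duplicate-related tags (None when a tag is absent)."""
    tags = parse_optional_fields(sam_line)
    return {name: tags.get(name) for name in ("DT", "DS", "DI")}


# Hypothetical record: a read flagged as a duplicate (0x400 set in FLAG),
# with assumed tag types DT:Z (duplicate type), DS:i (set size),
# and DI:Z (representative query name).
example = "\t".join([
    "read2", "1107", "chr1", "100", "60", "4M", "=", "200", "104",
    "ACGT", "IIII", "DT:Z:SQ", "DS:i:3", "DI:Z:read1",
])
info = duplicate_info(example)
print(info)  # {'DT': 'SQ', 'DS': 3, 'DI': 'read1'}
```

A record without these tags yields `None` for each key, which lets a caller distinguish "not tagged" from any tagged value.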


Changed PDFs as of f1f5f1b: SAMtags (diff).


Changed PDFs as of 8bf0cf3: SAMtags (diff).

@jmarshall
Member

I guess these are the tags that are described in MarkDuplicates.java:

https://github.com/broadinstitute/picard/blob/e38a20575cd60667dbe37a8932c5106098218e58/src/main/java/picard/sam/markduplicates/MarkDuplicates.java#L120-L139

This does seem like a worthwhile addition, not least because the Git history suggests they've been in Picard for 8 or 10 years.

DI and DT would need entries in §1.1 too, and those entries, along with the DS one, should explain a bit more what this is about.

@yfarjoun
Contributor Author

I wasn't sure if the §1.1 section was required for all tags or only for those for which there's something interesting to say... but OK. :-)


Changed PDFs as of f3bb8bc: SAMtags (diff).


Changed PDFs as of 080649f: SAMtags (diff).

Use SQ if the duplication occurred during sequencing (e.g., due to imaging error, aka "optical duplicates", or due to
over-aggressive bridge-amp).

\item[GL:f:\tagvalue{score}]
Member


What is GL? Was this line accidentally committed?
