docs: Add comprehensive Rarefaction section to transformation chapter #825

jagadeeshkaruturi11 · 2025-11-23T14:35:42Z

Added new section 12.3 Rarefaction to address issue #823.

Changes include:

Introduction to rarefaction with rarefyAssay() and niter parameter
Subsection on using rarefaction with alpha diversity (addAlpha)
Subsection on using rarefaction with beta diversity (addMDS)
Function comparison explaining differences between:
- addAlpha() vs getAlpha()
- runMDS() vs addMDS()

Includes practical code examples demonstrating iterative rarefaction with niter=100.

Added new section 12.3 Rarefaction to address issue microbiome#823. Changes include: - Introduction to rarefaction with rarefyAssay() and niter parameter - Subsection on using rarefaction with alpha diversity (addAlpha) - Subsection on using rarefaction with beta diversity (addMDS) - Function comparison explaining differences between: * addAlpha() vs getAlpha() * runMDS() vs addMDS() Includes practical code examples demonstrating iterative rarefaction with niter=100.

antagomir

Nice, this could be useful & apologies for delays.

Kindly see the suggestions and we can finalize.

antagomir · 2025-12-04T15:08:45Z

inst/pages/transformation.qmd

+# Perform iterative rarefaction
+tse <- rarefyAssay(
+  tse,
+  method = "subsample",
+  sample = min_reads,
+  niter = 100
+)
+
+# Calculate alpha diversity on rarefied data
+tse <- addAlpha(
+  tse,
+  assay_name = "counts_rarefied",
+  sample = min_reads,
+  niter = 100
+)


addAlpha can be used independently of rarefyAssay.

Hence I am thinking that it might be more clear to show these as two separate operations that can both be feasible but each on their own right. Shall we split this chunk in two parts?

antagomir · 2025-12-04T15:11:02Z

inst/pages/transformation.qmd

+# Perform MDS ordination on rarefied data
+tse <- addMDS(
+  tse,
+  assay_name = "counts_rarefied",


mia changed argument names last year; "assay_name" is deprecated and should be replaced with "assay.type" everywhere

antagomir · 2025-12-04T15:11:53Z

inst/pages/transformation.qmd

+
+```{r}
+#| label: rarefaction-alpha
+#| eval: false


Why not eval: true?

antagomir · 2025-12-04T15:12:02Z

inst/pages/transformation.qmd

+
+```{r}
+#| label: rarefaction-beta
+#| eval: false


Why not eval: true?

antagomir · 2025-12-04T15:13:05Z

inst/pages/transformation.qmd

+# Perform MDS ordination on rarefied data
+tse <- addMDS(
+  tse,
+  assay_name = "counts_rarefied",


If "niter" parameter is used then doesn't it already take care or rarifification i.e. why not use assay.type="counts"?

antagomir · 2025-12-04T15:13:49Z

inst/pages/transformation.qmd

+**`addAlpha()` vs `getAlpha()`**: Both functions calculate alpha diversity indices, but `addAlpha()` stores the results directly into the `colData` of the TreeSummarizedExperiment object, while `getAlpha()` returns the diversity values as a separate vector or matrix. Use `addAlpha()` when you want to keep all data together in one object, and `getAlpha()` when you need the diversity values for immediate use in other calculations.
+


I suggest to explain this earlier, where the rarified alpha diversity analysis is shown.

antagomir · 2025-12-04T15:14:23Z

inst/pages/transformation.qmd

+**`runMDS()` vs `addMDS()`**: The `runMDS()` function calculates multidimensional scaling coordinates and returns them as a separate matrix, whereas `addMDS()` calculates the MDS coordinates and stores them directly into the `reducedDim` slot of the TreeSummarizedExperiment object. Using `addMDS()` is generally preferred as it maintains all results within the same data object, making downstream analyses and visualization more straightforward.
+


You could also comment whether this is available for other ordination functions e.g. runPCA, runNMDS..?

TuomasBorman · 2025-12-05T08:30:31Z

Sorry, I have been too busy lately...

This PR is clashing with #819

The information on rarefaction was already updated but it is not yet rendered in the book.

This PR has useful information, but we should think about the right place for this. Rarefaction is commonly used only in alpha and beta diversity (e.g., addAlpha and addMDS).

This rarefyAssay() approach is not used often as far as I know. If it is so, I am not sure if Transformation chapter is right place for discussing rarefaction.

One option:

Discuss rarefaction is high-level in Transformation chapter (although it is not data transformation). Explain the idea.
Add links to Alpha and beta diversity chapters where rarefaction is discussed in aforementioned context

antagomir · 2025-12-05T10:22:05Z

Rarefaction can be viewed as a form of transformation / normalization.

Hence I think that rarefyAssay() could be explained in the transformation chapter. We can consider whether a practical example is necessary - perhaps mentioning the existence of this function would be enough(?) It might have more use in the future, even if it is currently used less. Then one could briefly mention that by doing this, and then averaging results across multiple rarified replicates has been proposed by Schloss et al.

Then, links could be provided to the alpha and beta diversity sections for more detailed examples on those.

This way we could avoid that the boundaries between section become blurred and contents mixed?

antagomir requested changes Dec 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: Add comprehensive Rarefaction section to transformation chapter #825

docs: Add comprehensive Rarefaction section to transformation chapter #825

Uh oh!

jagadeeshkaruturi11 commented Nov 23, 2025

Uh oh!

antagomir left a comment

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

antagomir Dec 4, 2025

Uh oh!

TuomasBorman commented Dec 5, 2025

Uh oh!

antagomir commented Dec 5, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		`addAlpha()` vs `getAlpha()`: Both functions calculate alpha diversity indices, but `addAlpha()` stores the results directly into the `colData` of the TreeSummarizedExperiment object, while `getAlpha()` returns the diversity values as a separate vector or matrix. Use `addAlpha()` when you want to keep all data together in one object, and `getAlpha()` when you need the diversity values for immediate use in other calculations.

		`runMDS()` vs `addMDS()`: The `runMDS()` function calculates multidimensional scaling coordinates and returns them as a separate matrix, whereas `addMDS()` calculates the MDS coordinates and stores them directly into the `reducedDim` slot of the TreeSummarizedExperiment object. Using `addMDS()` is generally preferred as it maintains all results within the same data object, making downstream analyses and visualization more straightforward.

docs: Add comprehensive Rarefaction section to transformation chapter #825

Are you sure you want to change the base?

docs: Add comprehensive Rarefaction section to transformation chapter #825

Uh oh!

Conversation

jagadeeshkaruturi11 commented Nov 23, 2025

Uh oh!

antagomir left a comment

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

antagomir Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

TuomasBorman commented Dec 5, 2025

Uh oh!

antagomir commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

antagomir commented Dec 5, 2025 •

edited

Loading