docs: reflect the recent `SessionPool` changes in a guide by barjin · Pull Request #3724 · apify/crawlee

barjin · 2026-06-08T14:22:22Z

Adds recent changes to the SessionPool and related classes to the existing guides.

Closes #796

janbuchar

Many of the comments are not blocking — just stuff that popped into my head when reading this.

janbuchar · 2026-06-09T08:49:17Z

-These are the basics of configuring SessionPool.
-Please, bear in mind that a Session pool needs time to find working IPs and build up the pool,
-so we will probably see a lot of errors until it becomes stabilized.
+These are the basics of configuring the session pool. Bear in mind that the pool needs time to find working IPs and build itself up, so you will probably see a number of errors until it stabilizes.


Is this actually true? Also, it fits better in the avoid_blocking guide IMO.

janbuchar · 2026-06-09T08:51:22Z

+
+You influence this with three methods on the session. <ApiLink to="core/class/Session#markGood">`markGood()`</ApiLink> records a successful use — it increments the usage count and heals the error score a little (by `errorScoreDecrement`, default `0.5`). <ApiLink to="core/class/Session#markBad">`markBad()`</ApiLink> records a failure that *might* be the session's fault and *might* just be bad luck — it raises the error score by one, so a session needs to fail repeatedly before it is dropped. <ApiLink to="core/class/Session#retire">`retire()`</ApiLink> drops the session immediately and permanently; this is what you call when you are certain the identity itself is burnt (for example, a `403` response).
+
+The distinction between `markBad()` and `retire()` matters. Use `markBad()` for transient, external problems such as a timeout or a `5XX` response — the IP is probably fine and a couple of retries should not throw it away. Use `retire()` for problems that prove the session is blocked, where reusing it is pointless. Note that in v4 retirement is terminal: once a session is retired, a later `markGood()` will not bring it back.


After we rename markBad and markGood, this paragraph will become obsolete.

That is, if we manage to come up with anything better. Even then, imo practice (i.e., repeating this in docs) makes perfect.

e.g. Session.recordFailure and Session.recordSuccess imo breaks the idea of Session as a simple data struct and hints at some deeper tallying logic. Some other ideas like .succeeded() and .failed() don't really follow our naming conventions for methods.

Let's discuss this further under #3663, but perhaps we don't need this breaking change after all (it started as our hunch anyway, not from an outside user).

janbuchar · 2026-06-09T09:00:56Z

+await sessionPool.addSession({ id: 'cheap', proxyInfo: proxyInfoFromUrl('http://cheap-proxy.com') });
+await sessionPool.addSession({ id: 'premium', proxyInfo: proxyInfoFromUrl('http://expensive-proxy.com') });


And what if I want multiple cheap sessions for example, so that I can alternate between them? 🙂

It's possible, by creating multiple cheap-[xyz] sessions, keeping track of these ids and reassigning the request.sessionId with a getCheapSession() helper (that cycles between the cheap session ids).

I'd argue that we should describe the main principles in the docs (assigning sessions to requests), but any more complex usage of these should be left as an excercise for the reader, as it doesn't really add that much information. Wdyt?

docs: reflect the recent SessionPool changes in a guide

5096cbb

janbuchar self-requested a review June 8, 2026 15:01

janbuchar reviewed Jun 9, 2026

View reviewed changes

barjin added 2 commits June 9, 2026 11:26

chore: fix broken link

fa33949

chore: touch up docs

d2acbfd

barjin mentioned this pull request Jun 9, 2026

Investigate Crawlee v4 Session.fingerprint performance #3726

Open

docs: polish the guides

1880f0f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: reflect the recent `SessionPool` changes in a guide#3724

docs: reflect the recent `SessionPool` changes in a guide#3724
barjin wants to merge 4 commits into
v4from
docs/session-pool-guide

barjin commented Jun 8, 2026

Uh oh!

janbuchar left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

janbuchar Jun 9, 2026

Uh oh!

janbuchar Jun 9, 2026

Uh oh!

barjin Jun 11, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

janbuchar Jun 9, 2026

Uh oh!

barjin Jun 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		You influence this with three methods on the session. <ApiLink to="core/class/Session#markGood">`markGood()`</ApiLink> records a successful use — it increments the usage count and heals the error score a little (by `errorScoreDecrement`, default `0.5`). <ApiLink to="core/class/Session#markBad">`markBad()`</ApiLink> records a failure that might be the session's fault and might just be bad luck — it raises the error score by one, so a session needs to fail repeatedly before it is dropped. <ApiLink to="core/class/Session#retire">`retire()`</ApiLink> drops the session immediately and permanently; this is what you call when you are certain the identity itself is burnt (for example, a `403` response).

		The distinction between `markBad()` and `retire()` matters. Use `markBad()` for transient, external problems such as a timeout or a `5XX` response — the IP is probably fine and a couple of retries should not throw it away. Use `retire()` for problems that prove the session is blocked, where reusing it is pointless. Note that in v4 retirement is terminal: once a session is retired, a later `markGood()` will not bring it back.

		await sessionPool.addSession({ id: 'cheap', proxyInfo: proxyInfoFromUrl('http://cheap-proxy.com') });
		await sessionPool.addSession({ id: 'premium', proxyInfo: proxyInfoFromUrl('http://expensive-proxy.com') });

Conversation

barjin commented Jun 8, 2026

Uh oh!

janbuchar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

janbuchar Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

janbuchar Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

janbuchar Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

barjin Jun 11, 2026 •

edited

Loading

barjin Jun 11, 2026 •

edited

Loading