@@ -46,7 +46,7 @@ Browsertrix Crawler includes a number of additional command-line options, explai
4646The Browsertrix Crawler docker image currently accepts the following parameters:
4747
4848```
49- browsertrix- crawler [options]
49+ crawler [options]
5050
5151Options:
5252 --help Show help [boolean]
@@ -74,18 +74,23 @@ Options:
7474 -c, --collection Collection name to crawl to (replay
7575 will be accessible under this name
7676 in pywb preview)
77- [string] [default: "capture-2021-04-10T04-49-4 "]
77+ [string] [default: "capture-YYYY-MM-DDTHH-MM-SS "]
7878 --headless Run in headless mode, otherwise
7979 start xvfb[boolean] [default: false]
8080 --driver JS driver for the crawler
8181 [string] [default: "/app/defaultDriver.js"]
8282 --generateCDX, --generatecdx, If set, generate index (CDXJ) for
8383 --generateCdx use with pywb after crawl is done
8484 [boolean] [default: false]
85+ --combineWARC, --combinewarc, If set, combine the warcs
86+ --combineWarc [boolean] [default: false]
87+ --rolloverSize If set, declare the rollover size
88+ [number] [default: 1000000000]
8589 --generateWACZ, --generatewacz, If set, generate wacz
8690 --generateWacz [boolean] [default: false]
8791 --logging Logging options for crawler, can
88- include: stats, pywb, behaviors
92+ include: stats, pywb, behaviors,
93+ behaviors-debug
8994 [string] [default: "stats"]
9095 --text If set, extract text to the
9196 pages.jsonl file
0 commit comments