README.md: 5 additions & 5 deletions
@@ -8,7 +8,7 @@ Docker-only FastAPI service that uses [Botasaurus](https://github.com/omkarcloud
 - `GET /health`
 - `POST /scrape`
 - Intended usage: run and test through Docker only.
-- Runtime boundary: async FastAPI handler delegates sync browser work to a bounded threadpool (`SCRAPE_MAX_WORKERS`), with a per-request timeout (`SCRAPE_TIMEOUT_SECONDS`).
+- Runtime boundary: async FastAPI handler delegates sync browser work to a bounded threadpool (`SCRAPE_MAX_WORKERS`, default `4`), with a per-request timeout (`SCRAPE_TIMEOUT_SECONDS`, default `25`).
 - On-demand isolation-first runtime: every scrape request runs with an ephemeral browser profile and request-scoped runtime dir, then gets fully cleaned up.
 
 ## Prerequisites
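
The runtime boundary changed in the hunk above (async handler, bounded threadpool, per-request timeout) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the service's actual code: `blocking_scrape` and `scrape_with_timeout` are hypothetical names, and only the two environment variables and their defaults (`4` and `25`) come from the README.

```python
import asyncio
import os
import time
from concurrent.futures import ThreadPoolExecutor

# Defaults as documented in the README; the rest of this sketch is assumed.
MAX_WORKERS = int(os.getenv("SCRAPE_MAX_WORKERS", "4"))
TIMEOUT_SECONDS = float(os.getenv("SCRAPE_TIMEOUT_SECONDS", "25"))

# Bounded pool: at most MAX_WORKERS blocking jobs run at the same time.
_executor = ThreadPoolExecutor(max_workers=MAX_WORKERS)


def blocking_scrape(url: str, delay: float = 0.0) -> dict:
    """Hypothetical stand-in for the synchronous browser work."""
    time.sleep(delay)
    return {"url": url, "ok": True}


async def scrape_with_timeout(
    url: str, delay: float = 0.0, timeout: float = TIMEOUT_SECONDS
) -> dict:
    """Run the blocking job in the pool and stop waiting after `timeout` seconds."""
    loop = asyncio.get_running_loop()
    future = loop.run_in_executor(_executor, blocking_scrape, url, delay)
    try:
        return await asyncio.wait_for(future, timeout=timeout)
    except asyncio.TimeoutError:
        # The worker thread is not interrupted; only the caller stops waiting.
        return {"url": url, "ok": False, "error": "timeout"}
```

In a FastAPI app, an `async def` route would `await scrape_with_timeout(...)`, which keeps the event loop free while a worker thread is busy. Note that a timed-out thread still occupies a pool slot until it finishes, which is one reason a real service would also need its own cleanup step.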
@@ -105,7 +105,7 @@ Request body (full options):
 "max_retries": 2,
 "wait_for_selector": "h1",
 "wait_timeout_seconds": 15,
-"block_images": false,
+"block_images": true,
 "block_images_and_css": false,
 "wait_for_complete_page_load": true,
 "user_agent": "Mozilla/5.0 ...",
@@ -125,8 +125,8 @@ Request options (contract):
 - `google_get_bypass`: only `google_get(bypass_cloudflare=true)`
 - `max_retries`: `0..3`, default `2` (attempts = `1 + max_retries`, with `auto` capped by 3 strategy steps).
 - `wait_for_selector`: if set, response waits for selector before capture.
-- `wait_timeout_seconds`: selector wait timeout (capped by service timeout).
-- `block_images`: pass image blocking to driver.
+- `wait_timeout_seconds`: selector wait timeout (default `15`, capped by service timeout).
+- `block_images`: pass image blocking to driver. Default `true`.
 
 Currently accepted passthrough options (implemented, not part of stable request-options contract):
 
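
The option defaults and caps documented in the hunk above could be modelled along these lines. This is a hedged sketch, not the service's request model: `ScrapeOptions`, `effective_wait_timeout`, and `attempts` are hypothetical names, and the handling of `auto` (capping total attempts at 3) is one reading of "capped by 3 strategy steps".

```python
from dataclasses import dataclass
from typing import Optional

SERVICE_TIMEOUT_SECONDS = 25  # README default for SCRAPE_TIMEOUT_SECONDS
AUTO_STRATEGY_STEPS = 3       # assumed meaning of "capped by 3 strategy steps"


@dataclass
class ScrapeOptions:
    """Hypothetical model of the documented request options."""

    max_retries: int = 2
    wait_for_selector: Optional[str] = None
    wait_timeout_seconds: float = 15
    block_images: bool = True

    def __post_init__(self) -> None:
        # The README documents the allowed range as 0..3.
        if not 0 <= self.max_retries <= 3:
            raise ValueError("max_retries must be in 0..3")

    def effective_wait_timeout(
        self, service_timeout: float = SERVICE_TIMEOUT_SECONDS
    ) -> float:
        """Selector wait timeout, capped by the service timeout."""
        return min(self.wait_timeout_seconds, service_timeout)

    def attempts(self, auto: bool = False) -> int:
        """Total attempts = 1 + max_retries; capped when the strategy is auto."""
        total = 1 + self.max_retries
        return min(total, AUTO_STRATEGY_STEPS) if auto else total
```

With the defaults, `ScrapeOptions().attempts()` is `3`, and a requested `wait_timeout_seconds` of `60` is reduced to the `25`-second service timeout.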
@@ -255,7 +255,7 @@ curl -s -X POST http://localhost:4010/scrape \