
Commit 9b4efaf

Merge branch 'main' into pre/beta
2 parents 386b46a + 68bb34c commit 9b4efaf

5 files changed, 11 additions and 7 deletions

README.md

Lines changed: 6 additions & 4 deletions
@@ -1,4 +1,9 @@
+## 🚀 **Looking for an even faster and simpler way to scrape at scale (only 5 lines of code)? ** Check out our enhanced version at [**ScrapeGraphAI.com**](https://scrapegraphai.com/?utm_source=github&utm_medium=readme&utm_campaign=oss_cta&ut#m_content=top_banner)! 🚀
+
+---
+
 # 🕷️ ScrapeGraphAI: You Only Scrape Once
+
 [English](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/README.md) | [中文](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/chinese.md) | [日本語](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/japanese.md)
 | [한국어](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/korean.md)
 | [Русский](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/russian.md) | [Türkçe](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/turkish.md)
@@ -34,7 +39,7 @@ You can find more informations at the following [link](https://scrapegraphai.com
 - **API**: [Documentation](https://docs.scrapegraphai.com/introduction)
 - **SDKs**: [Python](https://docs.scrapegraphai.com/sdks/python), [Node](https://docs.scrapegraphai.com/sdks/javascript)
 - **LLM Frameworks**: [Langchain](https://docs.scrapegraphai.com/integrations/langchain), [Llama Index](https://docs.scrapegraphai.com/integrations/llamaindex), [Crew.ai](https://docs.scrapegraphai.com/integrations/crewai), [CamelAI](https://github.com/camel-ai/camel)
-- **Low-code Frameworks**: [Pipedream](https://pipedream.com/apps/scrapegraphai), [Bubble](https://bubble.io/plugin/scrapegraphai-1745408893195x213542371433906180), [Zapier](https://zapier.com/apps/scrapegraphai/integrations), [n8n](http://localhost:5001/dashboard)
+- **Low-code Frameworks**: [Pipedream](https://pipedream.com/apps/scrapegraphai), [Bubble](https://bubble.io/plugin/scrapegraphai-1745408893195x213542371433906180), [Zapier](https://zapier.com/apps/scrapegraphai/integrations), [n8n](http://localhost:5001/dashboard), [LangFlow](https://www.langflow.org)
 - **MCP server**: [Link](https://smithery.ai/server/@ScrapeGraphAI/scrapegraph-mcp)

 ## 🚀 Quick install
@@ -192,9 +197,6 @@ The Official API Documentation can be found [here](https://docs.scrapegraphai.co
 <a href="https://scrape.do">
 <img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/scrapedo.png" alt="Stats" style="width: 11%;">
 </a>
-<a href="https://www.scrapeless.com/en?utm_source=github&utm_medium=ads&utm_campaign=scraping&utm_term=scrapegraphai">
-<img src="https://github.com/ScrapeGraphAI/Scrapegraph-ai/blob/main/docs/assets/scrapeless.png" alt="Stats" style="width: 11%;">
-</a>
 </div>

 ## 📈 Telemetry

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
 [project]
 name = "scrapegraphai"

-version = "1.52.0b2"
+version = "1.53.0"

 description = "A web scraping library based on LangChain which uses LLM and direct graph logic to create scraping pipelines."
 authors = [

scrapegraphai/helpers/models_tokens.py

Lines changed: 3 additions & 0 deletions
@@ -179,6 +179,9 @@
     "claude-3-haiku-20240307": 200000,
     "claude-3-5-sonnet-20240620": 200000,
     "claude-3-5-haiku-latest": 200000,
+    "claude-opus-4-20250514": 200000,
+    "claude-sonnet-4-20250514": 200000,
+    "claude-3-7-sonnet-20250219": 200000,
 },
 "bedrock": {
     "anthropic.claude-3-haiku-20240307-v1:0": 200000,

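A minimal sketch of how a per-provider token table like the one in `models_tokens.py` might be queried. Only the three model names and their 200000 limit come from the hunk; the `"anthropic"` provider key, the `context_window` helper, and the fallback value are assumptions made for illustration and are not taken from the library.

```python
# Illustrative excerpt. The "anthropic" provider key is an assumption:
# the hunk does not show which provider block these entries belong to.
MODELS_TOKENS = {
    "anthropic": {
        "claude-opus-4-20250514": 200000,
        "claude-sonnet-4-20250514": 200000,
        "claude-3-7-sonnet-20250219": 200000,
    },
}

DEFAULT_TOKENS = 8192  # hypothetical fallback, not a value from the library


def context_window(provider: str, model: str) -> int:
    """Return the token limit for a model, or the fallback if it is unknown."""
    return MODELS_TOKENS.get(provider, {}).get(model, DEFAULT_TOKENS)


print(context_window("anthropic", "claude-sonnet-4-20250514"))
print(context_window("anthropic", "unlisted-model"))
```

With a table shaped like this, a model that is missing from the table silently falls back to the default, which is one reason entries for new models get added explicitly.
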
scrapegraphai/nodes/fetch_node.py

Lines changed: 1 addition & 1 deletion
@@ -356,7 +356,7 @@ def handle_web_source(self, state, source):
 compressed_document = [
     Document(page_content=parsed_content, metadata={"source": "html file"})
 ]
-state["original_html"] = document
+state["doc"] = document
 state.update(
     {
         self.output[0]: compressed_document,
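
The key rename in this hunk affects any code that reads the fetched document back out of the state. A minimal sketch, assuming the state is a plain `dict`; `read_fetched_document` is a hypothetical helper, not part of the library, and the fallback to the old key only illustrates how a caller might tolerate both versions.

```python
from typing import Any, Optional


def read_fetched_document(state: dict) -> Optional[Any]:
    """Return the fetched document, preferring the new "doc" key."""
    if "doc" in state:
        return state["doc"]
    # Before this commit the node stored the document under "original_html".
    return state.get("original_html")


new_state = {"doc": ["<html>new</html>"]}
old_state = {"original_html": ["<html>old</html>"]}

print(read_fetched_document(new_state))
print(read_fetched_document(old_state))
```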

scrapegraphai/nodes/parse_node.py

Lines changed: 0 additions & 1 deletion
@@ -121,7 +121,6 @@ def execute(self, state: dict) -> dict:

 state.update({self.output[0]: chunks})
 state.update({"parsed_doc": chunks})
-state.update({"content": chunks})

 if self.parse_urls:
     state.update({self.output[1]: link_urls})
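
A small sketch of the state updates that remain after this hunk, written as a stand-alone function so it can be run in isolation. `apply_parse_updates` and the output key names `"parsed"` and `"links"` are hypothetical; only the three remaining `state.update` calls mirror the diff.

```python
def apply_parse_updates(state, output, chunks, link_urls=None, parse_urls=False):
    """Mirror the remaining updates: no "content" key is written any more."""
    state.update({output[0]: chunks})
    state.update({"parsed_doc": chunks})
    if parse_urls:
        state.update({output[1]: link_urls})
    return state


state = apply_parse_updates(
    {},
    output=["parsed", "links"],  # hypothetical output key names
    chunks=["chunk-1", "chunk-2"],
    link_urls=["https://example.com"],
    parse_urls=True,
)
print(sorted(state))
```

Code that previously read `state["content"]` would need to read `state["parsed_doc"]` or the node's first output key instead.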
