🛒 Okala Database Crawler

A robust PHP-based crawler to extract and save product data from Okala including stores, categories, and product details in structured JSON format.

📦 What It Does

Crawls multiple store pages from Okala
Iterates through multiple category slugs
Downloads and stores:
- Product search result pages
- Product detail.json
- Product features.json
Saves all data in structured folders under /data/
Fully supports UTF-8/Persian characters
Respects existing files to avoid redundant requests

🗂 Folder Structure

data/
├── search/
│   └── {store_id}/{category_slug}/{page}.json
├── product/
│   └── {product_id}/
│       ├── features.json
│       └── {store_id}/detail.json

🚀 Usage

✅ Requirements

PHP 7.4+ with curl and json extensions
Git (for automated commit + push loop)
Internet access

📥 Clone the Repo

git clone https://github.com/BaseMax/okala-database-crawler.git
cd okala-database-crawler

🧪 Run the Crawler

php crawler.php

🔁 Auto Git Push (Optional)

To automatically commit and push updated JSON data every 5 minutes:

crawler.bat

💡 Useful when you're running long crawling jobs and want a backup of progress on GitHub.

🛠 Customization

You can edit the following in crawler.php:

Stores list ($stores)
Categories list ($categories)
Fetch delay (usleep(250_000) for 250ms between requests)

🧼 Features

✅ Automatic file structure and directory creation
✅ Skips already downloaded data (but still verifies products)
✅ Handles self-signed SSL issues via cURL
✅ UTF-8 safe JSON storage (e.g., Persian: فارسی)
✅ Color-coded CLI output for easier tracking

🤝 Contributions

PRs welcome! Please fork the repo and submit your improvements.

📬 Contact

Have questions or ideas? Reach out via GitHub Issues.

📄 License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
crawler.bat		crawler.bat
crawler.php		crawler.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛒 Okala Database Crawler

📦 What It Does

🗂 Folder Structure

🚀 Usage

✅ Requirements

📥 Clone the Repo

🧪 Run the Crawler

🔁 Auto Git Push (Optional)

🛠 Customization

🧼 Features

🤝 Contributions

📬 Contact

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

BaseMax/okala-database-crawler

Folders and files

Latest commit

History

Repository files navigation

🛒 Okala Database Crawler

📦 What It Does

🗂 Folder Structure

🚀 Usage

✅ Requirements

📥 Clone the Repo

🧪 Run the Crawler

🔁 Auto Git Push (Optional)

🛠 Customization

🧼 Features

🤝 Contributions

📬 Contact

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages