Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] main from Zipstack:main #7

Open
wants to merge 82 commits into
base: main
Choose a base branch
from
Open

[pull] main from Zipstack:main #7

wants to merge 82 commits into from

Conversation

pull[bot]
Copy link

@pull pull bot commented Jan 8, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

muhammad-ali-e and others added 3 commits January 8, 2025 10:53
* Renamed worker to runner everywhere

* renamed module

* added runner

* modified gitignore

* Updated redis dependency

* Commit pdm.lock changes

---------

Co-authored-by: ritwik-g <[email protected]>
…1056)

* removed highlight support for json

* Update tooltip message

Co-authored-by: Gayathri <[email protected]>
Signed-off-by: jagadeeswaran-zipstack <[email protected]>

---------

Signed-off-by: jagadeeswaran-zipstack <[email protected]>
Co-authored-by: Gayathri <[email protected]>
@pull pull bot added the ⤵️ pull label Jan 8, 2025
Deepak-Kesavan and others added 26 commits January 9, 2025 09:45
Fixed missing keys in highlight data

Signed-off-by: Deepak K <[email protected]>
Added missing metrics key for table and record type

Signed-off-by: Deepak <[email protected]>
Co-authored-by: vishnuszipstack <[email protected]>
* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

---------

Co-authored-by: gaya3-zipstack <[email protected]>
Co-authored-by: Hari John Kuriakose <[email protected]>
added checks for client side network issues

Co-authored-by: vishnuszipstack <[email protected]>
* env standadization for backend

* Sample envs for remote storage

* Sample envs for remote storage

---------

Signed-off-by: Gayathri <[email protected]>
Co-authored-by: Gayathri <[email protected]>
Co-authored-by: gayathrivijayakumar <[email protected]>
* Fix: Issue 705. Handling white space in path

* Fixed env merge issue

---------

Signed-off-by: Ritwik G <[email protected]>
Co-authored-by: Hari John Kuriakose <[email protected]>
Co-authored-by: vishnuszipstack <[email protected]>
* added support for image,docx

* added support for image,docx BE

* added doc to pdf converter

* Removed unwanted changes 

Signed-off-by: jagadeeswaran-zipstack <[email protected]>

* code refactor

* removed unwanted logs and code refactor

* removed unwanted code

* removed unwanted code

* removed plugin related logic

* code refactor

* added types

* code refactored

* merge conflict fix

---------

Signed-off-by: jagadeeswaran-zipstack <[email protected]>
Co-authored-by: vishnuszipstack <[email protected]>
fixed extract not found  issue
Fixing regression for remote storage file view

Co-authored-by: Gayathri <[email protected]>
Pass ENV from backend to tool

Signed-off-by: Deepak <[email protected]>
* Changes for line-item extraction prompt type

Signed-off-by: Deepak <[email protected]>

* Removed commented out code

Signed-off-by: Deepak <[email protected]>

* Minor fix

Signed-off-by: Deepak <[email protected]>

* Minor improvements

Signed-off-by: Deepak <[email protected]>

* [FEAT] FS APIs for Line item extractor (#1060)

* FS APIs for Line item extractor

* Optimizing if-else branch

* Minor fix

Signed-off-by: Deepak <[email protected]>

---------

Signed-off-by: Deepak <[email protected]>
Signed-off-by: Deepak K <[email protected]>
Co-authored-by: harini-venkataraman <[email protected]>
…s not required. (#1066)

* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

* Init fs correctly when remote storage support not required

---------

Co-authored-by: gaya3-zipstack <[email protected]>
* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

* Init fs correctly when remote storage support not required

* classifier tool roll

---------

Co-authored-by: gaya3-zipstack <[email protected]>
* passing file execution id from backend to tools

* bumped sdk version

---------

Signed-off-by: ali <[email protected]>
…ribute (#1069)

* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

* Init fs correctly when remote storage support not required

* classifier tool roll

* Fix error in classifier

* Fix version

* Correct text-extractor tool version

* Revert

---------

Co-authored-by: gaya3-zipstack <[email protected]>
Revert "passing file execution id from backend to tools (#1065)"

This reverts commit 650c328.
* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

* Init fs correctly when remote storage support not required

* classifier tool roll

* Fix error in classifier

* Fix version

* Correct text-extractor tool version

* Revert

* Roll classifier version

---------

Co-authored-by: gaya3-zipstack <[email protected]>
* Using run-platform.sh script for build test workflow

Signed-off-by: Ritwik G <[email protected]>

* Removed the build step and corrected working directory

Signed-off-by: Ritwik G <[email protected]>

* Update ci-container-build.yaml

Signed-off-by: Ritwik G <[email protected]>

* Test workflow dispatch

Signed-off-by: Ritwik G <[email protected]>

---------

Signed-off-by: Ritwik G <[email protected]>
Signed-off-by: Ritwik G <[email protected]>
* Roll to latest SDK

* Commit pdm.lock changes

* Remove pandoc and tessaract

* Revert change

* Roll tool versions

* Roll tool versions

* Roll version

* Commit pdm.lock changes

* Remove reote storage tool registry

* Use ENvHelper for env standardisation

* Commit pdm.lock changes

* Minor improvement

* Indexing and env fixes

* Indexing and env fixes

* Correcting sample env

* Roll SDK version

* SDK version roll

* Add sample env

* Add sample env

* Lock file check in

* Init fs correctly when remote storage support not required

* classifier tool roll

* Fix error in classifier

* Fix version

* Correct text-extractor tool version

* Revert

* Roll classifier version

* Read from local

* Read from local

---------

Co-authored-by: gaya3-zipstack <[email protected]>
removed tooltip for highlight not supported

Co-authored-by: Gayathri <[email protected]>
* changed read length to full length

* added todo

* code refactor

---------

Co-authored-by: Gayathri <[email protected]>
* Update unstract-sdk version to 0.55.0rc2

* updated llama-index version

* Updated llama-index version in core toml

* Created the lock files in local

* Merge branch 'main' into feat/llama-index-version-update

* Merge branch 'main' into feat/llama-index-version-update

* Updated lock files manually

* Generate new set of lock files for another version of pdm

* Updated the versions for the tools in properties.json

* Updated tool version for text extractor in public_tools.json

---------

Signed-off-by: Praveen Kumar <[email protected]>
Co-authored-by: Gayathri <[email protected]>
* passing file execution id from backend to tools

* bumped sdk version

* Bumped toolversion in properties

---------

Signed-off-by: ali <[email protected]>
tahierhussain and others added 30 commits February 4, 2025 15:36
* Fixed issue in disabling the side menu items

* Allow access to the '/pricing' page only for admins
* fixed page goes blank issue

* fixed page goes blank issue

---------

Co-authored-by: Deepak K <[email protected]>
* Support for subscription usage

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Review comments update

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* pre-commit fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix for duplicate container while retrying executions

* hotfix: UN-2107 MIME type validation for large files (#1116)

fix: Output type mismatch error handling, mime type resolution for large files

---------

Co-authored-by: ali-zipstack <[email protected]>
Co-authored-by: ali <[email protected]>
…1119)

Directly use the 'isAdmin' attribute from the 'sessionDetails' global state
* Multi-doc upload

* Update backend/utils/file_storage/helpers/prompt_studio_file_helper.py

Co-authored-by: Hari John Kuriakose <[email protected]>
Signed-off-by: Gayathri <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Gayathri <[email protected]>
Co-authored-by: Hari John Kuriakose <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Version bump SDK 0.57.0rc2

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Commit pdm.lock changes

* Update platform-service/pyproject.toml

Signed-off-by: Praveen Kumar <[email protected]>

* Update prompt-service/pyproject.toml

Signed-off-by: Praveen Kumar <[email protected]>

* Update prompt-service/pyproject.toml

Signed-off-by: Praveen Kumar <[email protected]>

* Commit pdm.lock changes

* version bump

* Commit pdm.lock changes

---------

Signed-off-by: Praveen Kumar <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Praveen Kumar <[email protected]>
Co-authored-by: pk-zipstack <[email protected]>
Co-authored-by: Hari John Kuriakose <[email protected]>
Passing down execution source
…#1123)

Set the user details state in the finally block

Co-authored-by: Gayathri <[email protected]>
* feat: API to list executions and filter by entity and date range with pagination, ordering

* Moved execution app into workflow_manager, added API to list file executions for an execution with latest logs, schema migration for indexes on some columns

---------

Co-authored-by: Gayathri <[email protected]>
…1127)

* Pass right env

* Roll strcture tool version

* Roll strcture tool version
… and top menu items (#1130)

Rearranged the sidemenu items and handle disabling of side and top menu items
* Updated unstract-sdk's version and tools version

* Commit pdm.lock changes

* Update backend/pyproject.toml

Co-authored-by: Chandrasekharan M <[email protected]>
Signed-off-by: Praveen Kumar <[email protected]>

* Update sample.env

Signed-off-by: Praveen Kumar <[email protected]>

* Update pyproject.toml

Signed-off-by: Praveen Kumar <[email protected]>

* Updated properties.json and pyproject.toml for prompt-service

* Updated pdm.lock files

* Update unstract-sdk's versiont o 0.58.0rc1

* Updated unstract-sdk's version to 0.57.0rc4

* Updated pdm.lock to latest unstract-sdk version

---------

Signed-off-by: Praveen Kumar <[email protected]>
Co-authored-by: pk-zipstack <[email protected]>
Co-authored-by: Chandrasekharan M <[email protected]>
Co-authored-by: Gayathri <[email protected]>
* * Add issue template config
* Improve readme readability

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: Hari John Kuriakose <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* SDK version roll

* Fix lock resolution

* Commit pdm.lock changes

* Fix lock resolution
* feat: Updated execution time in WF and file execution models

* feat: Added data migration to calculate execution time for executions

* refactor: Removed a redundant file execution status update

* minor: Removed an unnecessary TODO comment

* feat: Added support for total files and processing status in execution (#1131)

* feat: Added field total_files to WF execution model, added successful_files and failed_files fields to execution serializer

* Update backend/workflow_manager/workflow_v2/workflow_helper.py

Signed-off-by: Chandrasekharan M <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* feat: Updated status enum to use TextChoices for 3 models (#1132)

* feat: Updated status field for execution models and FileHistory to correctly obtain and use values from an enum

* minor: Removed unnecessary commented code

* fix: Show latest log while listing file centric execution

* feat: Added execution-detail API

---------

Signed-off-by: Chandrasekharan M <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* minor: Squashed schema migrations for file execution and workflow execution models

---------

Signed-off-by: Chandrasekharan M <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Gayathri <[email protected]>
* refactor: Prompt service dockerfile optimization - cache install of gunicorn and other lib

* fix: Fixed security hotspot of providing permissions to non root user, uncommented hadolint-docker check in precommit

* fix: Using hadolint instead of hadolint-docker to avoid precommit CI issue

* fix: Skipped failing hadolint check for CI

* refactor: Optimized dockerfiles to use caching better

* fix: Resolved minor Sonarcloud issues in Dockerfiles

* docs: Updated reason for chmod in Dockerfiles

* Update docker/dockerfiles/backend.Dockerfile

Co-authored-by: Hari John Kuriakose <[email protected]>
Signed-off-by: Chandrasekharan M <[email protected]>

* minor: Restructured Dockerfile's install commands for readability

---------

Signed-off-by: Chandrasekharan M <[email protected]>
Co-authored-by: Deepak K <[email protected]>
Co-authored-by: Hari John Kuriakose <[email protected]>
Co-authored-by: Gayathri <[email protected]>
* Remove feature flag condition

* Remove feature flag condition

* mode=r missed in read

* Missed out changes in merge

* Missed out changes in merge

* Remove unwanted env

* Changes to rename platform service env var for costing

* Precommit failure fix

* Fix unit tests

* Fix unit tests

* Fix docker test cases

* Change mime read length to use default

* Support remote storage for OSS users (#1136)

* OSS support with remote storage

* Add data sync

* Remove unwanted dependencies
* SDK roll

* Commit pdm.lock changes
Fix folder path

Co-authored-by: Deepak K <[email protected]>
* Implemented changes related to unstract subscription

* Have different home pages for OSS and Cloud

* Renamed the unstract subscription pages and replaced '<></>' with '<Outlet />'

* Updated import path of the TrialDaysInfo component

* Support 'readonly' parameter for all custom RJSF widgets

* Transform the JSON schema for the LLM Whisperer V2 adapter for cloud

* Fixed Eslint issues

* Code quality improvement

* Allow JSON schema transform for LLMW V2 only for paid plan users

* Allow JSON schema transform for LLMW V2 only for paid plan users

* Fix sonar issue

* Implemented the  to structure the API path

* Rearranged the onboard and subscription check APIs

* Fixed related to unstract subscription plugins

* Restrict routing from top navbar if case if subscription expired

* Modified the  to keep the time as optional in the response

* Added a conditional top navbar dropdown link for the pricing page

* Updated the route from /subscription-plans to /pricing

* Integration of the transform formdata function for LLMW Paid Adapter

* Fixed sonar issue

* Code optimization in declaring  variable

* Usage query improvements

---------

Co-authored-by: Tahier Hussain <[email protected]>
Co-authored-by: Tahier Hussain <[email protected]>
Co-authored-by: vishnuszipstack <[email protected]>
…1145)

fix: Added task ID in WF execution model in API deployment POST call itself
* fix: Override GoogleDriveFS upload_file_to_storage to handle existing files

- Added check to delete existing files at destination before upload
- Ensures clean upload without file conflicts
- Maintains original upload functionality through super() call

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Added comment

Signed-off-by: Ritwik G <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update google_drive.py

Signed-off-by: Ritwik G <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Ritwik G <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
fix: Removed assert on task ID NULL, avoided calling shared_task on scheduled run
#1148)

Fix for response as PENDING always when the API deployment doesn't time out

Signed-off-by: Ritwik G <[email protected]>
* SDK Roll over to v0.59.0

* SDK Roll over to v0.59.0

* Adding azure and aws

* Adding dev comments

* Adding dev comments

* Lock files

* Lock files

* Commit pdm.lock changes
…1158)

* UN-2166 Passed optional container name from backend to runner

* Removed tool-sandbox from lockfile automation script - since core is a private lib that's added, the lockfile can't be resolved and pushed

* minor: Addressed review comments for renaming a variable
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.