You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+16
Original file line number
Diff line number
Diff line change
@@ -197,6 +197,22 @@ The dependency options are listed below:
197
197
|`.[tools]`| Install dependencies for dedicated tools, such as quality classifiers. |
198
198
|`.[sandbox]`| Install all dependencies for sandbox. |
199
199
200
+
- Install dependencies for specific OPs
201
+
202
+
With the growth of the number of OPs, the dependencies of all OPs becomes very heavy. Instead of using the command `pip install -v -e .[sci]` to install all dependencies,
203
+
we provide two alternative, lighter options:
204
+
205
+
- Automatic Minimal Dependency Installation: During the execution of Data-Juicer, minimal dependencies will be automatically installed. This allows for immediate execution, but may potentially lead to dependency conflicts.
206
+
207
+
- Manual Minimal Dependency Installation: To manually install minimal dependencies tailored to a specific execution configuration, run the following command:
Copy file name to clipboardexpand all lines: configs/config_all.yaml
+5
Original file line number
Diff line number
Diff line change
@@ -341,6 +341,11 @@ process:
341
341
horizontal_flip: false # flip frame image horizontally (left to right).
342
342
vertical_flip: false # flip frame image vertically (top to bottom).
343
343
mem_required: '20GB'# This operation (Op) utilizes deep neural network models that consume a significant amount of memory for computation, hence the system's available memory might constrains the maximum number of processes that can be launched
344
+
- video_extract_frames_mapper: # extract frames from video files according to specified methods
345
+
frame_sampling_method: 'all_keyframes'# sampling method of extracting frame images from the videos. Should be one of ["all_keyframes", "uniform"]. The former one extracts all key frames and the latter one extract specified number of frames uniformly from the video. Default: "all_keyframes".
346
+
frame_num: 3# the number of frames to be extracted uniformly from the video. Only works when frame_sampling_method is "uniform". If it's 1, only the middle frame will be extracted. If it's 2, only the first and the last frames will be extracted. If it's larger than 2, in addition to the first and the last frames, other frames will be extracted uniformly within the video duration.
347
+
duration: 0# The duration of each segment in seconds. If 0, frames are extracted from the entire video. If duration > 0, the video is segmented into multiple segments based on duration, and frames are extracted from each segment.
348
+
frame_dir: None # Output directory to save extracted frames. If None, a default directory based on the video file path is used.
344
349
- video_face_blur_mapper: # blur faces detected in videos
345
350
cv_classifier: ''# OpenCV classifier path for face detection. By default, we will use 'haarcascade_frontalface_alt.xml'.
346
351
blur_type: 'gaussian'# type of blur kernel, including ['mean', 'box', 'gaussian']
0 commit comments