scene-understanding

Here are 104 public repositories matching this topic...

zchoi / Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

agent awesome navigation manipulator-robotics planning-algorithms scene-understanding embodied-agent embodied-ai large-language-model

Updated Sep 23, 2024

yinyunie / 3D-Shape-Analysis-Paper-List

Star

A list of recent papers, libraries and datasets about 3D shape/scene analysis (by topics, updating).

3d-reconstruction shape-analysis 3d-representation shape-completion 3d-detection scene-understanding scene-reconstruction

Updated Dec 5, 2023
Python

xiaoyufenfei / Efficient-Segmentation-Networks

Star

Lightweight models for real-time semantic segmentationon PyTorch (include SQNet, LinkNet, SegNet, UNet, ENet, ERFNet, EDANet, ESPNet, ESPNetv2, LEDNet, ESNet, FSSNet, CGNet, DABNet, Fast-SCNN, ContextNet, FPENet, etc.)

computer-vision pytorch neural-networks segmentation image-segmentation semantic-segmentation cityscapes scene-understanding semantic-segmentation-models camvid real-time-semantic-segmentation efficient-segmentation-networks lightweight-semantic-segmentation driving-scene-understanding

Updated Jul 25, 2024
Python

SimonVandenhende / Multi-Task-Learning-PyTorch

Star

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

pascal computer-vision pytorch segmentation multi-task-learning scene-understanding eccv2020 nyud

Updated Jan 13, 2022
Python

bertjiazheng / awesome-scene-understanding

Star

😎 A list of awesome scene understanding papers.

awesome computer-vision deep-learning indoor-scenes 3d-scene scene-understanding

Updated Nov 8, 2024

NVlabs / FB-BEV

Star

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

deep-learning autonomous-driving scene-understanding 3d-object-detection 3d-perception nuscenes bev-perception 3d-occupancy-prediction

Updated Jan 12, 2024
Python

ZhaoJ9014 / Multi-Human-Parsing

Star

🔥🔥Official Repository for Multi-Human-Parsing (MHP)🔥🔥

semantic parsing detection annotations segmentation nus evaluation-metric human-parsing instance-segmentation scene-understanding multi-human-parsing group-behavior-analysis human-centric-analysis mhp

Updated Dec 9, 2021
JavaScript

bertjiazheng / Structured3D

Star

[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

computer-vision deep-learning computer-graphics annotations dataset 3d-reconstruction eccv scene-understanding room-layout structure-annotations house-designs

Updated Jan 9, 2024
Python

Jingkang50 / OpenPSG

Star

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

scene-graph scene-understanding scene-graph-generation

Updated Apr 10, 2023
Python

GAP-LAB-CUHK-SZ / Total3DUnderstanding

Star

Implementation of CVPR'20 Oral: Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

pytorch scene-understanding scene-reconstruction cvpr2020

Updated Apr 11, 2024
Python

Yangzhangcst / RGBD-semantic-segmentation

Star

A paper list of RGBD semantic segmentation (processing)

awesome image-segmentation rgbd semantic-segmentation indoor-scenes scene-understanding rgbd-segmentation rgbd-images

Updated Oct 7, 2023

prismformore / Multi-Task-Transformer

Star

Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Pyramid Multi-task Transformer for Dense Scene Understanding"

pascal computer-vision deep-learning segmentation human-parsing depth-estimation cityscapes multi-task-learning scene-understanding nyudv2 eccv2022 cityscapes-3d

Updated Apr 24, 2024
Python

vinthony / ghost-free-shadow-removal

Star

[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

deep-learning tensorflow data-augmentation scene-understanding shadow-removal

Updated Dec 12, 2023
Jupyter Notebook

Open3DA / LL3DA

Star

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

gpt language-model multi-modal 3d 3d-models scene-understanding llm instruction-tuning cvpr2024 3d-to-text