LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models (ICCV 2023) 논문 읽기
tensorflow의 object-detection을 활용하여 MobileNet-SSD 학습시키기
BLIP : Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation (2022, arXiv) paper review
depth estimation with stereo camera
ROS message_filter를 활용하여 서로 다른 토픽 메시지 동기화하기
git clone 하다가 생긴 permission error를 해결하자
A Survey of Transformers (2021) 논문 스터디 - (1)
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information (2024, arxiv) 읽어보기
[CV] real-time human detection, face recognition
EfficientNetv2과 image classification