Posts in the 'Deep Learning/Computer Vision' category

Deep Learning/Computer Vision 6

[ Semantic Segmentation ] Language-driven Semantic Segmentation

(Please note that this is my own interpretation of the paper, not a literal translation.) Boyi Li et al., Language-driven Semantic Segmentation, ICLR (2022) Summary - Trains text and image embeddings, which come from different modalities, so that they share the corresponding semantic class - Shows zero-shot performance competitive with existing fixed-label segmentation methods 1. Introduction (* I reviewed N-shot learning methods in a previous post.) The semantic segmentation task requires a large amount of annotated data..

Deep Learning/Computer Vision 2024.06.10

[ Visual Language Navigation ] Visual Language Maps for Robot Navigation

※ Why I wrote this post: the Visual Language Navigation task keeps attracting attention, so I wanted to summarize one related paper. (Please note that this is my own interpretation of the paper, not a literal translation.) Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard, Visual Language Maps for Robot Navigation, ICRA (2023) Summary - Proposes VLMaps, a spatial map that fuses visual-language features with 3D reconstruction information - Multi-object & Long-horizon goal navigation / Zero-s..

Deep Learning/Computer Vision 2024.06.07

[ Semantic Segmentation ] U-Net: Convolutional Networks for Biomedical Image Segmentation

※ Why I wrote this post: U-Net is a foundational model for the Semantic Segmentation task. For reference, diffusion models (DDPM) adopt the U-Net architecture. (Please note that this is my own interpretation of the paper, not a literal translation.) Olaf Ronneberger, Philipp Fischer, Thomas Brox, U-Net: Convolutional Networks for Biomedical Image Segmentation, MICCAI (2015) Summary - Performs the Semantic Segmentation task using an FCN - With data augmentation, good performance even from a small amount of data ..

Deep Learning/Computer Vision 2023.12.13

[ Human Parsing ] Self-Correction for Human Parsing

※ Why I wrote this post: as of this writing, SCHP is a widely used model for the Human Parsing task. It was also used as a baseline model in SOLIDER-HumanParsing, presented at CVPR 2023. (Please note that this is my own interpretation of the paper, not a literal translation.) Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang, Self-Correction for Human Parsing, CVPR (2019) Summary - Points out the limitations of the earlier CE2P and aims to resolve them → Targeted learning objectives - Raises the problem of existing label noise and self-cor..

Deep Learning/Computer Vision 2023.11.12

[ Semantic Segmentation ] Fully Convolutional Networks for Semantic Segmentation

※ Why I wrote this post: FCN is a foundational model for the Semantic Segmentation task. (Please note that this is my own interpretation of the paper, not a literal translation.) Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully Convolutional Networks for Semantic Segmentation, CVPR (2015) Summary - Performs the Semantic Segmentation task effectively using only convolutional networks - Demonstrates its effectiveness by applying it to existing Vision Recognition models 1. Introduction Convolutional networks..

Deep Learning/Computer Vision 2023.11.11

[ Human Parsing ] Devil in the Details: Towards Accurate Single and Multiple Human Parsing

※ Why I wrote this post: CE2P serves as a baseline model for the Human Parsing task and strongly influenced the Human Parsing models published after it. (Please note that this is my own interpretation of the paper, not a literal translation.) Tao Ruan, Ting Liu, Zilong Huang, Yunchao Wei, Shikui Wei, Yao Zhao, Thomas Huang, Devil in the Details: Towards Accurate Single and Multiple Human Parsing, AAAI (2019) Summary - Feature resolution, global context information, edge datai..

Deep Learning/Computer Vision 2023.11.05

Sin prisa, pero sin pausa. (Without haste, but without pause.)

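I describe myself as a "ttubeogi" (someone who gets around on foot) in two senses. One, I enjoy building my own story steadily, even if slowly. Two, after adding a new story to my career, I like to set off on a solo walking trip.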

