2024.08.14 - #4 - DeepMind table tennis robot, FLUX, CppCon, MoAI, CoLLaVO, Hydra-MDP, NPUs

# Interesting papers

- Yan 2024 - An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion [링크](https://omages.github.io/)
    - Diffusion을 통해서 64 x 64 크기의 '부품 이미지' (Object image)를 만든 후, 이걸 조립하면 object가 된다는 논문.

<img width="945" alt="image" src="https://github.com/user-attachments/assets/a2e9dbdf-1863-465b-a518-dfb1dafc68f9">

- Nakkiran 2024 - Step-by-Step Diffusion: An Elementary Tutorial [링크](https://arxiv.org/abs/2406.08929)
    - Diffusion 튜토리얼
    - AI 한글번역본 [링크](https://github.com/user-attachments/files/16612424/Nakkiran.2024.-.Step-by-Step.Diffusion_.An.Elementary.Tutorial_ko.pdf)


# Industry news

- DeepMind의 탁구 로봇 [논문링크](https://arxiv.org/pdf/2408.03906)
     - 다수의 low level 스킬 컨트롤러 + 1개의 high level 컨트롤러. High level 컨트롤러가, 어떤 스킬을 사용하면 좋을지 선택함. 각각의 low-level 스킬 컨트롤러는 modular policy architecture를 기반으로 학습됨.
     - Zero-shot sim-to-real을 통해 학습함
     - 실시간으로 처음 보는 상대의 스킬에 적응하는 능력을 갖춤

https://github.com/user-attachments/assets/2f6490b7-5adf-4a62-94fb-a6f8af2dc1b3

- Perceptive -> 최초의 fully autonomous 로봇치과 수술 [링크](https://gigazine.net/news/20240801-robot-dentist-human-treatment/) [수술 영상 링크. 조금 징그러워요](https://youtu.be/RXvSiUUO9Z4?si=3ruWoLhIzjOLvGDc)
    - 3D handheld scanner로 치아 스캔 -> 로봇 수술
    - 2 시간 걸릴 작업을 15분만에 수행

<img width="553" alt="image" src="https://github.com/user-attachments/assets/5f6e9e73-be79-4206-a13b-bf95bfb0fb2b">

- FLUX + Runway
    - Flux로 리얼한 얼굴 생성 + Runway로 이미지를 동영상화.
    - 해외에서 1. 미디어 관련으로 쓸 게 많다, 2. 얼굴 관련 데이터셋을 만들 수 있을거라고 인기가 많음  

https://github.com/user-attachments/assets/1b75f544-dea2-4b42-a7e9-57bd4f072e29

- 보스턴 다이나믹스 아틀라스 푸쉬업

https://github.com/user-attachments/assets/f61fb42b-1b1a-49e4-842b-348d50c4274b


# Useful resources

- Cppcon [링크](https://www.youtube.com/@CppCon/playlists)
    - 로봇쪽은 C++ 코딩이 많이 사용됨.
    - C++는 굉장히 어려움... 잘 쓰기 너무너무너무 어려움...
    - Good practices를 배우기 어려움
    - 그래서 고수들이 얘기하는 C++를 보고 배우면 좋음
        - Back to basic 코스를 보고 고급 개념들만 잘 익혀도 잘 짤 수 있음

<img width="1479" alt="image" src="https://github.com/user-attachments/assets/5fcc245d-30a7-4c80-82c9-effeba75543d">



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

2024.08.14 - #4 - DeepMind table tennis robot, FLUX, CppCon, MoAI, CoLLaVO, Hydra-MDP, NPUs #6

Interesting papers

Industry news

Useful resources

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

2024.08.14 - #4 - DeepMind table tennis robot, FLUX, CppCon, MoAI, CoLLaVO, Hydra-MDP, NPUs #6

Description

Interesting papers

Industry news

Useful resources

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions