2025.01.22 - #22 - NVIDIA NIM, Cosmos, new Hesai/Robosense LiDAR, MatchAnything, HF smolagents, DeepSeek-R1

# NVIDIA NIM (Neural inference microservices?)

> 개인 GPU 없이, 최신 NVIDIA 모델 및 LLM 모델들을 돌려보세요! NVIDIA 엔지니어가 최적화도 다 해놔서 속도도 HuggingFace보다 훨씬 빠릅니다.

https://build.nvidia.com/explore/discover

## Physical AI

## cosmos-nemotron-34b (Video summarization, Video captioning)

- "Elaborate what the worker is doing, why he would be taking this action and also provide information on what he is wearing."

![Image](https://github.com/user-attachments/assets/b7b66667-7314-4bf5-a5b7-667a2a337871)

- "How many people are in this video?" (틀림...)

<img width="1802" alt="Image" src="https://github.com/user-attachments/assets/24c00466-0b3d-4598-b06b-4d5329076e63" />

## cosmos-1.0-diffusion-7b (Text-to-world, Image-to-world 동영상 생성)

- "A first person view from the perspective from a dog sized robot as it works in a car manufacturing site. The robot has many unfinished cars and engine components nearby. The camera on moving forward. Photorealistic"

![Image](https://github.com/user-attachments/assets/5621766b-e5a8-44dc-82f8-c6e22df8cad7)

- "A first person view from the perspective from a quadrupled robot as it works in a car manufacturing site. The robot has many unfinished cars and engine components nearby. The camera on moving forward. Photorealistic"

![Image](https://github.com/user-attachments/assets/a310077a-3a90-4e77-a400-342c069c133f)

## cosmos-1.0-autoregressive-5b (Video-to-world, Image-to-world 동영상 생성. 짧은 동영상을 길게 만들기)

![Image](https://github.com/user-attachments/assets/8aba1b98-a874-49f5-bf53-a527c1be40a8)

## API list

- 서버 없이 LLM을 돌려보기 딱 좋음
- 기본적으로 NVIDIA 아키텍처에서 필요한 모든 가속이 들어가있음.
   - 엔비디아 엔지니어들에게 들어보니, 이걸 전부 다 손으로 만든다고 😭 

<img width="1846" alt="Image" src="https://github.com/user-attachments/assets/523d868e-6ed6-4fdd-9ff7-e98d35dea47c" />

- Qwen2.5-coder:32b-Instruct

![Image](https://github.com/user-attachments/assets/ab3f3484-b927-4a79-9aec-b22832a25348)

## Blueprints

<img width="1846" alt="Image" src="https://github.com/user-attachments/assets/37792192-e560-4f69-908b-9ec07ada7ecd" />

## Price

- 개발자 프로그램 참여하면 무료로 API 무제한 사용 된다고 들었던거 같은데... 확인 필요
- 스타트업이라면 NVIDIA Inception 프로그램 참가하면 API 크레딧 엄청 많이 준다고 알고있음.

<img width="838" alt="Image" src="https://github.com/user-attachments/assets/53317db5-1515-4156-a79d-5b69bba9af04" />

---

# LLM

## 한국말 잘하는 LLM

> 챗봇 만들때 굿

- 마이크로소프트 Phi4 (14B)
- 알리바바 Qwen 2.5 (7B, 14B, 32B, 70B)
- 구글 Gemma2 (9B, 27B)

## 코딩할 때 좋은 LLM

- DeepSeek DeepSeek-R1-distilled-Qwen2.5 (32B) 아키텍처 용으로 최고.
- 알리바바 Qwen 2.5-coder (32B 이상)
- 마이크로소프트 Phi4 (14B) 범용성 최고
- Llama 3.2 (3B) - auto-complete용 모델로 속도 빠름

## DeepSeek-R1

- OpenAI o1 급 성능?
- MIT 라이센스 (상업적 사용 가능)
- 성능 굿 👍👍
- QwQ 보다 좋음. 
- 질문 제대로 안하면 무한루프에 빠져들음
- 이전 채팅 기록에 이어서 질문하는거 잘 안됨
- <think> 태그 필터링 필요
- 알고리즘/자료구조 문제 기가막히게 잘풀음
- 중국 역사 질문 피함

![Image](https://github.com/user-attachments/assets/59d52eb9-a65e-4838-9992-7f402c865d8c)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

2025.01.22 - #22 - NVIDIA NIM, Cosmos, new Hesai/Robosense LiDAR, MatchAnything, HF smolagents, DeepSeek-R1 #24

NVIDIA NIM (Neural inference microservices?)

Physical AI

cosmos-nemotron-34b (�Video summarization, Video captioning)

cosmos-1.0-diffusion-7b (Text-to-world, Image-to-world 동영상 생성)

cosmos-1.0-autoregressive-5b (Video-to-world, Image-to-world 동영상 생성. 짧은 동영상을 길게 만들기)

API list

Blueprints

Price

LLM

한국말 잘하는 LLM

코딩할 때 좋은 LLM

DeepSeek-R1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

2025.01.22 - #22 - NVIDIA NIM, Cosmos, new Hesai/Robosense LiDAR, MatchAnything, HF smolagents, DeepSeek-R1 #24

Description

NVIDIA NIM (Neural inference microservices?)

Physical AI

cosmos-nemotron-34b (�Video summarization, Video captioning)

cosmos-1.0-diffusion-7b (Text-to-world, Image-to-world 동영상 생성)

cosmos-1.0-autoregressive-5b (Video-to-world, Image-to-world 동영상 생성. 짧은 동영상을 길게 만들기)

API list

Blueprints

Price

LLM

한국말 잘하는 LLM

코딩할 때 좋은 LLM

DeepSeek-R1

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions