add internvl2 example #12102

ch1y0q · 2024-09-20T07:51:01Z

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

N/A
Unit test: Please manually trigger the PR Validation here by inputting the PR number (e.g., 1234). And paste your action link here once it has been successfully finished.
Application test
Document test
...

5. New dependencies

New Python dependencies
- Dependency1
- Dependency2
- ...
New Java/Scala dependencies and their license
- Dependency1 and license1
- Dependency2 and license2
- ...

rnwang04 · 2024-09-20T07:54:02Z

python/llm/example/GPU/HuggingFace/Multimodal/internvl2/chat.py

+    # When running LLMs on Intel iGPUs for Windows users, we recommend setting `cpu_embedding=True` in the from_pretrained function.
+    # This will allow the memory-intensive embedding layer to utilize the CPU instead of iGPU.
+    model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True,
+                                             load_in_low_bit="sym_int4",


fix the code style

rnwang04 · 2024-09-20T07:55:03Z

python/llm/example/GPU/HuggingFace/Multimodal/internvl2/chat.py

+        "max_new_tokens": 64,
+        "do_sample": False,
+    }
+


Wrap the inference call in a `with torch.inference_mode():` context manager.
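
A minimal sketch of what this suggestion could look like. The `torch.nn.Linear` module and the random input are hypothetical stand-ins for the InternVL2 model and its real inputs, which are not shown in this hunk; only `torch.inference_mode()` itself comes from the comment.

```python
import torch

# Hypothetical stand-in for the loaded model; the real example loads InternVL2.
model = torch.nn.Linear(4, 2)
inputs = torch.randn(1, 4)

# Inside inference_mode, autograd tracking is disabled, so no graph is
# recorded for the forward pass and the result is an inference tensor.
with torch.inference_mode():
    outputs = model(inputs)

print(outputs.requires_grad)   # False
print(outputs.is_inference())  # True
```

In the example script the same pattern would wrap the generation call rather than a bare forward pass.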

rnwang04 · 2024-09-20T07:55:56Z

python/llm/example/GPU/HuggingFace/Multimodal/internvl2/chat.py

+    question = "<image>" + query
+
+    generation_config = {
+        "max_new_tokens": 64,


Make `max_new_tokens` a parameter instead of hard-coding it.
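
One possible way to do this, sketched with `argparse`. The flag name `--n-predict` and the helper `build_generation_config` are assumptions made for illustration and are not taken from the PR; the default of 64 and `do_sample=False` mirror the values in the hunk above.

```python
import argparse


def build_generation_config(argv=None):
    """Parse CLI arguments and return the generation config dict."""
    parser = argparse.ArgumentParser(description="InternVL2 chat example (sketch)")
    # Hypothetical flag name; the default matches the hard-coded value of 64.
    parser.add_argument("--n-predict", type=int, default=64,
                        help="Maximum number of new tokens to generate")
    args = parser.parse_args(argv)
    return {
        "max_new_tokens": args.n_predict,
        "do_sample": False,
    }


# With no arguments the default is used; an explicit flag overrides it.
print(build_generation_config([]))
print(build_generation_config(["--n-predict", "128"]))
```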

rnwang04 · 2024-09-20T07:56:37Z

Add link to https://github.com/intel-analytics/ipex-llm?tab=readme-ov-file#verified-models.

jason-dai · 2024-09-20T08:06:28Z

Add link to intel-analytics/ipex-llm#verified-models.

And here https://github.com/intel-analytics/ipex-llm/blob/main/README.zh-CN.md#%E6%A8%A1%E5%9E%8B%E9%AA%8C%E8%AF%81

rnwang04

LGTM

add internvl2 example

0cff479

rnwang04 reviewed Sep 20, 2024

add to README.md

496cc04

rnwang04 reviewed Sep 20, 2024

ch1y0q added 3 commits September 20, 2024 16:14

update

dc627d2

Merge branch 'main' into npu-internvl2-4b

e4fb954

add link to zh-CN readme

b9ddef1

rnwang04 approved these changes Sep 20, 2024

rnwang04 merged commit 2269768 into intel:main Sep 20, 2024
1 check passed

ch1y0q deleted the npu-internvl2-4b branch September 20, 2024 08:38
