EAGLE support and examples #10925
Conversation
I suggest you put the examples under ipex-llm/python/llm/example/CPU/PyTorch-Models and ipex-llm/python/llm/example/GPU/PyTorch-Models.
python/llm/example/CPU/EAGLE/evaluation/gen_ea_answer_llama2chat.py
python/llm/example/GPU/EAGLE/evaluation/gen_baseline_answer_llama2chat.py
@jason-dai For the EAGLE example placement, which directory is best: ipex-llm/python/llm/example/CPU/PyTorch-Models/EAGLE or ipex-llm/python/llm/example/CPU/Speculative-Decoding/EAGLE? If we put the EAGLE example under ipex-llm/python/llm/example/CPU/Speculative-Decoding, we may need to create a new subdirectory, e.g. ipex-llm/python/llm/example/CPU/Speculative-Decoding/ipex-llm, and move the current speculative examples there.
Maybe
Description
ipex-llm and EAGLE integration plus EAGLE example scripts
1. Why the change?
EAGLE provides a significant additional speedup on top of the ipex-llm optimizations. Measured results:

Llama 7B, temperature=1.0, Intel CPU

| Configuration | Throughput (TPS) | Speedup vs. baseline |
|---|---|---|
| EAGLE + ipex-llm | 27.45 | 2.67x |
| EAGLE only | 20.13 | 1.96x |
| ipex-llm only | 14.55 | 1.42x |
| Baseline (not optimized) | 10.28 | 1.00x |

Llama 7B, temperature=1.0, Intel GPU

| Configuration | Throughput (TPS) | Speedup vs. baseline |
|---|---|---|
| EAGLE + ipex-llm | 60.69 | 3.74x |
| ipex-llm only | 41.41 | 2.55x |
| EAGLE only | 31.48 | 1.94x |
| Baseline (not optimized) | 16.22 | 1.00x |
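The speedup ratios quoted above follow directly from the raw TPS numbers; a quick self-contained check (TPS values copied from the GPU run above):

```python
# Compute speedup ratios relative to the unoptimized baseline.
def speedup_ratios(tps_by_config, baseline_key="baseline"):
    base = tps_by_config[baseline_key]
    return {name: round(tps / base, 2) for name, tps in tps_by_config.items()}

gpu_tps = {
    "eagle+ipex-llm": 60.688,
    "ipex-llm only": 41.413,
    "eagle only": 31.481,
    "baseline": 16.220,
}
print(speedup_ratios(gpu_tps))
# -> {'eagle+ipex-llm': 3.74, 'ipex-llm only': 2.55, 'eagle only': 1.94, 'baseline': 1.0}
```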
2. User API changes
N/A
3. Summary of the change
Integrate EAGLE (https://github.com/SafeAILab/EAGLE) speculative decoding with ipex-llm and provide example scripts for CPU and GPU.
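For context, EAGLE is a speculative-decoding method: a lightweight draft head proposes a block of tokens, the target model verifies them in one forward pass, and the longest agreeing prefix is kept. The toy sketch below illustrates only the greedy verify-and-accept step, with illustrative token IDs; it is not EAGLE's actual API, which lives in the linked repository:

```python
# Toy sketch of the speculative-decoding accept loop that EAGLE builds on:
# the draft model proposes a block of tokens, the target model decodes the
# same positions, and we keep the longest prefix where both agree (greedy
# case). On the first mismatch, the target's token overrides the draft's.
def accept_draft(draft_tokens, target_tokens):
    """Return the accepted prefix plus the target's first correction."""
    accepted = []
    for d, t in zip(draft_tokens, target_tokens):
        if d == t:
            accepted.append(d)
        else:
            accepted.append(t)  # target overrides the first mismatch
            break
    return accepted

# Example: draft guesses 4 tokens, target disagrees at position 2,
# so 2 draft tokens are accepted and the target's correction is appended.
print(accept_draft([5, 9, 7, 3], [5, 9, 2, 3]))  # -> [5, 9, 2]
```

The speedup comes from amortizing one target-model pass over several accepted tokens instead of decoding one token per pass.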
4. How to test?
Please follow the setup instructions and example commands in the README.