
[EPIC][CPU] DT Enablement #15313

Closed

dcaballe opened this issue Oct 26, 2023 · 4 comments

dcaballe (Contributor) commented Oct 26, 2023

Creating this issue to track all the DT compilation, runtime, and performance issues that need a fix before the default enablement. We shouldn't include performance improvements that would be nice to have but do not currently cause a regression relative to the existing non-DT approach.

The issues are sorted by priority:

### Tasks
- [ ] #15061
- [ ] #15692
- [ ] #15242
- [ ] #15349 
- [ ] #15391
- [ ] #15392
- [ ] #15249
dcaballe (Contributor, Author) commented Oct 26, 2023

I'm currently investigating a ~2x slowdown on an internal model when DT is enabled. I'll update the list once I know more!

@hanhanW hanhanW self-assigned this Oct 26, 2023
@dcaballe dcaballe changed the title [CPU] DT Enablement [EPIC][CPU] DT Enablement Oct 30, 2023
dcaballe (Contributor, Author) commented:
I did another round of benchmarking with ToT, reverting the problematic ExpandVector commit. The performance regression of DT against the original CG base went from 10x to 4.9x slower. Not bad :). I'll look at the profiles again, but fixing #15242 should cover most of what remains! Great job everyone, and fantastic coordination and collaboration to address all these issues! Thank you all!

dcaballe (Contributor, Author) commented:
Good news! With @NatashaKnk's fix for the vecmat/matvec DT issue plus @hanhanW's fix for the i1 issue, performance goes from 5x slower to ~17% faster for the i8 version of the model :) (the f32 version still seems to be off, but it's probably a bug with an easy fix). I also see 2-3x improvements over previous DT numbers for LLaMA and Falcon! Awesome work!

Thanks for bearing with me and sorry for the pressure to fix all of this before the default enablement. Green light from me to do so :)

@stellaraccident @jpienaar

stellaraccident (Collaborator) commented:
This is great! I appreciate the high bar, and I'm glad we were able to clear all of the hurdles. Thanks for all of the work on this. Really great to see the across-the-board improvements.
