-
Notifications
You must be signed in to change notification settings - Fork 403
Issues: SJTU-IPADS/PowerInfer
Meta: Implementing hybrid inference across key desktop platforms
#92
opened Dec 27, 2023 by
hodlen
Open
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
统计predictor的overhead
question
Further information is requested
#220
opened Sep 16, 2024 by
guanchenl
3 tasks done
Help! Want a toy example to run matmul with q40 weight by cuda kernel
question
Further information is requested
#219
opened Sep 11, 2024 by
Eutenacity
CUDA toolkit version?
question
Further information is requested
#218
opened Sep 6, 2024 by
shujiehan
Am i doing something wrong?
question
Further information is requested
#216
opened Aug 28, 2024 by
RealMrCactus
3 tasks done
Some question about Fig4.
question
Further information is requested
#213
opened Jul 23, 2024 by
rhmaaa
我要如何获得预测文件呢
question
Further information is requested
#211
opened Jul 15, 2024 by
LDLINGLINGLING
3 tasks
Feature request : Support for PHI3 mini
enhancement
New feature or request
#210
opened Jul 14, 2024 by
raymond-infinitecode
3 tasks
请问powerinfer能否兼容llama.cpp的模型呢
question
Further information is requested
#209
opened Jul 5, 2024 by
mailonghua
the output for Q4_gguf is strange again!!
bug-unconfirmed
Unconfirmed bugs
#208
opened Jul 4, 2024 by
milktea888
About powerinfer-2
enhancement
New feature or request
#207
opened Jul 2, 2024 by
Ther-nullptr
3 tasks done
Where is the TurboSparse-Mixtral mlp_predictor?
question
Further information is requested
#203
opened Jun 27, 2024 by
MatthewCroughan
How to convert ProSparse-LLaMA-2-13B model to .gguf?
question
Further information is requested
#201
opened Jun 23, 2024 by
Graysonicc
3 tasks done
Source for v2 (mobile inference engine)
question
Further information is requested
#194
opened Jun 12, 2024 by
peeteeman
Need quite a long time to load the model
question
Further information is requested
#188
opened May 21, 2024 by
meicale
Will this work with Falcon 2?
question
Further information is requested
#186
opened May 14, 2024 by
aaronrmm
关于在A100显卡上测得的效果异常的疑问
question
Further information is requested
#184
opened May 4, 2024 by
bulaikexiansheng
在A100-80G上无法找到cuda的情况
question
Further information is requested
#182
opened Apr 24, 2024 by
bulaikexiansheng
Where is the definition or addition location of GGML_USE_HYBRID_THREADING?
question
Further information is requested
#172
opened Mar 25, 2024 by
wfloveiu
two questions that i want to solve
question
Further information is requested
#167
opened Mar 18, 2024 by
yeptttt
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.