-
Notifications
You must be signed in to change notification settings - Fork 80
Issues: microsoft/onnxruntime-genai
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Backward/forward compatibility with different version of ORT
bug
Something isn't working
platform:windows
#727
opened Jul 26, 2024 by
skyline75489
Can we run phi-3 vision on mobile
model:transformer
platform:mobile
#726
opened Jul 26, 2024 by
surajat17
UnicodeDecodeError: 'utf-8' codec can't byte 0xb2 in position 175: invalid start byte
ep:DML
platform:windows
#725
opened Jul 26, 2024 by
camike
Can I inference Phi-3-vision with batch?
enhancement
New feature or request
#720
opened Jul 24, 2024 by
2U1
CSharp examples crash on attempt to load model in Cuda mode (Release or Debug)
ep:CUDA
model:transformer
platform:windows
#716
opened Jul 22, 2024 by
asmirnov82
Inference with batching is significantly slower than without batching.
ep:CUDA
#714
opened Jul 20, 2024 by
Jester6136
Nodes are not topologically sorted from models generated by model builder
#707
opened Jul 17, 2024 by
BowenBao
Consider removing direct link-time dependency on ORT on Linux/macOS
platform:mobile
#693
opened Jul 12, 2024 by
skyline75489
How to use Phi3 ONNX model in Triton efficiently?
waiting-for-customer
#674
opened Jul 4, 2024 by
khaerensml6
[DML] [iGPU] [AMD] [Intel] GPU command exception
ep:DML
model:transformer
#667
opened Jul 2, 2024 by
tjtanaa
GPU driver error when using AMD eGPU via DirectML
ep:DML
model:transformer
platform:windows
#644
opened Jun 25, 2024 by
x0wllaar
GPU suspended (887A0005) while running Phi2 example in DML
ep:DML
#628
opened Jun 21, 2024 by
skyline75489
is there any exmaple of phi-3 vision model deploy on Android?
platform:mobile
#608
opened Jun 14, 2024 by
henrywang0314
Memory leak during back-to-back inferences
ep:DML
model:transformer
#590
opened Jun 10, 2024 by
jeremyfowers
Phi-3-Mini fails to execute on long prompts on Intel integrated GPU with DirectML
ep:DML
#570
opened Jun 5, 2024 by
ofirzaf
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.