microsoft / onnxruntime-genai Public

Issues: microsoft/onnxruntime-genai

40 Open 129 Closed

Issues list

Issues running on Ryzen platform:windows

#728 opened Jul 26, 2024 by jarroddavis68

Backward/forward compatibility with different version of ORT bug platform:windows

#727 opened Jul 26, 2024 by skyline75489

Can we run phi-3 vision on mobile model:transformer platform:mobile

#726 opened Jul 26, 2024 by surajat17

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xb2 in position 175: invalid start byte ep:DML platform:windows

#725 opened Jul 26, 2024 by camike

[DML] Test that destroys a generator, tweaks GeneratorParams and then creates another generator throws a KV_Cache exception ep:DML platform:windows

#722 opened Jul 24, 2024 by yuslepukhin

[Feature request] Support for LLama3.1

#721 opened Jul 24, 2024 by nguyenhoangthuan99

Can I inference Phi-3-vision with batch? enhancement

#720 opened Jul 24, 2024 by 2U1

CSharp examples crash on attempt to load model in Cuda mode (Release or Debug) ep:CUDA model:transformer platform:windows

#716 opened Jul 22, 2024 by asmirnov82

Inference with batching is significantly slower than without batching. ep:CUDA

#714 opened Jul 20, 2024 by Jester6136

Nodes are not topologically sorted from models generated by model builder

#707 opened Jul 17, 2024 by BowenBao

Post the strength compare with other deploy tools

#700 opened Jul 16, 2024 by lucasjinreal

Consider removing direct link-time dependency on ORT on Linux/macOS platform:mobile

#693 opened Jul 12, 2024 by skyline75489

Try to build model gemma2, but failed. ep:CUDA

#692 opened Jul 12, 2024 by iwaitu

Using software adapter crashes DirectML helper ep:DML

#688 opened Jul 11, 2024 by skyline75489

How to use Phi3 ONNX model in Triton efficiently? waiting-for-customer

#674 opened Jul 4, 2024 by khaerensml6

[DML] [iGPU] [AMD] [Intel] GPU command exception ep:DML model:transformer

#667 opened Jul 2, 2024 by tjtanaa

GPU driver error when using AMD eGPU via DirectML ep:DML model:transformer platform:windows

#644 opened Jun 25, 2024 by x0wllaar

Performance Regression in DML ep:DML

#641 opened Jun 25, 2024 by contentis

More API parameters could be const

#631 opened Jun 21, 2024 by skottmckay

GPU suspended (887A0005) while running Phi2 example in DML ep:DML

#628 opened Jun 21, 2024 by skyline75489

Fix the README in the nuget package to show C# code

#614 opened Jun 18, 2024 by stephentoub

Is there any example of phi-3 vision model deploy on Android? platform:mobile

#608 opened Jun 14, 2024 by henrywang0314

Memory leak during back-to-back inferences ep:DML model:transformer

#590 opened Jun 10, 2024 by jeremyfowers

Phi3 Vision models feedback and questions

#571 opened Jun 5, 2024 by AshD

Phi-3-Mini fails to execute on long prompts on Intel integrated GPU with DirectML ep:DML

#570 opened Jun 5, 2024 by ofirzaf
