-
-
Notifications
You must be signed in to change notification settings - Fork 29
Issues: InftyAI/llmaz
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
fix: add ut for backend runtime.
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#428
opened May 22, 2025 by
X1aoZEOuO
Loading…
feat: support runai streamer for vllm
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#423
opened May 19, 2025 by
cr7258
Loading…
[Umbrella] Metrics Aggregator Implementation
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#421
opened May 17, 2025 by
kerthcet
2 tasks
Enable envoy token rate limiting by configuration
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#412
opened May 14, 2025 by
kerthcet
1 of 3 tasks
Add T/$ as indicator to measure the cost efficiency
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#401
opened May 6, 2025 by
kerthcet
3 tasks
Manage the ai gateway resources automatically
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#387
opened Apr 30, 2025 by
kerthcet
3 tasks done
open-webui support namespaces
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#386
opened Apr 29, 2025 by
kerthcet
3 tasks
[Umbrella] grafana support with inference engines
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#385
opened Apr 29, 2025 by
kerthcet
1 of 7 tasks
chore: add ci to test deploy with helm
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#377
opened Apr 24, 2025 by
liangyuanpeng
Loading…
[Umbrella] advanced traffic load balancing algorithms
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#376
opened Apr 24, 2025 by
kerthcet
2 of 7 tasks
[Umbrella] inference engine metrics installation
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Envoy gateway plugin support with random selection
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#371
opened Apr 23, 2025 by
kerthcet
3 tasks done
[OSPP] KEDA-based Serverless Elastic Scaling for llmaz
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#362
opened Apr 22, 2025 by
pacoxu
3 tasks
[OSPP]Enabling Efficient Model and Container Image Distribution in LLMaz with Dragonfly
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#361
opened Apr 22, 2025 by
pacoxu
3 tasks
Support runai model streamer for fast model loading
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Can this initcontainer image be configurable?
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#350
opened Apr 17, 2025 by
pacoxu
after backend runtime update, should we re-create the playground pod?
bug
Categorizes issue or PR as related to a bug.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#332
opened Mar 26, 2025 by
pacoxu
Proposal for LoRA autoscaler
approved
Indicates a PR has been approved by an approver from all required OWNERS files.
do-not-merge/hold
Indicates that a PR should not merge because someone has issued a /hold command.
do-not-merge/needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#313
opened Mar 13, 2025 by
kerthcet
Loading…
Lora Autoscaler
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Add popular open source models as in-tree support
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Milestone v0.2.0
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#259
opened Jan 27, 2025 by
kerthcet
2 tasks
vllm only has /health endpoint
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#241
opened Jan 17, 2025 by
kerthcet
3 tasks
Able to set the toRender parameters dynamically
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#239
opened Jan 16, 2025 by
kerthcet
1 of 3 tasks
Unify the chat api for all inference servers
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#218
opened Dec 10, 2024 by
kerthcet
3 tasks
Serverless support
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-04-22.