add route payload to deploy Inference Endpoints #3013

Vaibhavs10 · 2025-04-18T12:11:24Z

ref: https://huggingface.slack.com/archives/C016D661PAN/p1744900723918749?thread_ts=1742997838.592509&cid=C016D661PAN

HuggingFaceDocBuilderDev · 2025-04-18T12:15:26Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/huggingface_hub/hf_api.py

Wauplin

nice addition :)

Wauplin · 2025-04-22T08:01:45Z

src/huggingface_hub/hf_api.py

+            domain (`str`, *optional*):
+                The custom domain for the Inference Endpoint (e.g. `"my-new-domain.cool-website.woof"`).
+            path (`str`, *optional*):
+                The custom path for the Inference Endpoint, should start with a `/` (e.g. `"/models/google-bert/bert-base-uncased"`).


What does it means exactly? Is it that the endpoint should be served under endpoints-(...)-url.co/models/google-bert/bert-base-uncased or is it that the URL endpoints-(...)-url.co/ points to the /models/google-bert/bert-base-uncased path of the docker image? (worth some clarification in the docstring IMO)

IIRC @oOraph mentioned this is applicable for anyone with a custom domain registered as well, so I'd consider this to be generic.

Yes, worth having the parameter supported in huggingface_hub! Just wanted to be sure what it meant exactly (The custom path for the Inference Endpoint is not explicit enough IMO)

updating now

updated here: 54572d5

i think it's when you want to expose the resulting endpoint's inference route under a custom path but you can check with @oOraph and @XciD

Wauplin · 2025-04-22T08:06:07Z

src/huggingface_hub/hf_api.py

+                The custom domain for the Inference Endpoint (e.g. `"my-new-domain.cool-website.woof"`).
+            path (`str`, *optional*):
+                The custom path for the Inference Endpoint, should start with a `/` (e.g. `"/models/google-bert/bert-base-uncased"`).
+            cache_http_responses (`bool`, *optional*):


(link this private thread to avoid forgetting about it - if this attribute is HF-only, let's remove it)

Wauplin · 2025-04-22T08:06:51Z

src/huggingface_hub/hf_api.py

+            cache_http_responses (`bool`, *optional*):
+                Whether to cache HTTP responses from the Inference Endpoint. Defaults to `False`.
+            tags (`List[str]`, *optional*):
+                A list of tags to associate with the Inference Endpoint.


Out of curiosity, how do we play with tags once set? (e.g. can we list endpoints based on tags? or is it a UI change?)

yeah! you can list all endpoints deployed with a certain specific tag

…uggingface_hub into vb/upd-inference-endpoints

Wauplin · 2025-04-23T12:31:12Z

CI failure is unrelated, merging it now (code quality is ✔️ though)

tomaarsen · 2025-04-23T12:59:06Z

src/huggingface_hub/hf_api.py

+            cache_http_responses (`bool`, *optional*):
+                Whether to cache HTTP responses from the Inference Endpoint.


So we're okay with keeping this?

yes I think so in the end. It's also publicly documented here: https://api.endpoints.huggingface.cloud/#post-/v2/endpoint/-namespace-. This method is already not much used so if it can make HF's people life easier, let's go for it

Perfect, convenient, thanks!

add route payload to deploy Inference Endpoints.

bba6945

Vaibhavs10 requested review from hanouticelina and Wauplin April 18, 2025 12:12

Vaibhavs10 added 2 commits April 18, 2025 14:25

mirror changes from update to create.

cd3be6a

add tags + cached requests.

ee23208

oOraph reviewed Apr 18, 2025

View reviewed changes

src/huggingface_hub/hf_api.py Outdated Show resolved Hide resolved

src/huggingface_hub/hf_api.py Outdated Show resolved Hide resolved

Vaibhavs10 added 2 commits April 18, 2025 14:41

up + review.

2f342a0

moar up.

0416ee3

oOraph approved these changes Apr 18, 2025

View reviewed changes

Wauplin reviewed Apr 22, 2025

View reviewed changes

Vaibhavs10 added 2 commits April 23, 2025 12:36

moar up.

54572d5

Merge branch 'main' into vb/upd-inference-endpoints

a0f9e9c

Wauplin approved these changes Apr 23, 2025

View reviewed changes

hanouticelina approved these changes Apr 23, 2025

View reviewed changes

Vaibhavs10 added 2 commits April 23, 2025 14:26

reword.

c9dfcb4

Merge branch 'vb/upd-inference-endpoints' of github.com:huggingface/h…

a1e946b

…uggingface_hub into vb/upd-inference-endpoints

Wauplin approved these changes Apr 23, 2025

View reviewed changes

Wauplin merged commit 34bb25d into main Apr 23, 2025
10 of 25 checks passed

Wauplin deleted the vb/upd-inference-endpoints branch April 23, 2025 12:31

tomaarsen reviewed Apr 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add route payload to deploy Inference Endpoints #3013

add route payload to deploy Inference Endpoints #3013

Vaibhavs10 commented Apr 18, 2025

HuggingFaceDocBuilderDev commented Apr 18, 2025

Wauplin left a comment

Wauplin Apr 22, 2025

Vaibhavs10 Apr 23, 2025

Wauplin Apr 23, 2025

Vaibhavs10 Apr 23, 2025

Vaibhavs10 Apr 23, 2025

julien-c Apr 23, 2025

Wauplin Apr 22, 2025

Wauplin Apr 22, 2025

Vaibhavs10 Apr 23, 2025

Wauplin commented Apr 23, 2025

tomaarsen Apr 23, 2025

Wauplin Apr 23, 2025

tomaarsen Apr 23, 2025

		cache_http_responses (`bool`, optional):
		Whether to cache HTTP responses from the Inference Endpoint.

add route payload to deploy Inference Endpoints #3013

add route payload to deploy Inference Endpoints #3013

Conversation

Vaibhavs10 commented Apr 18, 2025

HuggingFaceDocBuilderDev commented Apr 18, 2025

Wauplin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Wauplin commented Apr 23, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment