Skip to content

Releases: roboflow/inference

v0.47.0

18 Apr 11:34
43777b0
Compare
Choose a tag to compare

💪 Added

Depth Anything V2 🤝 inference

Depth estimation turns flat images into 3D information by calculating how far each pixel is from the camera. This adds an essential dimension to computer vision - distance.

Thanks to @reiffd7 we have Depth Anything V2 model in inference and Workflows ecosystem. Here is what the model brings into the table:

  • ✨ Detailed results with sharp objects boundaries
  • 💪 Robust performance across diverse scenes and lighting conditions
  • ⚡ 10x faster processing than similar high-quality models
  • 🪶 Lightweight (yet not available in Roboflow Hosted API, please use Roboflow Dedicated Deployment or deploy locally)

Why use it?

  • Improve object detection with distance information
  • Create 3D visualizations from single images
  • Enable depth-aware applications (AR, robotics)
  • Understand spatial relationships in scenes

Check out HF model card.

✨ New: OverlapFilter Block

@lou-roboflow added a new Overlap Filter block to the Workflows ecosystem.

The Overlap Filter removes objects that do not overlap a selected class. It is useful when you want to focus only on objects that are interacting with specific items.

How it works:

  • You specify an overlap class (e.g., "bicycle").
  • Only objects overlapping instances of that class are kept.
  • The overlap class itself is removed from the results.

Example applications:

  • Detecting people on bicycles (removing bicycles themselves).
  • Identifying items on pallets.
  • Finding passengers in cars, etc.

⚡ Speed-ups (with @codeflash-ai)

🧑‍🏭 Maintanence

🏅 New Contributors

Full Changelog: v0.46.5...v0.47.0

v0.46.5

14 Apr 21:43
cf36001
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.46.4...v0.46.5

v0.46.4

09 Apr 23:50
ccbb459
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.46.3...v0.46.4

v0.46.3

09 Apr 17:51
27d9bd9
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.46.1...v0.46.3

v0.46.1

07 Apr 16:37
73eaec9
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.46.0...v0.46.1

v0.46.0

04 Apr 12:04
d3f0336
Compare
Choose a tag to compare

💪 Added

SmolVLM2 🤝 inference

SmolVLM2, a multimodal vision model developed by Hugging Face, is now available for use in Inference and Workflows. Thanks to its relatively compact size when compared to other multimodal models, SmolVLM2 is capable of running efficiently on less powerful devices, while still maintaining decent performance for a range of multimodal tasks, including VQA, document OCR, document VQA, and object counting.

You can use the model in Workflows:

image

Or you can run it directly with the inference Python package:

from PIL import Image
from inference.models.smolvlm.smolvlm import SmolVLM

smol_vlm = SmolVLM(api_key="API_KEY")
image = Image.open("dog.jpeg")
result = smol_vlm.predict(image, "How many dogs are in this image?")
print(result)

New operations in Workflows

Thanks to @grzegorz-roboflow contribution, Property Definition block was extended with new UQL operation - extracting times that elapsed since start of video from each video frame.

⚙️ Maintanence

New Contributors

Full Changelog: v0.45.3...v0.46.0

v0.45.3

02 Apr 17:21
6312998
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.45.2...v0.45.3

v0.45.2

02 Apr 15:28
05cba37
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.45.1...v0.45.2

v0.45.1

02 Apr 12:50
2f40c21
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.45.0...v0.45.1

v0.45.0

28 Mar 19:07
11e2164
Compare
Choose a tag to compare

💪 Added

⚡️ Speed improvements by CodeFlash

Other changes

Full Changelog: v0.44.1...v0.45.0