tests: start adding e2e tests #55

sd2k · 2025-03-21T17:05:18Z

This PR adds end-to-end tests for Loki integration and adds test documentation.
This is iteration one as we want to add a basic structure on e2e testing for now. We need to iterate further on them.

Note: prompts needs to be specific when using llm-as-a-judge. I've noticed some flakiness on the llm responses so some times tests are failing, especially the test_loki_logs_tool.

When we are confident that tests are consistently passing then we can make it part of the ruleset.

Very WIP.

tests/loki_test.py

sd2k

Nice, thanks for getting this working properly! I can't approve since I'm the OG author so we'll need someone else to take a look too.

tests/pyproject.toml

csmarchbanks · 2025-04-18T14:01:13Z

tests/loki_test.py

+@pytest.mark.parametrize("model", models)
+async def test_loki_logs_tool(model: str, mcp_client: ClientSession):
+    tools = await mcp_client.list_tools()
+    prompt = "Can you list the last 10 log lines from all containers using any available Loki datasource? Give me the raw log lines. Please use only the necessary tools to get this information."


This test is failing for me at least half the time. Generally from trying to put in some non-container label matcher, anything from {job=~".+"} to {job="varlog"}. I wonder if we could at least tweak the prompt or the tool description to get the test to work more consistently.

sd2k added 3 commits March 21, 2025 17:05

tests: start adding e2e tests

2b38931

Very WIP.

Fix SSE URL

12bf177

Get tests running, if not passing

b6512b2

sd2k commented Mar 26, 2025

View reviewed changes

tests/loki_test.py Outdated Show resolved Hide resolved

sd2k and others added 9 commits April 2, 2025 11:02

Use anyio to run async tests instead of pytest-asyncio

8dba5c2

Don't use session-scoped fixtures

338e9be

test label values question

78ea7f8

verify loki datasource exists

79c23f2

change the flow to test loki logs tool

159b145

iterate on the prompts

7419c1a

Update README

76c20c4

Add model info in the README

045db3f

basic linting

ccf5e21

ioanarm force-pushed the e2e-tests branch from a9f0f0c to ccf5e21 Compare April 3, 2025 09:40

ioanarm marked this pull request as ready for review April 3, 2025 09:42

ioanarm requested a review from a team as a code owner April 3, 2025 09:42

rename test

461185f

sd2k commented Apr 4, 2025

View reviewed changes

tests/pyproject.toml Show resolved Hide resolved

ioanarm self-assigned this Apr 4, 2025

add dotenv on dev deps

af925f2

ioanarm mentioned this pull request Apr 7, 2025

Add e2e tests using Python / Typescript clients #43

Open

12 tasks

ioanarm requested a review from gitdoluquita April 15, 2025 13:53

ioanarm assigned gitdoluquita Apr 15, 2025

Add log line for debugging tool calls

8f935de

csmarchbanks reviewed Apr 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests: start adding e2e tests #55

tests: start adding e2e tests #55

sd2k commented Mar 21, 2025 •

edited by ioanarm

Loading

sd2k left a comment

csmarchbanks Apr 18, 2025

tests: start adding e2e tests #55

Are you sure you want to change the base?

tests: start adding e2e tests #55

Conversation

sd2k commented Mar 21, 2025 • edited by ioanarm Loading

sd2k left a comment

Choose a reason for hiding this comment

csmarchbanks Apr 18, 2025

Choose a reason for hiding this comment

sd2k commented Mar 21, 2025 •

edited by ioanarm

Loading