MNT: Replace live server calls with mocks in tests/test_utils/test_utils.py#1679
Rahuldrabit wants to merge 5 commits into openml:main
Conversation
Pull request overview
This PR replaces live server network calls with local mocks in tests/test_utils/test_utils.py to make tests run deterministically and offline, addressing part of issue #1649. The changes are test-only with no production code modifications.
Changes:
- Added helper function `_create_mock_listing_call` to generate mock responses for paginated listing endpoints
- Modified `_mocked_perform_api_call` to return mock XML responses for dataset queries
- Updated 10 test functions to use mocks instead of live server calls, removing the `@pytest.mark.test_server()` and `@pytest.mark.flaky()` markers
@geetu040 Hi! Can you please review?
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Pull request overview
Copilot reviewed 1 out of 1 changed files in this pull request and generated 4 comments.
```python
def mock_listing_call(limit, offset, **kwargs):
    if offset >= total_items:
        return pd.DataFrame() if return_type == "dataframe" else []
    size = min(limit, total_items - offset)
    items = [item_factory(i) for i in range(offset, offset + size)]
    return pd.DataFrame(items) if return_type == "dataframe" else items

return mock_listing_call
```
_create_mock_listing_call appends an empty batch when offset >= total_items, which can leave an extra empty element in the _list_all() result list in cases where limit is None and total_items is an exact multiple of batch_size. This differs from real listing calls which often signal end-of-results via OpenMLServerNoResult, and can subtly change downstream behavior (e.g., callers that expect no empty final batch). Consider raising openml.exceptions.OpenMLServerNoResult when offset >= total_items, or ensure _list_all-style termination without adding an empty batch to results.
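A minimal sketch of the reviewer's suggestion, assuming the helper's factory signature (`total_items`, `item_factory`, `return_type`) from the diff above. The exception class here is a local stand-in for `openml.exceptions.OpenMLServerNoResult`; raising it on an exhausted offset mimics how the real listing endpoints signal end-of-results, so no empty batch is ever appended:

```python
import pandas as pd


class OpenMLServerNoResult(Exception):
    """Local stand-in for openml.exceptions.OpenMLServerNoResult."""


def create_mock_listing_call(total_items, item_factory, return_type="dataframe"):
    def mock_listing_call(limit, offset, **kwargs):
        if offset >= total_items:
            # Raise instead of returning an empty batch, matching how the
            # real listing calls terminate pagination.
            raise OpenMLServerNoResult(f"No results at offset {offset}")
        size = min(limit, total_items - offset)
        items = [item_factory(i) for i in range(offset, offset + size)]
        return pd.DataFrame(items) if return_type == "dataframe" else items

    return mock_listing_call
```

With this variant, a `_list_all`-style pagination loop stops cleanly even when `total_items` is an exact multiple of the batch size.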
| return """<oml:data xmlns:oml="http://openml.org/openml"> | ||
| <oml:dataset> | ||
| <oml:did>61</oml:did> | ||
| <oml:name>iris</oml:name> | ||
| <oml:version>1</oml:version> | ||
| <oml:status>active</oml:status> | ||
| </oml:dataset> | ||
| </oml:data>""" | ||
| raise ValueError(f"Unexpected call: {call}") | ||
|
|
||
|
|
_mocked_perform_api_call hard-codes the full expected call string (including parameter order). This makes the test brittle to harmless refactors in URL construction (e.g., reordering filters) while not changing behavior. Consider matching on structured components (e.g., prefix + presence of required segments) or parsing the endpoint into parts before deciding which response to return.
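One way to sketch this suggestion, under the assumption that listing endpoints look like `data/list/<filter>/<value>/...`: split the call into path segments and dispatch on the leading components, so reordering filters no longer breaks the match. The function name and XML fixture below mirror the mock under review but are illustrative, not the PR's actual code:

```python
from urllib.parse import urlsplit


def mocked_perform_api_call(call, request_method=None):
    # Split e.g. "data/list/data_name/iris/limit/2" into segments so the
    # dispatch depends on structure, not on one exact string.
    parts = [p for p in urlsplit(call).path.split("/") if p]
    if parts[:2] == ["data", "list"]:
        # Any dataset-listing call matches, regardless of filter order.
        return """<oml:data xmlns:oml="http://openml.org/openml">
            <oml:dataset>
                <oml:did>61</oml:did>
                <oml:name>iris</oml:name>
                <oml:version>1</oml:version>
                <oml:status>active</oml:status>
            </oml:dataset>
        </oml:data>"""
    raise ValueError(f"Unexpected call: {call}")
```

Calls that reorder the `data_name` and `limit` filters now hit the same branch, while genuinely unexpected endpoints still fail loudly.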
```python
mock_list_tasks.side_effect = _create_mock_listing_call(
    min_number_tasks_on_test_server, lambda i: {"tid": i}
)
tasks = openml.tasks.list_tasks(size=min_number_tasks_on_test_server)
assert min_number_tasks_on_test_server == len(tasks)
```
These tests now use local mocks rather than the test server, but the fixture/variable name min_number_tasks_on_test_server (and similar) still suggests a dependency on the remote test server state. Renaming the parameter/fixture (and updating its docstring) to reflect that it’s just a mocked item count would make the intent clearer and avoid confusion about network requirements.
| """This test verifies that the test server downloads the data from the correct source. | ||
|
|
||
| If this tests fails, it is highly likely that the test server is not configured correctly. | ||
| Usually, this means that the test server is serving data from the task with the same ID from the production server. | ||
| That is, it serves parquet files wrongly associated with the test server's task. | ||
| """ | ||
| mock_task = unittest.mock.Mock() | ||
| mock_dataset = unittest.mock.Mock() | ||
| mock_dataset.features = {0: "feature1", 1: "feature2"} | ||
| mock_dataset.get_data.return_value = (pd.DataFrame({"feature1": [1], "feature2": [2]}), None, None, None) | ||
| mock_task.get_dataset.return_value = mock_dataset | ||
| mock_get_task.return_value = mock_task | ||
|
|
||
| task = openml.tasks.get_task(119) |
test_correct_test_server_download_state no longer tests the stated behavior in its docstring: patching openml.tasks.get_task replaces the whole OpenML code path with a mock, so the assertion only validates the mock setup (features vs dataframe columns) rather than verifying the server/data-source configuration. Either (a) update/rename the test and docstring to reflect the new goal, or (b) mock at a lower level (e.g., _perform_api_call / download layer) so openml.tasks.get_task() and dataset parsing logic are still exercised offline.
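Option (b) can be illustrated in miniature. The snippet below is a self-contained stand-in, not openml's real internals: `perform_api_call` plays the role of the transport layer and `get_task` the role of the higher-level parsing code. Patching only the transport means `get_task` still executes its own logic offline, which is the property the reviewer wants preserved:

```python
import unittest.mock


def perform_api_call(endpoint):
    # Stand-in for the live transport layer; real tests would never reach it.
    raise RuntimeError("network disabled in tests")


def get_task(task_id):
    # Higher-level logic that the test should still exercise.
    xml = perform_api_call(f"task/{task_id}")
    # Trivial stand-in for the real XML-to-object parsing.
    return {"id": task_id, "raw": xml}


# Patch only the lowest layer; get_task() and its "parsing" still run.
with unittest.mock.patch(f"{__name__}.perform_api_call",
                         return_value="<oml:task/>") as mock_call:
    task = get_task(119)

assert task == {"id": 119, "raw": "<oml:task/>"}
mock_call.assert_called_once_with("task/119")
```

Patching `openml.tasks.get_task` directly, by contrast, would replace everything above the dashed line, so the assertions could only ever validate the mock itself.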
Reference Issue: Fixes #1649
- Replace tests' live-server network calls with local mocks/patches so tests run deterministically and offline.
- Add helpers to mock listing endpoints and API call responses (for datasets, tasks, flows, setups, runs, evaluations).
- Patch cache/config calls and use `tmp_path` where needed, so cache utilities are tested without touching the real filesystem config.
- No production code changes; test-only changes.

Result: Passed