Clear page cache in some scripts#786

Closed
rschu1ze wants to merge 0 commits into main from page-cache

Conversation

@rschu1ze
Member

@rschu1ze rschu1ze commented Feb 9, 2026

Resolves #774

@rschu1ze force-pushed the page-cache branch 2 times, most recently from bc275a5 to 9b4288c, on February 9, 2026 17:00
"hardware": "cpu",
"tuned": "no",
"comment": "",
"tags": ["column-oriented", "lukewarm-cold-run"],
Contributor

@puzpuzpuz puzpuzpuz Feb 9, 2026

It's still lukewarm since the Python process is kept around between query runs. The tag should be kept as is. Am I missing something?
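To make the distinction in this comment concrete, here is a minimal sketch based on the definitions quoted later in this thread. The function name `classify_run` and the labels `"true-cold"`, `"lukewarm-cold"`, and `"not-cold"` are hypothetical and only for illustration; the only name taken from the PR is the `lukewarm-cold-run` tag.

```python
# Tag name as it appears in the diff under review.
LUKEWARM_TAG = "lukewarm-cold-run"


def classify_run(page_cache_cleared: bool, process_restarted: bool) -> str:
    """Classify the first run of a query (labels are illustrative only).

    page_cache_cleared: the OS page cache was dropped before the first run.
    process_restarted:  the engine process (here: the Python process) was
                        restarted, so in-process caches are gone as well.
    """
    if page_cache_cleared and process_restarted:
        return "true-cold"
    if page_cache_cleared:
        # Page cache is empty, but the long-lived process may still hold state.
        return "lukewarm-cold"
    return "not-cold"


def needs_lukewarm_tag(page_cache_cleared: bool, process_restarted: bool) -> bool:
    """True when a result should keep the lukewarm tag under this reading."""
    return classify_run(page_cache_cleared, process_restarted) == "lukewarm-cold"
```

Under this reading, a script that now clears the page cache but keeps one Python process alive across queries gives `needs_lukewarm_tag(True, False) == True`, which matches the request to keep the tag.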

@puzpuzpuz
Contributor

@rschu1ze thanks for handling the issue. Are you going to rerun the benchmarks as well? The results are likely to change after your modifications.

@rschu1ze
Member Author

rschu1ze commented Feb 9, 2026

I triggered the automation but all jobs fell victim to GitHub's outage today :-( Trying again now.

"proprietary": "no",
"hardware": "cpu",
"tuned": "no",
"tags": ["column-oriented","dataframe","lukewarm-cold-run"],
Contributor

Same as with polars, the lukewarm-cold-run tag should be kept. It is worth checking the other changed results too, e.g. it looks like duckdb-dataframe also keeps the Python process around.

@puzpuzpuz
Contributor

> I triggered the automation but all jobs fell victim to GitHub's outage today :-( Trying again now.

GH seems to be running normally now. Could you retry?

@rschu1ze
Member Author

Sure (however this PR is currently low priority for me, I work on it on the side, probably tomorrow or on the weekend).

@puzpuzpuz
Contributor

> Sure (however this PR is currently low priority for me, I work on it on the side, probably tomorrow or on the weekend).

Sorry, I just wanted to check if this PR is still on your radar. No need to rush, I understand the circumstances.

@xe-nvdk
Contributor

xe-nvdk commented Feb 15, 2026

Hey team, quick question. I thought the tag was for systems that do not clear the OS cache and do not restart with every query. Am I wrong?

Is forcing all systems to restart for every query on the roadmap yet?

Thank you

@nwoolmer

nwoolmer commented Feb 17, 2026

Hey @xe-nvdk, my understanding is that lukewarm runs only clear OS page cache and do not require restarts. True cold runs require database restarts to ensure the internal database caches are also cleared.

Many results have not been migrated to 'true cold' runs yet; I think @rschu1ze wanted some help crowd-sourcing the refresh.

Citation:

> 2.a) True cold runs. Before each first run of each query, all operating system caches (page cache) and database caches (e.g. buffer pools) are cleared.
>
> Some databases provide commands to clear internal caches. For fairness towards databases which do not offer such statements, it is required to restart the database before the first run of each query.
>
> Databases which do not stick around as a background process between queries, e.g. [clickhouse-local](https://clickhouse.com/docs/operations/utilities/clickhouse-local), satisfy this requirement implicitly.
>
> It is still needed to clear the page cache before each first query to qualify as a true cold run.
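As a rough illustration of the rule quoted above, the sketch below builds the command sequence for one benchmark pass. It is not taken from the repository's scripts: `true_cold_plan`, `execute`, the `restart_cmd` and `run_query_cmd` parameters, the default of three tries, and the choice to restart before dropping the page cache are all assumptions. The `drop_caches` write is the usual Linux mechanism and needs root.

```python
import subprocess
from typing import List, Optional, Sequence

# Usual Linux way to flush dirty pages and then drop the page cache,
# dentries and inodes. Requires root privileges.
DROP_PAGE_CACHE = ["sudo", "sh", "-c", "sync && echo 3 > /proc/sys/vm/drop_caches"]


def true_cold_plan(
    queries: Sequence[str],
    run_query_cmd: Sequence[str],
    restart_cmd: Optional[Sequence[str]] = None,
    tries: int = 3,  # assumed number of runs per query
) -> List[List[str]]:
    """Build the ordered list of commands for a true cold pass.

    restart_cmd may be None for engines that do not stay resident between
    queries; the page cache is still dropped before each first run.
    """
    plan: List[List[str]] = []
    for query in queries:
        if restart_cmd is not None:
            plan.append(list(restart_cmd))  # clears internal database caches
        plan.append(list(DROP_PAGE_CACHE))  # clears the OS page cache
        for _ in range(tries):
            plan.append([*run_query_cmd, query])
    return plan


def execute(plan: List[List[str]]) -> None:
    """Run each step in order and stop at the first failing command."""
    for cmd in plan:
        subprocess.run(cmd, check=True)
```

Separating the plan from its execution makes it possible to inspect or log the steps without root access; a lukewarm pass would correspond to the same plan with `restart_cmd=None` for an engine that does stay resident.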

@rschu1ze
Member Author

I somehow pushed the wrong branch to the wrong location, GitHub closed this PR and won't let me open it again.

@rschu1ze rschu1ze deleted the page-cache branch February 17, 2026 20:20
@puzpuzpuz
Contributor

> I somehow pushed the wrong branch to the wrong location, GitHub closed this PR and won't let me open it again.

I guess you could create a new branch and open a new PR.

@rschu1ze
Member Author

rschu1ze commented Feb 17, 2026

Replacement PR: #792

Re cold runs: Yes, the migration to "true" cold runs has not started yet, and we had better crowd-source it. Let me create an umbrella ticket --> #793

Development

Successfully merging this pull request may close these issues.

Participants with local on-disk storage, but without OS page cache flush

4 participants