Also regarding the "just stop the deleting" and "way too much space". I did a test where I just removed the code that deletes old rows (). On my local test wiki, the database tables would take up this much space:
Table name |
Size in MB |
Rows |
(only oldest and newest) |
(all revisions) |
(only oldest and newest) |
(all revisions) |
discussiontools_item_ids |
0.86 |
0.86 |
4046 |
4046 |
discussiontools_item_pages |
0.50 |
0.50 |
3822 |
3822 |
discussiontools_item_revisions |
0.95 |
20.55 |
8352 |
171850 |
discussiontools_items |
0.53 |
0.53 |
3393 |
3393 |
Total |
2.84 |
22.44 |
|
(there are 4046 recorded comments or headings across many revisions of 165 pages, some created for testing and some copied from elsewhere)
I'm not sure how representative this is of real-world usage, but I think somewhere between 10x and 100x increase would be expected on a real wiki as well. (Depending on how many comments are present in a usual talk page revision… Most have few, but village pumps etc. can have hundreds, and some user talk pages where the owner refuses to ever archive them have thousands.)
When I was estimating disk usage in production in T303295#7850298, I figured enwiki gets ~11,500 comments per day, corresponding to about ~1-2 MB of data per day (that might have been an underestimate, looking at it now). Handling 100-200 MB per day probably wouldn't be impossible, but it would require additional maintenance and maybe additional hardware and didn't seem worth it (although I wanted to do it before I estimated this).