Greetings,
In the event that someone else may find use from this, I thought I'd share what I found / did on discovery of a huge session table in the DB.
I just created a new site for a client; one who had formerly used a (poor) discussion forum tool that used, well... oodles of unique URLs. I've eliminated that forum tool from the current site.
New site has been up for a week (on a new host), and the host had applied a small amount of "throttling" to the site due to high system resource utilization, which was unexpected as it's getting less than 100 visitors per day per Google Analytics.
I looked in the host's slow DB queries log and found a number of them for simple queries to the modx_sessions table. Looked at the table and found that it had 60,000+ rows after only a week online!
That made no sense at all -- at first -- with less than 100 visitors / day, then my brain kicked in and I remembered the old forum software. I looked in the raw Apache access logs for the site and found that the site is getting hit about six times per minute by search engines -- primarily the Chinese Baiduspider (or something claiming to be) -- trying to crawl that old - now non-existent -- forum software's URLs.
Of course, MODX was handling each of those requests and displaying a 404, but I'm guessing because the crawlers weren't accepting cookies(?) a new session was getting created for each request. Well.... yuck.
SOLUTIION
My solution was to create a folder for the old forum software and add .htaccess in that specific folder that let Apache return an HTTP 404 for every URL that would be from the old forum software, without MODX (and the DB) needing to get involved:
RewriteEngine Off
DirectoryIndex 404.html
ErrorDocument 404 /obsolete-forum-directory/404.html
I also added an entry in the robots.txt file to keep robots out of that directory, hopefully causing them to give up a bit sooner.
Anyway -- the modx_session table is now nice and quiet, and I suspect that the host's throttling algorithm will be happier too, as will the resultant site performance.
Hope this helps someone else with the same unusual combination of circumstances