Four years after the previous one, here is another 30,000-foot view of the blog’s output, starting with the blog’s 133,177 words:
29,809 words in the comments, fewer than last time because of a few long comment threads on deleted posts. More ‘point’s, more money, and less Safari.
65,407 words on Twitter, nearly three times the words from four years ago! Much less focused on current events – less Obama, and more @hamandcheese. Probably an improvement.
And finally, a new one – the 73,674 words from quotes I’ve collected.
Make your own at wordle.net. And since pasting a feed only pulls the last 5 (or so) posts, here’s the code to dump the full text from WordPress, which you can then paste into Wordle. The first script outputs every post, the second every approved comment.
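<?php
// Dump the title and body of every post as plain letters and spaces.
include('wp-load.php');
query_posts('posts_per_page=-1'); // -1 = no limit, fetch all posts
while (have_posts()) {
    the_post();
    // Strip HTML tags, then decode entities such as &amp;
    $content = html_entity_decode(strip_tags(get_the_title()." ".get_the_content()." "));
    // Replace anything that is not a letter or a space with a space
    echo preg_replace("/[^A-Za-z ]/"," ",$content);
}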
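<?php
// Dump the text of every approved comment.
include('wp-load.php');
// $wpdb->comments resolves to the comments table, including any custom table prefix
$q = "SELECT comment_content FROM {$wpdb->comments} WHERE comment_approved = '1'";
$r = $wpdb->get_results($q);
foreach ($r as $c) echo $c->comment_content." ";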
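If you want to sanity-check what the cloud shows, a rough tally of the most frequent words is easy to get from the same dump. This is only a minimal sketch: `top_words` is a hypothetical helper, not part of the scripts above, and it assumes you have the dumped text in a string.

```php
<?php
// Hypothetical helper (not from the original scripts): count how often
// each word appears in a block of text and return the most frequent ones.
function top_words(string $text, int $limit = 10): array {
    // Lowercase first so "Point" and "point" are counted as the same word
    $words = str_word_count(strtolower($text), 1); // 1 = return the words as an array
    $counts = array_count_values($words);          // word => number of occurrences
    arsort($counts);                               // highest count first, keys preserved
    return array_slice($counts, 0, $limit, true);
}

// Example with a made-up sample string
$sample = "more point less safari more point more money";
print_r(top_words($sample, 3));
```

Calling `str_word_count($text)` without the second argument returns just the total number of words, which is one way to get a figure like the totals quoted above.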