One tech obsessed geek girl opining about current events, technology, and herself.
So earlier today I was doing some catching up on Google Alerts for some domains that I manage.
And I kept on finding pages that look like the one below - same formatting, even.
When I first noticed these pages the middle of last week, I took them for a stupidly overzealous SEO who was planting link farms on sites he owns.
Now, I don’t think so - after examining a number of these rogue SEO pages, it looks like someone is taking advantage of an exploit in Apache to post directories full of these rogue SEO pages, to boost their page rank (while adding outside links on these rogue pages to, I guess, appear genuine).
All of the pages I’ve found are on machines running Apache in shared hosting settings with poorly maintained / designed parent sites. That sure as heck points to exploit.
Take for example the page I posted above. The full URL looks like http://destinationconcerts.com/tmp416/cnf336/neurology_49.htm.
Since, like I noted before, the site is poorly maintained which means you can go ahead and browse the parent directories. The main Web site seems to be a homepage (created in Microsoft FrontPage) for a concert promoter in Allentown, PA. The hosting provider is E-Commerce, Inc. And this was just one, out of a number of pages that I found hosted by E-Commerce, Inc. I also found other pages on sites hosted by The Planet and, irony abounding, The Institute for Intelligence Studies at Mercyhurst College.
So, just who is planting these pages and why?
Technorati Tags: Exploit, Apache, SEO, Rogue SEO, Search Engine Optimization
This past weekend there’s been a conversation about Shyftr a new RSS service that allows people to read and comment on full text stories on the Shyftr site, rather making the reader click through to the originating blog to comment. The thought is that folks who care about pageviews for advertising will lose out in such a scenario.
So, in the spirit of helping the wider, feathers in a ruffle, blogging community out, I’ve pasted the Shyftr RSS bot info below. The good news is that you can block the Shyftr IP address from accessing your blog (if you already have that capability through your blog hosting solution, etc.). As of present, the IP address is 66.234.234.34.
Unlike other annoying bots, I would not block the user agent in your .htaccess file as the RSS bot software the Shyftr folks are using is the generic MagpieRSS toolset, which is used by other RSS services. Hopefully, the people at Shyftr will rename the user agent to something more uniquely identifiable in the future so you can block via .htaccess.
(Note: Blocking a future unique Shyftr user agent via robots.txt probably won’t work as the crawler would need to fetch the robots.txt file first before fetching your feed and I didn’t see that behavior tonight.)
Host: 66.234.234.34 * /feed Http Code: 200 Date: Apr 12 19:48:28 Http Version: HTTP/1.0 Size in Bytes: 6244 Referer: - Agent: MagpieRSS/0.72 (+http://magpierss.sf.net) * /favicon.ico Http Code: 200 Date: Apr 12 19:48:28 Http Version: HTTP/1.0 Size in Bytes: 1406Referer: - Agent: -
Technorati Tags: Shyftr, RSS, RSS toolset, robots, crawlers, bots, lost pageviews, comments, blogger brouhaha
On the MSNBC developer blog, the question was posed How do you share?. Not in the grade school way, but in the newfangled Web 2.0 way.
Overall, the comments from MSNBC readers were pretty… negative. Aside from the “I’ll just paste the link I want to share in an email” or the “I’ll just add the page to my browser bookmarks” or the “they’re tracking your habits for nefarious purposes” comments, other commenters cited just one or two social bookmarking sites (the most popular seeming to be either del.icio.us or digg.com). And a few other commenters wondered, “Hey, MSNBC, don’t you own Newsvine?”
It appears that the zen habits of social bookmarking hasn’t been widely accepted by the at large Internet populace.
Technorati Tags: social bookmarking, bookmarking, Web 2.0, del.icio.us, digg, newsvine, putting it in my favorites
For those of you with Apple TV, do you like it?
I’m thinking of springing for it, seeing as the idea of downloading movies and watching them on my (nearly outdated last of the mohicans CRT TV) does appeal to me. I don’t watch broadcast TV, I don’t have on-demand anything nor do I Netflix.
On the other hand, the iMac is in the family room too and I could, I suppose, hook that up to the TV negating the need for another product from Apple.
Thoughts?
Technorati Tags: Apple TV
Building upon a discussion elsewhere on the Web, here’s some brute force SEO for you.
Apparently, the NY Times is inserting tagging in the page META title tag, in the instances where it seems that article headlines lack sufficient keywords. Normally, the Times just carries the article’s headline into the page META title tag.
For example, in the article headlined The Falling-Down Professions, the page title tag reads as “Economic Conditions-Economic trends-legal profession-lawyers-prestige-doctors - New York Times”.
You see, the page title tag is important for SEO as Google in particular lends much weight to the text contained within the title tag.
All in all, the NY Times approach is definitely an interesting methodology for organizations deploying content management systems and who wish to build traffic from search engines.
Technorati Tags: SEO, tagging, title tag, keywords, NY Times, CMS, search engines, search engine traffic