From the monthly archives:

August 2007

Friday Tea Time - 8/31/07

by TheMadHat on August 31, 2007

If you're new here, you must subscribe to my RSS feed, or I will hunt you down. Thanks for visiting!

It’s Friday before a three day weekend! Wait, I’ll probably work the whole weekend anyway…it’s a good think I like this stuff. Since it’s a holiday weekend we’re staying away from any Humankind vs Matt Cutts posts, let’s give him a break for at least 6 minutes :)…Okay, break over. Anyhoo, here are this week’s fun and wacky links from the world of SEO:

* Google made final cut of the GMail video. My favorite is some guy who climbed Mount Kilimanjaro (19,340 ft) with the freaking GMail card. Nutball. Here’s the video:

* Vanessa mentioned something about her wink profile, so here is TheMadHat on Wink. The funny thing is I my occupation is “spammer”. Apparently there are 204 other spammers out there. We should start a club.

* Oh how I wish I was a sexy English major at an Ivy League school…wait a sec, screw that. How about just a sexy chick with brains. Those Ivy League jack-offs don’t know shit. My point is that whoever gets to be the intern for ‘Zilla might have to work her ass off, but she’ll be one of the best in the end (no pun intended). Hell, I might shake my ass for a spot…

* Okay, here’s another video but it’s ROFLCOPTER funny. Found it via Paul.

Breaking News: All Online Data Lost After Internet Crash

* Just got SEO with .net in the mail today. Looking forward to getting some feedback from my .net guy.

* Google Is Good At Detecting Paid Links! (yea right) They Just Suck At Reading Title Tags And Recognizing Link Networks! I mean really come on…you think they can decipher intent in a blog post when they show crap like that? Okay, last Google comment, I’m done, really. Oh yea, that came from comments on this post and Google is FUDD (Full Of Dog Dumps). Last one. Promise.

* 3 interviews of Avinash Kaushik from Occam’s Razor. Good analytics wisdom as always.

* Where’s My Thinkbait? Kidnapped by Google, or in a coma. Vote now.

* Intraweb Gold: Some people have less of a life than SEO’s.

Have fun laboring over your beers on labor day. The End.

{ 0 comments }

Friday Tea Time - 8/24/07 (Belated Sunday Edition)

by TheMadHat on August 26, 2007

So Friday Tea Time is on Sunday this week. No complaining. You’ll drink it and you’ll like it. End of story.

* I’m sharing some new blogs I discovered this week that are worth a read. Stuff by Sarah includes all kinds of subjects including PHP, blogging, affiliate stuff and more (also lots of flower pictures if you’re into that). Second we have some great writing by the younger crowd (high-school??) over at Blogtrepreneur. Great design, well written with good advice. He’s also started a Blog consulting and review service called BlogLOUDER. Great start Adnan. Third is Marketfederation with blogging tips and Web 2.0 related stuff with sharp insights and opinions and he stays away from regurgitating news. Welcome to my reader and thanks for the link love! Next we have MikaelRieck.com with a new blog on making money online. Good start Mikael, and I got this idea from your first post so thanks! Last but not least, a blog about nothing specific.

* There are a lot of “SEO and blogging” posts out there, but this week Stephan Spencer has a good posts on common mistakes bloggers make that covers all the basics and then jumps to more advanced stuff like using the nofollow tag on your archive links. At least that tag may be good for something ;p

* Staying on the blogging topic, Darren posts 97 tips from his readers that every blogger should know. Become an A-Lister from the resources on this one page (or just have an excellent blog, some of the A-Listers are overrated anyway).

* Another Google paid links post, one of thousands this week including one on Google losing relevancy from me, but SEO book has a good conspiracy theory on why Google hates paid links that hits the nail on the head. Mostly. Good post, give it a read.

* 12 must be the new 10. Amber at PPC Hero has 12 things to do to increase landing page conversions. It’s a good set of guidelines for optimizing landing pages (PPC or otherwise). Combine this with the post from the Grok that claims Google checkout converts better than traditional checkouts. I’m not too sure about this one. Just because Google says it doesn’t make it so.

* Intraweb Gold: No More Showing Your Thong! High school girls everywhere prepare for jail.

Later folks. Next week I’ll try to be on time!

{ 2 comments }

Google Is Losing Its Grip On Relevance

by TheMadHat on August 23, 2007

Google is beginning to lose its firm lead in search relevance category and they don’t know what to do about it. They’re in a state of panic. The first sign was the release of the Nofollow tag. Originally created to allow publishers to battle spam on sites with user created content, it was quickly twisted into a tool to improve their algorithm. Google may have some methods of detecting paid links, but they know it isn’t very hard for marketers to figure out how to fly under the radar. They need the webmaster community to help them fix the issue by using nofollow on paid links. We all know about this tactic and it’s been discussed (2005) and debated (yesterday) for many moons. Part two of this specific scramble is the constant FUD about paid links that’s being pushed around by their engineers. They claim to be able to detect paid links, yet they are on a constant campaign to get people to use nofollow and trying to scare web marketers into not buying links. That tells me they aren’t very good at detecting paid links, and they think it’s hurting their relevance.

Part two of the relevance decline was the announcement of the unavailable_after tag. Again a blatant acknowledgment of their algorithm flaws. They can’t determine when a document is no longer relevant and out of date, so they’re asking the webmaster community to help them fix it. Let’s expand on this idiotic tag for a moment…

On the Google blog and elsewhere, people were all so excited about this new tag, claiming it would do wonderful things by allowing them more control over what pages are indexed. What a mind-fuck Google brainwash. One example given was running a promotion of some kind that would expire on a certain date. So, here I am selling widgets and I have this great promotion on “Super Human Green Widgets”. It’s an unheard of promotion and generates a ton of buzz in the widget community. I get loads of links (editorial links naturally) pointing to my promo page. After the promo I’m just going to have Google remove that page and lose all that link juice? You’d have to be insane to do something like that. 301 redirect that page to your normal super human green widget page and keep all the value you gained from running the promotion.

Part two of this nonsense tag is your competition. You really want to take your pages out of the index leaving the space open for your competitors who aren’t using this tag? If those outdated pages are ranking well, update the page, send them manually to a more relevant page, or 301 the page so you keep the value of those links. Using the unavailable_after tag is just plain stupid and a sneaky ploy by Google to get you to do their job.

Let’s summarize. This comes straight from the Google Webmaster Guidlines: “Make pages for users, not for search engines.” What it needs to say is “Make pages for users, not for search engines (unless we need help fixing our crappy results! If you need help, buy AdWords!).” Everyone stay the course and go buy some links.

{ 15 comments }

Personalized Anchor Text Score

by TheMadHat on August 21, 2007

Bill pointed out a patent recently awarded to Google I thought I would try and decipher. Based on the summary they could assign a separate personalized anchor text score based on a “set of user-specific parameters” and combine this with the regular score to reorder your results. Here is the summary from the patent:

A search engine identifies a list of documents from a set of documents in a database in response to a set of query terms. For each document in the list, the search engine determines an information retrieval score based on its content and the query terms, and also identifies a set of source documents that have links to the document and that also have anchor text satisfying a predefined requirement with respect to the query terms. The search engine calculates a personalized page importance score for each of the identified source documents according to a set of user-specific parameters and accumulates the personalized page importance scores to produce a personalized anchor text score for the document. The personalized anchor text score is then combined with the document’s information retrieval score to generate a personalized ranking for the document. The documents are ordered according to their respective personalized rankings.

Key takeaways from the summary and the claims:

* “Each of the identified source documents have anchor text that satisfies a search query corresponding to the set of query terms.”

* “Each of the identified source documents have anchor text that contains at least one of the query terms in the set of query terms.”

* “The personalized page importance score of the identified source document is independent from the set of query terms.”

Those seem to be fairly basic IR methods but looking a bit further into the patent I’ve come to a couple conclusions/theories/guesses. First, in certain cases this could have a significant impact on the result set. There are a lot of mentions of the source document meeting “predefined requirements” further down in the patent.

* The anchor text of each of the identified source documents is examined to determine if the anchor text satisfies a predefined requirement with respect to the query terms. (They note that the predefined requirement is that it must have terms that correspond the the search query, but there are more than just that. It would be fairly simple to spam in that case with massive page generation based on as many keyword combinations you could think up. More on the possible requirements later)

* After the above condition is met, then a personalized page score is calculated according to a set of user-specific parameters that could include a list of user favored websites or keywords suitable for identifying user favored websites. (Once again they are favoring websites you frequently visit, making it harder to find new sources of information that could be more relevant to your query, but I’ve already covered that)

* User specific parameters may be provided by the user, collected from a third-party having such information, or derived from a users search history and web history (Spyware in a nicely worded sentence - TURN YOUR WEB AND SEARCH HISTORY OFF!!!) .

The reason I think this could have significant impact on the result set is I don’t think the predefined requirement will be met very often and when it is you may see a significant ranking jump with certain sites in your personalized results. I would believe most result sets would carry the normal IR score and a very low to zero anchor text score. When the document meets all the criteria the anchor text score combined with the IR score will boost the page.

Assume that a user named Adam is looking for a website covering Stanford’s sports teams. For the purposes of this explanation, we will assume that Adam would prefer that the search engine return www.gostanford.com 110-2 ahead of www.espn.com 120-2. To achieve this goal, one approach would be to allow a user like Adam to instruct the search engine to personalize the rankings of search results by providing appropriate user information such as the user’s background information or a plurality of favorite websites. For example, Adam may register with the search engine that he prefers web pages whose URL includes the term “Stanford” over other web pages.

This looks to me like another indication of the increasing weight given to keyword domains. I also think it’s crap they just assume Adam would prefer gostanford.com over espn.com just because he’s been to the first one more often or he’s specified that in his profile. Maybe he hasn’t discovered something that is much more relevant and something he would prefer more. He might not ever find it in this system.

Conceptually, when computing personalized page importance scores, the Page Importance Ranker boosts the page importance scores of documents that are deemed to match the user-specific parameters, which in turn boosts the downstream documents linked to those documents. From another viewpoint, the Page Importance Ranker boosts the page importance scores of documents of each host whose URL matches one or more of the user-specific parameters.

Here is the boost I mentioned above, however the amount of “boost” is complete speculation.

In some embodiments, a document is be deemed to match (or not match) user-specific parameters solely based on the URL of the document.

Another hint of keyword domains being more powerful (time to increase my domain buying).

In some embodiments, the source pages listed for a respective document are limited to those that satisfy a predefined requirement with respect to the search query. For instance, in one embodiment the predefined requirement is that the anchor text of the link to the respective document contain at least one query term of the search query. In another embodiment, the predefined requirement is that the anchor text of the link to the respective document satisfy the entire search query, which may be a Boolean expression containing multiple query terms. In yet other embodiments, all source documents are included, without respect to whether the anchor text of the links to the respective document contain any of the query terms.

As I mentioned way up at the beginning of this post, here are some variations of their “predefined requirements”.

That’s about as detailed as I’m going to get. It’s an interesting look into personalized search and ways they might reorder your results. I still think all the concepts they have come up with so far are bad for the web and users alike, but personalized search is still in its infancy so we’ll see where it goes. If you want to take a look at the patent you can find it here. I’d also like to thank Bill for being at SES and having no time so I could be first to the market with this analysis, but I’m sure his will outshine mine, he’s the expert on these things.

{ 5 comments }

No More Nofollow - U Comment and I Follow!

by TheMadHat on August 20, 2007

I Follow Comments! Screw you Google. I believe if you have something to say you should be recognized for your contribution. Nofollow is a great idea in theory but I think it’s the wrong way to battle spam. It doesn’t work for one thing, and for another it goes against the whole idea of the web. So if you’ve got something worthwhile to contribute to any of my posts you will receive some link love for it.

My Akismet does a decent job of blocking most of the spam, and I moderate comments on a daily basis; it isn’t very hard when you have the massive volume of comments that I have :b. I’ve installed DoFollow which is a nifty little plugin that removes the no-follow attributes from Wordpress blogs. You can specify how long comment links stay nofollowed and/or if only registered users are allowed this feature. I have mine set at 24 hours and then nofollow will be removed.

I know this is painting a giant red bullseye on my site for spammers, but right now I’d rather deal with a little spam and get some conversations going. One caveat; “Nice post”, “I agree” comments will either be quickly deleted or manually nofollowed. So for everyone that’s commented thus far, and for those of you that will contribute in the future, I’ll give you a little something back for doing so.

For those of you who feel like sticking it to the man, here is a list of plugins that remove nofollow and here is the Dofollow D-List that lists other blogs joining the Dofollow movement (this list appears to have originated at GeekySpeaky). I’d also like to give props to Randa Clay Design for the logo.

Bring it on trackback spammers…I can handle it!

{ 23 comments }

Friday Tea Time - 8/17/07

by TheMadHat on August 17, 2007

I think I read every post this week from every SEO blogger I could find. My eyeballs hurt…there are a lot of good blogs out there putting out some good ideas and theories. If you’re one of those bloggers then congratulations! If you’re one of those bloggers that regurgitate SEO news every day of the week on every post then please go hang yourself, I’m tired of wasting energy clicking on your buttons. If you’re one of those bloggers with partial feeds, I probably didn’t read your post because I’m lazy like that. On to the best and brightest posts I found this week:

* Dan Thies on proxy hacking taking people out of the SERP’s. It’s happened to me folks and it’s easy to do so you certainly need to check it out. SEO Egghead has the solution on how to defend against proxy hacking which is heavy into PHP but doesn’t look too hard to implement. It’s sad the SEO community had to come up with a solution for this when Google has known about it for quite some time.

* Aaron Wall tells us Matt, Mihai, and other search engine bloggers are all secret propagandists trying to mislead the public into mass confusion. I agree.

* Via ProBlogger: Blog Action Day. Participate, it’s for a good cause and is an interesting experiment into social media. Darren also gives us some tips on how to spy on competing blogs that makes for a good blueprint.

* Again from Darren we finally get an explanation on why our freaking feedburner numbers never stay consistent.

* GEEK ALERT! w00t! I exploded my test server a couple times at 1am last night but finally got it working. It’s the fastest I’ve ever installed Linux with everything where it’s supposed to be. Fast Uniform Linux Install From Batch File. Thank you kindly UKgimp!

* Guess what folks, there are PERVERTS ON THE INTERWEB!!!! CNET of all places published this garbage from some 17 year old who thought getting a friend request from an old guy was “ick, creepy”. Jordan from Marketing Pilgrim agrees that this was a waste of space. Hey Sabena Suri….Shut your damn face! Everyone that reads this please go and add her to your friends list. Or poke her, whatever that is. Here is her profile link: http://www.facebook.com/s.php?q=Sabena+Suri&init=q. Maybe that will teach her to not talk shit about people old enough to remember MS-DOS.

* Here is another celebrity guide post from Copyblogger, this time it’s marketing with David Lee Roth. I love these famous/historical people guides to marketing he publishes. Brian, please make a resource page with all these types of posts. And maybe do something like “The Paris Hilton Guide on Becoming a Legendary Whore”. That should get some diggs.

* I found a new blog yesterday…LinkJuicy from Peter Da Vanzo. He’s only had 6 posts thus far, but every one is excellent. His latest is an advanced linking interview with Ralph Tegtmeier of Fantomaster.com, a well known authority on the topic. Check it out: Part I and Part II. Enjoy and welcome to my feed reader LinkJuicy.

* Are you a time waster? Take this test and find out. Seems to me just taking the test is a good indication that you’re a time waster.

* Found via somewhere I forgot, here is a good financial analysis on the online and offline ad markets from Henry Blodget at Internet Outsider.

* Retailers’ SEO Efforts Found Lacking. No kidding? Really? I think we get this “astonishing” piece of news every year.

* Intraweb Gold: Fast Food Deliciousness

Everyone have fun next week at SES San Jose. Be sure to eat at your hotel because there isn’t anywhere else in that damn vacant downtown to get edible food.

{ 1 comment }