searchlove boston 2013_will critchlow_technical seo
TRANSCRIPT
Modern Technical SEO
WILL CRITCHLOW
Best practice says you should remove all
parameters from your URLs
That kind of “recommendation” doesn’t
get things done
Jeff Bezos insists on recommendations written in prose
Read Steve Yegge’s accidental reply-all
https://plus.google.com/110981030061712822816/posts/AaygmbzVeRq
Presenting to Jeff is a gauntlet that tends to
send people back to the cave to lick their wounds
-- Steve Yegge
Imagine you’ve done all that – now be prepared to be
asked “WHY?”
Site speed
You know why you care about speed
You need to care about the details
You’ve probably all used tools like Google PageSpeed Insights
A high score indicates little room for improvement,
while a lower score indicates more room for
improvement. The PageSpeed Score does not
measure the time it takes for a page to load.
Wait, what?
Why?
Checking boxes doesn’t delight users
Diagnose when sites are actually slow
Difference between “Has a CDN” vs. “CDN speeds site up”
How?
Gather more site speed data in GA
Add this line
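The line itself didn’t survive into the transcript; a sketch, assuming the classic async ga.js snippet that was current at the time of this talk (the property ID is a hypothetical placeholder). By default GA collected site-speed timings from only about 1% of visits; `_setSiteSpeedSampleRate` raises that sample:

```javascript
// Sketch: raise the site-speed sampling rate in classic ga.js
var _gaq = _gaq || [];
_gaq.push(['_setAccount', 'UA-XXXXXXX-1']);   // hypothetical property ID
_gaq.push(['_setSiteSpeedSampleRate', 100]);  // sample up to 100% of pageviews
_gaq.push(['_trackPageview']);
```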
Understand waterfall reports
We’re working on our own site speed. This is from DistilledU
But what does it mean?
All the credit goes to Waterfalls 101 from Web Performance Today
Loads of rows?
Combine assets (CSS, JS, images)
Seeing lots of ORANGE bars?
Try “keep-alive” to avoid dropping TCP
connections
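Orange bars in a waterfall are time spent opening new TCP connections; keep-alive lets the browser reuse one connection for several requests. A sketch for Apache (the directive names are real; the values are illustrative):

```
# Apache httpd.conf – reuse TCP connections across requests
KeepAlive On
MaxKeepAliveRequests 100
KeepAliveTimeout 5
```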
Big GREEN bars?
Shorten them with server-side improvements and CDNs
Use analytics to tell you which – segment geographically
Big BLUE bars?
Shorten them by optimizing assets
Shrink images, minify CSS / JS
Before and after
Note: blue bars look bigger because of combined assets but total blue is less
SEGUE
Robots.txt
Why?
It’s amazing how often this gets screwed up
Spot-quiz from DistilledU
With this robots.txt, what areas of the site can googlebot crawl?
Answer: everything but the /secret/ directory – robots.txt rules do not inherit
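The robots.txt from the slide isn’t in the transcript, but a hypothetical file like this reproduces the gotcha – a user-agent-specific block replaces the wildcard block entirely rather than adding to it:

```
User-agent: *
Disallow: /admin/
Disallow: /secret/

User-agent: googlebot
Disallow: /secret/
```

Because googlebot matches its own block and ignores the `*` block, it can crawl everything (including /admin/) except /secret/.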
Set alerts for changes to Robots.txt
I use Server Density [disclaimer: we’re investors] – see how here
SEGUE
Mobile and international have similar technical challenges
m. / t. / www.
.co.uk / .de / .es / .fr
One site or more?
Sets of international sites group with hreflang
.co.uk / .de / .es / .fr
HREFLANG
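Each country site declares the grouping with reciprocal hreflang links in its `<head>`; a sketch using hypothetical example.* domains:

```html
<link rel="alternate" hreflang="en-gb" href="http://www.example.co.uk/" />
<link rel="alternate" hreflang="de" href="http://www.example.de/" />
<link rel="alternate" hreflang="es" href="http://www.example.es/" />
<link rel="alternate" hreflang="fr" href="http://www.example.fr/" />
```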
Sets of mobile sites group with rel=alternate
m. / t. / www.
ALTERNATE
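The desktop page points at its mobile equivalent with rel=alternate, and the mobile page points back with rel=canonical; a sketch with hypothetical URLs:

```html
<!-- On the desktop page (www.) -->
<link rel="alternate" media="only screen and (max-width: 640px)"
      href="http://m.example.com/page" />

<!-- On the mobile page (m.), pointing back -->
<link rel="canonical" href="http://www.example.com/page" />
```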
Why?
Declare a canonical page and a mobile version
Do a search like this on the mobile
When you click through, you end up on the mobile version
It’s hard to tell, but there’s no redirect
This link actually goes to the m. version specified with a rel=alternate
But the title, URL, description all come from the desktop version
Who links while mobile?
Desktop pages accrue all the authority
Check all of this with Chrome mobile emulation
Settings → Developer Tools → cog (bottom right) → Overrides
“Vary” header
Why?
Change your page based on user-agent without worrying about cloaking
RESS – REsponsive design with Server Side components
HTTP header
curl -I www.example.com
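The response headers show whether the server is declaring user-agent-dependent content; illustrative output (www.example.com is a placeholder, and the exact headers will differ per site):

```
$ curl -I http://www.example.com/
HTTP/1.1 200 OK
Content-Type: text/html; charset=UTF-8
Vary: User-Agent
```

The `Vary: User-Agent` line tells caches and crawlers that the same URL can return different HTML to different user-agents.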
Modern meta information
Why?
Control how your pages look when shared
Do better than this kind of share
Actually, I checked and WSJ does have og: tags – specifying that image so maybe it’s deliberate branding
Twitter cards allow control of the tweet versus the basic:
Implement Twitter Cards
Get a competitive advantage and sort this out now (stats from BuiltWith)
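A minimal summary card is a handful of meta tags in the page `<head>`; a sketch with hypothetical values:

```html
<meta name="twitter:card" content="summary" />
<meta name="twitter:site" content="@example" />
<meta name="twitter:title" content="Modern Technical SEO" />
<meta name="twitter:description" content="Slides from SearchLove Boston 2013" />
<meta name="twitter:image" content="http://www.example.com/card-image.png" />
```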
SEGUE
JavaScript
Why?
The days of “googlebot can’t execute JS so I don’t need to understand it” are gone
Anything beyond very basic customization of analytics code results in you writing custom JavaScript
This is a screenshot of the DistilledU module on customizing GA
Sidenote: not all Google Analytics has to be JavaScript-based
We were toying with pushing Googlebot visits to GA via a server-side call
You can also use a GET request by constructing URLs like this:
http://www.google-analytics.com/collect?v=1&tid=UA-1618063-1&cid=122303&t=pageview&dp=%2FTest-Page&dt=Hi%20I'm%20the%20Googlebot&dh=distilled.net&cd1*=192.168.1.1&cs=googlebot
Get to grips with jQuery
So much easier than just JavaScript
For example, Optimizely tests are built from jQuery
This is a live test on the Distilled website
Luckily, learning is easier than ever
Debugging
Chrome ships with some powerful debugging tools
CTRL+SHIFT+J
But alert() and console.log() are your friends
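A quick sketch of the console.log() habit (the helper function is hypothetical, not from the talk):

```javascript
// Sketch: trace intermediate values with console.log() while debugging
function slugify(title) {
  var slug = title.toLowerCase().replace(/[^a-z0-9]+/g, '-');
  console.log('slugify:', title, '->', slug); // prints the intermediate value
  return slug;
}

slugify('Modern Technical SEO');
```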
Right-click → Inspect Element
Highlight active DIVs and test changes immediately
You can even edit pages here to mock things up
Obviously there are more useful things to do with this super power
AJAX and PushState
These buttons switch stories
URL changed without the page reloading
Content changed via AJAX
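A sketch of the pattern (helper names are hypothetical, and the browser-only API is guarded so the shape is clear outside a browser):

```javascript
// Sketch: swap content via AJAX, then update the URL with pushState
function storyUrl(id) {
  return '/story/' + id; // a real URL the server could also render directly
}

function showStory(id) {
  // 1. Fetch the new story via AJAX (XMLHttpRequest or $.get in practice)
  // 2. Swap the returned HTML into the page without a full reload
  // 3. Update the address bar so the state has its own crawlable URL
  if (typeof history !== 'undefined' && history.pushState) {
    history.pushState({ story: id }, '', storyUrl(id));
  }
}
```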
It can be hard for JS and server to work together
Node.js on the server?
Why?
You’re going to recommend it or encounter it in a site audit
And, incidentally, you should – it’s great to
separate content from presentation
How do you audit AJAX?
URLs load content
Spot-check with a browser + disabled js. Test with a crawler
Links are HTML links
CTRL+U is view source in Chrome on Windows – learn your shortcuts
…and the href is the same as the PushState
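The pattern to look for in the source: the anchor’s href is a real URL the server can render on its own, while a click handler intercepts it to load the same content via AJAX and pushState. A hypothetical sketch:

```html
<!-- href matches the URL that pushState will set -->
<a href="/story/1682002" class="js-story-link">Next story</a>
```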
For an idea of the complexity…
Here’s an example of a beautiful site
Fast CoEXIST
That redirects to a mobile site
http://www.fastcoexist.com/1682002/this-24-year-old-entrepreneur-raised-300000-by-wearing-dad-s-wool-shirt-for-100-days
→ http://m.fastcoexist.com/?m=fastcoexist/node/1682002&url=http://www.fastcoexist.com/1682002/this-24-year-old-entrepreneur-raised-300000-by-wearing-dad-s-wool-shirt-for-100-days#1
That loads content via
AJAX
Empty HTML
Infinite Scroll
This is what a mashable page looks like when you
load it
When you keep scrolling more loads under your mouse
Why?
You’re going to encounter it in a site audit
How do you audit infinite scroll?
Can you get to all these links without scroll-loading?
Make sure there is a (traditionally) crawlable navigation (Tips here)
Can you see all the important content without the scroll-loading?
Or at least check that it’s getting indexed
The state of JavaScript indexation
We know that FB comments can be indexed
See, for example this page whose JS comments are indexed
Is it worth it though?
TechCrunch admits that using Facebook comments drove away most of their commenters
-- techdirt [original TC article]
We have seen a few mis-steps from FB on the comment front
This gives Google …the ability to read comments in AJAX or
JavaScript, such as Facebook comments or Disqus comments
-- SearchEngineLand [emphasis mine]
Much of the coverage was similar to this
Disqus is growing fast by the way
Sites using Disqus vs. FB comments – data from BuiltWith
Disqus comments can be indexed
Disqus via API in source code indexed
But the JS version can’t
Has anyone seen it work anywhere?
Disqus via JavaScript not indexed
We have seen aggressive crawling of things that look like URLs
<a href="#" onclick="redirect(this);return false;" redir-to="$$$start$$$www/ratedpeople/com???find???trade???greater-london???little-ilford" rel=nofollow f="Brenda" l="Manor">www.ratedpeople.com</a>
function redirect(elem) {
  url = $(elem).attr('redir-to')
    .replace(/\//g, '.')
    .replace(/\?\?\?/g, '/')
    .replace('$$$start$$$', 'http://');
  document.location.href = url;
}
Worth reading the original announcement from Google – especially GET vs POST
It doesn’t look like arbitrary JavaScript execution
I would always specify that all content and all links can be found without JavaScript
When you’re specifying something, you can be as prescriptive as you like
When you are auditing, you should be more cautious of the cost of changes
If an audit shows content being pulled in via JS but it’s getting indexed, I’d leave well enough alone
If links are only accessible via JS, I would suggest
fixing that even if pages are being discovered
Go write technical recommendations like you’re presenting to this guy
Image credits:
http://kilocopter.deviantart.com/art/Birthday-Unicorns-166578859
http://www.flickr.com/photos/dachis/5536760790/
http://www.flickr.com/photos/matthigh/3687338082/
http://www.flickr.com/photos/dudaphoto/5582847355/
http://www.flickr.com/photos/jurvetson/5129303018/