DISQUS

DISQUS Hello! benjamingolub.com is using DISQUS, a powerful comment system, to manage its comments. Learn more.

Community Page

Jump to original thread »
Author

URL Canonicalization: Stop the Dupes!

Started by Benjamin Golub · 1 year ago

Aggregation is the name of the game these days and a big problem for sites like RSSmeme and ReadBurner is dealing with duplicates.  How do you know for sure that you have all the shares for a given URL?  What about services like FeedBurner or TinyURL which use redirects to get you where you want […] ... Continue reading »

8 comments

  • Thanks for the tip...I just made the switch to gReader Comments.
  • No problem; glad I could help!
  • I beg to differ that FriendFeed doesn't need to bother with it. Well, OK, they don't need to, but it would make the service that much more convenient for users, if identical items were combined and the discussion merged... Just beating the dead horse. :-)
  • I have to disagree; FriendFeed is about discussion within (and extending slightly out of) your group of friends. I'm glad they don't merge everything together but it would be nice to see what other people have to say about a story.
  • I remember when looking at google shared feeds, is that 95% of the items have the actual *final* url of the story, under original-id tag. Do you not use that at all in aggregation efforts?
  • FeedBurner sometimes gives this out; most of the time it doesn't though. It looked to me like maybe a setting in FeedBurner can enable this? I use it if it's available; if not I have no other choice but do follow the redirect to find it.
  • Well, I've been bitten by this too. However, since my dataset was small and performance wasn't an issue, I tried HEADing every url, with the result that quite a few sites don't respond very well to HEADs :(
  • There is also a way to sort the problem by adding a snippet of code into you .htaccess fil, but I didn't know about these features...

    Thanks for the links and keep up the good articles, this is the 3rd article ive read now and I've learnt something new from all of them =D

    Jordan

Add New Comment

Returning? Login