ASP.NET: REGEX Parse the RSS / ATOM Feed Url from a Page
Posted by: Peter Brombergs UnBlog,
on 18 Aug 2007 |
View original | Bookmarked: 0 time(s)
I've been scraping again, I confess. Just can't resist it. One of the things I've run into when grabbing a bunch of web pages in a threadpool callback is how to determine if the page sports the autodiscovery tags (e.g. there is a feed for the site). Here is one way to do this with a little bit of REGEX: using System.Text.RegularExpressions; namespace WebLogsSearcher{ public static class Matcher { public static...