Text this: Graph-theoretic techniques for web content mining