Abstract & Bib

W. Yih, J. Goodman, V. Carvalho

Finding Advertising Keywords on Web Pages

WWW-06

A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and this is a substantial source of revenue supporting the web today. Despite the importance of this area, little formal, published research exists. We describe a system that learns how to extract keywords from web pages for advertisement targeting. The system uses a number of features, such as term frequency of each potential keyword, inverse document frequency, presence in meta-data, and how often the term occurs in search query logs. The system is trained with a set of example pages that have been hand-labeled with "relevant" keywords. Based on this training, it can then extract new keywords from previously unseen pages. Accuracy is substantially better than several baseline systems.
 
@InProceedings{YihGoCa06,
 author = {W. Yih and J. Goodman and V. Carvalho},
 title = {Finding Advertising Keywords on Web Pages},
 booktitle = {Proceedings of the 15th World Wide Web Conference},
 year = {2006}
}

[Home] [Education] [Experience] [Publications] [Presentations] [Demos] [Services] [Links]