Getting Bulk Data Through Google: An empirical study

No Thumbnail Available

Date

2016-10

Journal Title

Journal ISSN

Volume Title

Publisher

Chitkara University Publications

Abstract

To store the information in a database is one of the major tasks. The efficient storage of data is important for future use. Information retrieval is a method of gathering information related to input queries from the various sources or stored databases. To retrieve the information, a search engine plays an important role. A web search engine creates an index to match queries. The quality of information is improved with the help of search engine. For retrieving the information, a search engine comprises some modules such as query processor, a searching and matching function, document processor and page rank capability. This paper focuses on the retrieval of web documents against input queries and stores them in to database. A Google search API can be used to fetch the results. It analyses the data by processing through these modules and downloads the content available in different formats.

Description

Keywords

Web crawling, indexing, page ranking, retrieve pdf documents, query processing, search engine implementation, web search

Citation