Unless you have been living under a rock, you have probably heard the basics of how the Google search engine works. When you search for anything on Google, it takes your search query, broken down into keywords, and uses it to find matching content across the web.
In the blink of an eye, tons of results come up, filtered by 200+ ranking factors. The SERP (Search Engine Results Page) lists these results in order from most relevant to least relevant.
To put things simply:
The first thing to know is whether a website is Public, Hidden, or Private.
What does Public, Private or Hidden mean?
If a website is Public: This means the site can be indexed by search engines and is visible to all.
If a website is Hidden: This means the site cannot be indexed by search engines but is visible to all.
If a website is Private: This means the site cannot be indexed by search engines and is visible only to you and your approved users.
If a website is publicly accessible, its pages can be crawled and indexed by search engines around the globe. There are 10+ search engines besides Google that perform well, but Google drives the largest share of search traffic across countries.
How do search engines crawl a web page?
There are several ways the Googlebot user agent crawls your website. Crawling is the process through which bots (Google bots, also called spiders) visit each web page of a site, reading its content (text, media, documents, etc.) and following the links they find.
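To give a feel for the mechanics, here is a minimal, hypothetical crawler sketch in Python using only the standard library. The start URL is a placeholder, and a real crawler such as Googlebot is vastly more sophisticated (politeness rules, scheduling, deduplication, JavaScript rendering and so on).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

class LinkExtractor(HTMLParser):
    """Collects the href targets of every <a> tag on a page."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

def crawl(start_url, max_pages=10):
    """Breadth-first crawl: fetch a page, extract its links, queue them."""
    seen, queue, pages = set(), [start_url], {}
    while queue and len(pages) < max_pages:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        try:
            html = urlopen(url, timeout=5).read().decode("utf-8", errors="ignore")
        except Exception:
            continue  # skip pages that fail to load
        parser = LinkExtractor()
        parser.feed(html)
        pages[url] = html
        # Resolve relative links against the current page and queue them
        queue.extend(urljoin(url, link) for link in parser.links)
    return pages

# "https://example.com" is only a placeholder start URL
pages = crawl("https://example.com")
print(f"Fetched {len(pages)} page(s)")
```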
We can also specify whether we want search engines to crawl a particular page of our website or not. This is easily done in the robots.txt file, as sketched below.
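For instance, Python's built-in urllib.robotparser can read a site's robots.txt and report whether a given bot is allowed to fetch a URL; the URLs below are placeholders.

```python
from urllib.robotparser import RobotFileParser

# Placeholder robots.txt location; a real site serves it at /robots.txt
rp = RobotFileParser("https://example.com/robots.txt")
rp.read()

# Ask whether Googlebot may fetch a specific (hypothetical) page
print(rp.can_fetch("Googlebot", "https://example.com/private/page.html"))
```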
Furthermore, we can list our pages in a sitemap.xml file for indexing purposes. A sitemap tells search engines the hierarchy of our website, so these intelligent answer machines can easily discover the worthwhile pages and analyse them.
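As a rough illustration, a minimal sitemap.xml can be generated with Python's standard xml library; the URLs are placeholders, and real sitemaps usually also carry optional tags such as <lastmod>.

```python
import xml.etree.ElementTree as ET

# Placeholder URLs standing in for a site's real pages
urls = ["https://example.com/", "https://example.com/about", "https://example.com/blog"]

# The sitemap protocol wraps <url><loc>...</loc></url> entries in a <urlset>
urlset = ET.Element("urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9")
for u in urls:
    url_el = ET.SubElement(urlset, "url")
    ET.SubElement(url_el, "loc").text = u

ET.ElementTree(urlset).write("sitemap.xml", encoding="utf-8", xml_declaration=True)
```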

What is the Google search engine index, and what is indexing?
Before we go ahead with indexing, we need to understand what a search engine index is. A search engine index is best understood through the idea of an inverted index: a data structure that maps each term to the documents in which it appears, so content can be looked up by term rather than scanned page by page.
When a website is indexed, its text is first tokenised. Tokenisation is a process search engines use to break content down into meaningful, concise units (tokens) so it can be processed much faster.
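Here is a toy sketch of that idea in Python (not Google's actual implementation); the documents are made up for illustration.

```python
from collections import defaultdict

# Made-up documents standing in for crawled web pages
docs = {
    "page1": "search engines crawl and index the web",
    "page2": "an inverted index maps words to pages",
    "page3": "crawl the web then index the pages",
}

def tokenise(text):
    """Very naive tokenisation: lowercase and split on whitespace."""
    return text.lower().split()

# Map each token to the set of documents containing it
inverted_index = defaultdict(set)
for doc_id, text in docs.items():
    for token in tokenise(text):
        inverted_index[token].add(doc_id)

print(sorted(inverted_index["crawl"]))  # ['page1', 'page3']
```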
Indexing can be done through various methods. There are different webmaster tools where you can easily submit individual web pages or a complete website for indexing, and there are guidelines associated with each. We will be talking about these steps in our next blog.
Whenever a user searches a specific query on the internet, Google checks its index database rather than scanning the live web. Currently Google uses Caffeine, its web indexing system, which is one of the largest indexing systems in the world.
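Continuing the toy inverted index sketched above, answering a multi-word query can be pictured as intersecting the posting sets of each token, which is far faster than scanning every page:

```python
def lookup(query, inverted_index):
    """Return the documents that contain every token in the query."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    results = inverted_index.get(tokens[0], set()).copy()
    for token in tokens[1:]:
        results &= inverted_index.get(token, set())
    return results

# Using the toy index built earlier
print(lookup("inverted index", inverted_index))  # {'page2'}
```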
Once it retrieves the matching web pages, ranking algorithms order them on the SERP so that the most relevant results appear first. One input to this process is PageRank (PR), an algorithm Google Search uses as a signal when ordering results from most relevant to least relevant.
What is PageRank (PR)?
Google Search is a giant answering machine, and behind it a great number of algorithms work together to set its rankings. There are 200+ ranking factors that determine which site comes up for a particular search query. PageRank itself scores a page based on the number and importance of the other pages that link to it.
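At its core, PageRank can be sketched as a simple iterative calculation over a link graph. The version below is a simplified power-iteration toy on a made-up three-page graph, not Google's production system:

```python
# Made-up link graph: each page lists the pages it links to
links = {
    "page1": ["page2", "page3"],
    "page2": ["page3"],
    "page3": ["page1"],
}

def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank: each page shares its score among the pages it links to."""
    n = len(links)
    ranks = {page: 1.0 / n for page in links}
    for _ in range(iterations):
        new_ranks = {page: (1.0 - damping) / n for page in links}
        for page, outgoing in links.items():
            if not outgoing:
                continue  # ignore dangling pages in this toy version
            share = damping * ranks[page] / len(outgoing)
            for target in outgoing:
                new_ranks[target] += share
        ranks = new_ranks
    return ranks

for page, score in sorted(pagerank(links).items(), key=lambda kv: -kv[1]):
    print(f"{page}: {score:.3f}")
```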
We will be defining these ranking factors in the upcoming blog. Stay tuned!