A common question is: what is the difference between search crawling and indexing? Most of us studied this at university, but we tend to study just to pass exams, and these seemed like boring topics.
So here is a brief summary of the differences between the two:
Crawling
Crawling means pulling content from your search sources. It is a dumb process in which the crawling engine pulls data and documents from all specified sources and caches the results for delivery to another process, normally the indexing engine. On its own it does not interpret anything. It behaves much like a stress-testing engine: it generates network traffic and consumes the memory and resources of the machine. For that reason, in WAN or high-performance scenarios it is advisable to run a dedicated crawling server, separate from your normal web and indexing servers.
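To make the "pull and cache" idea concrete, here is a minimal sketch in Python. It is only an illustration, not how any particular search product works: the names crawl, LinkExtractor and FAKE_SITE are made up for this example, and a small in-memory dictionary stands in for the real network fetch.

```python
from collections import deque
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seeds, fetch, max_pages=100):
    """Pull pages starting from the seeds and return a cache of url -> raw content.

    `fetch` is any callable that returns the page body for a URL, or None
    if the page is unavailable. No interpretation of the content happens
    here beyond finding more links to follow.
    """
    cache = {}
    queue = deque(seeds)
    while queue and len(cache) < max_pages:
        url = queue.popleft()
        if url in cache:
            continue  # already pulled
        body = fetch(url)
        if body is None:
            continue  # source not reachable, skip it
        cache[url] = body
        parser = LinkExtractor()
        parser.feed(body)
        queue.extend(link for link in parser.links if link not in cache)
    return cache


# Hypothetical stand-in for a real site, so the sketch runs without a network.
FAKE_SITE = {
    "/home": '<a href="/about">About</a> <a href="/blog">Blog</a>',
    "/about": '<a href="/home">Home</a>',
    "/blog": "No links here",
}

pages = crawl(["/home"], FAKE_SITE.get)
```

The result, pages, is just a cache of raw content waiting to be handed to the indexing engine. In a real deployment the fetch callable would make HTTP requests, which is where the network and memory load described above comes from.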
Indexing
Indexing is the key to searching. It means making sense of the retrieved content and storing the processing results in a (more or less complex) document index. Link analysis, which is often performed at this stage, is a way to measure the importance, popularity, trustworthiness and so on of a URI.
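As a rough illustration of what a very simple document index can look like, the sketch below builds an inverted index (a map from each word to the documents containing it) and answers queries from it. The function names build_index and search and the sample documents are assumptions made for this example. Real indexing engines add much more, such as stemming, ranking and link analysis.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map every term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every term in the query."""
    terms = tokenize(query)
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


# Hypothetical crawled content, keyed by URL.
documents = {
    "/home": "Welcome to the search demo",
    "/about": "About crawling and indexing",
    "/blog": "Indexing is the key to search",
}

index = build_index(documents)
hits = search(index, "indexing search")
```

Here hits contains only "/blog", because it is the only document with both words. The point is that a query is answered from the index alone, without touching the crawled sources again.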
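HTML elements are available that direct a crawling or indexing engine not to index a page or not to follow its links. These are commonly used by websites. They are set in the robots meta element, for example <meta name="robots" content="noindex"> or content="nofollow". To keep a crawler away from a path entirely, sites use a Disallow rule in robots.txt rather than a meta value.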
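A crawler or indexer that wants to honor these hints has to read them from the page first. The following is a minimal sketch of that step using Python's standard html.parser module. The names RobotsMetaParser and robots_directives are made up for this example, and it only looks at the robots meta element with values such as noindex and nofollow.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives listed in <meta name="robots" content="...">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # The content value is a comma-separated list of directives.
            self.directives.update(
                part.strip().lower() for part in content.split(",") if part.strip()
            )


def robots_directives(html):
    """Return the set of robots meta directives found in an HTML page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
directives = robots_directives(page)
may_index = "noindex" not in directives
```

For this sample page, directives holds noindex and nofollow, so may_index is False and the page would be left out of the index.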
I hope this helps clear things up for everyone.