bloom america
Chapter 284: Google it
A search engine: is there anything worse than this?
Most of the websites that Catherine will frequently visit in the future will be known through "Download". Catherine feels that it is necessary for her to create a search engine...
If you trace it back, the "history" of search engines is much shorter than that of other websites, so the significance of their existence has been diluted.
But that will certainly not be the case in the future.
Not to mention that she wants to separate these annoying people on the Internet. On the other hand, because the cooperation with the "Los Angeles Times" is about to begin, comprehensive websites are bound to appear, so search engines will become necessary.
"Maybe this is a good idea..." Catherine held her chin and thought.
Elsa looked at the time and saw that it was almost afternoon, so she went to make black tea.
"What's the idea?"
Elsa asked while holding tea cakes.
"A search engine is a good thing that can let us know about various websites."
"Can we search the web pages we want?"
"Right, that is it."
"Can this be done?"
"Certainly..."
Although the answer was affirmative, Catherine's final tone became a little strange.
Search engines rely on "web spiders", that is, programs that crawl the web.
To be more precise, web spiders find web pages through the link addresses inside web pages. Starting from the home page of a website, they read the content of the page, find the other link addresses in it, and then use those link addresses to find the next pages. This cycle continues until all the web pages of the website have been crawled. If the entire Internet is regarded as one website, then the web spider can use this principle to crawl all the web pages on the Internet. In short, the web spider is a crawler, a program that crawls web pages.
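The crawl loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the program Catherine's team would write: the in-memory `SITE` dictionary and the `crawl` function are hypothetical stand-ins, and a real spider would download each page and extract the links from its content instead of looking them up in a table.

```python
from collections import deque

# Hypothetical in-memory "website": each page maps to the links found on it.
# A real spider would fetch each page and parse the links out of its content.
SITE = {
    "/": ["/news", "/about"],
    "/news": ["/news/1", "/"],
    "/news/1": ["/news"],
    "/about": [],
}


def crawl(start, get_links):
    """Visit every page reachable from `start`, following links breadth-first."""
    seen = {start}          # pages already discovered, so none is crawled twice
    queue = deque([start])  # pages discovered but not yet read
    order = []              # pages in the order they were crawled
    while queue:
        page = queue.popleft()
        order.append(page)
        for link in get_links(page):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return order


pages = crawl("/", lambda page: SITE.get(page, []))
print(pages)
```

The `seen` set is what stops the cycle from running forever when pages link back to each other, as "/news" and "/" do here.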
Future websites such as "Baidu" and "Google" will be built on such a foundation.
But Catherine suddenly thought that she didn't seem to understand the "web spider" at all. Although she knew the principle, it seemed to be a bit troublesome to figure it out...
"It seems that we need to set up a working group."
She stood up with her arms folded.
"I think our company's talent pool is already tight enough." Elsa placed black tea and tea cakes on Catherine's table.
"It doesn't matter. By May, this situation will be alleviated." The first batch of students trained through Intel's cooperation with Stanford University are about to graduate. With their participation, the company's talent shortage will definitely be relieved to a certain extent.
"At least in the next ten years, our company's talent is likely to be in short supply. This is a rapidly expanding industry, which is different from those traditional industries."
Even in the 21st century, the thirst for talent in these industries is still strong.
Of course, China is the exception, because there are so many people there that they even have the term "IT migrant workers".
Catherine sat down and took a sip of black tea.
While she was drinking tea, she was thinking about how to write a web spider program.
There are three criteria for evaluating the quality of a web spider. The first is coverage. The primary goal of a web spider is to crawl the information needed on the Internet, so whether all valuable information is included, and what proportion of it is included, is the basic evaluation indicator for a web spider. The second is timeliness: after an event occurs and spreads on the Internet (in forms such as news, forums, blogs, etc.), users need to be able to retrieve the corresponding content through search engines as soon as possible. The premise of indexing is inclusion, so web spiders are required to crawl the latest resources on the Internet as quickly as possible. The last is the duplication rate. There is a lot of duplicate content on the Internet, and detecting and eliminating duplicate pages as early as possible is a problem that web spiders need to solve. Besides the duplication caused by reprinting, duplication also shows up in various patterns, such as site-level duplication, directory-level duplication, CGI-level duplication, parameter-level duplication, etc. Discovering and handling these patterns early saves system resources for storage, crawling, database building, and display.
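Two of the duplication patterns mentioned above, parameter-level duplication and reprinted content, can be illustrated with a small Python sketch. Everything here is an assumption made for the example: the `IGNORED_PARAMS` set, the function names, and the sample pages are hypothetical, and an exact content hash is only the simplest possible check; it would not catch reprints that differ by even one word.

```python
import hashlib
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed for illustration: query parameters that do not change the page content.
IGNORED_PARAMS = {"sessionid", "ref"}


def normalize_url(url):
    """Collapse parameter-level duplicates: drop ignored parameters, sort the rest."""
    parts = urlsplit(url)
    query = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query)
        if key not in IGNORED_PARAMS
    )
    return urlunsplit(
        (parts.scheme, parts.netloc.lower(), parts.path, urlencode(query), "")
    )


def content_fingerprint(text):
    """Hash the text after collapsing whitespace and case, to spot exact reprints."""
    normalized = " ".join(text.split()).lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def deduplicate(pages):
    """Keep the first page seen for each normalized URL and each content fingerprint."""
    seen_urls, seen_content, kept = set(), set(), []
    for url, text in pages:
        key = normalize_url(url)
        fingerprint = content_fingerprint(text)
        if key in seen_urls or fingerprint in seen_content:
            continue  # duplicate address or duplicate content: skip it
        seen_urls.add(key)
        seen_content.add(fingerprint)
        kept.append(key)
    return kept


sample_pages = [
    ("http://Example.com/a?id=1&ref=home", "Hello world"),
    ("http://example.com/a?ref=mail&id=1", "Hello world"),  # same page, other parameters
    ("http://example.com/b", "hello   WORLD"),              # reprint of the same text
    ("http://example.com/c", "Something else"),
]
kept = deduplicate(sample_pages)
print(kept)
```

Checking the address before downloading and the fingerprint after downloading is what lets duplicates be dropped early, before they cost storage or index space.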
The first problem is the easiest to solve, because with the root server of US Telecom at hand, page coverage will always be 100%.
What needs to be resolved are the second and third issues.
In fact, this is not a big problem. The principle is simple. The most important part is the need for an efficient program.
It may take a lot of time to do it herself. Catherine has always dreamed of being a hands-off shopkeeper, so how could she allow that? So it is necessary to find someone.
Finally, Catherine decided to transfer three people from Microsoft's phoenix-stargate development team to help develop this web spider program.
Anyway, the development of the star menu system is almost complete. At this time, it is not a problem to recruit some people to develop web spiders.
Web spiders are the core part of search engines. With this program, building a search engine will be very simple.
So... what should my website be called?
Baidu?
She thought of the name first.
Catherine shook her head. Rather than Baidu, Google would be more interesting. After all, the latter is a global search engine, while the former is limited to mainland China... And the more important reason is that Catherine is very dissatisfied with certain functions of Baidu, and it cannot get around the firewall.
In this regard, Google search is much more convenient: as long as you use the foreign version, "good children's movies" and the like are easy to find. Although this is a sentiment from a previous life, Catherine feels that Google is slightly better.
Finally, Catherine decided to name her website Google.
However, Catherine does not intend to be involved in various industries like the Google company in history, including mobile phones and office software.
There is already a dedicated mobile phone department, and Microsoft already has office software. What Google has to do is to play its role as a search engine.
In this regard, Catherine thinks it is a good choice to refer to Baidu.
Tieba, Zhiba, Encyclopedia, these are all necessary.
"Google Tieba...Google knows...Google Encyclopedia..." Catherine wrote down the keywords one by one in her notebook.
"Hmm... Wikipedia seems to be good too... Forget it, let's go with Google."
Catherine decided not to occupy the name of "Wiki". It seemed interesting to watch how Assange would dig out all the ugliness of these politicians.
But if she occupied the name of "Wiki" and Assange did something in the future, she would be the fish in the moat when the city gate caught fire, and she would be in trouble.
"Google? What is that?"
Elsa put the tea set away.
"The name of the website."
"Oh."
With the website, everyone will be able to find websites that suit them, and by then the flames on the Internet will probably not burn so fiercely.
"google..."
Catherine wrote the letters of Google in her notebook.
"Is this the Google you're talking about?"
"Yes, it's not just a search engine, this should be a comprehensive website... Of course, most of the content of this website is search-oriented."
Tieba, Zhiba, Encyclopedia, these are all essential.
“My Google, in addition to its search function, should also have the function of solving problems for people. For example, if people have problems, they can go to our Google website and find solutions to their problems."
"It sounds really good...is it for the sake of user dependence?"
Elsa seemed to see something.
"Yes, yes. User stickiness is very important." Catherine put on a "this pupil is teachable" expression.
"We can let users ask questions on Google. If they encounter problems that are difficult to solve, they can find solutions here... Of course, our company does not provide solutions itself, but lets netizens do it themselves, so that an interaction forms. Our Google Encyclopedia is similar to an encyclopedia. If you want to find any knowledge, just go to our Google Encyclopedia."
"Then... what is Tieba?" Elsa noticed that Catherine didn't seem to mention the function of Tieba.
"Tieba should have similar functions to forums, but the nature is a little different. Google will become a very important product for us in the future."
There is a big difference between Tieba and forums, but Catherine doesn't know how to explain it to Elsa.
"Google... the more I hear this name, the more pleasant it sounds to my ears. It's really good." Elsa thought for a moment while stroking her chin.
"Of course it is."
In addition to Google, face is also a good thing, but opening a face website... is simply impossible. Today's computers cannot perfectly convert faces into pictures, and the images would suffer huge losses. More importantly, there are no webcams at all now.
"Kate, you seem to take Google very seriously?"
"Of course, I've even thought about the advertising slogan."
"Advertising slogan?"
“Just Google it and you’ll find out.”