[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Google and limits of search engines

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 33
Thread images: 3

File: searchengine.jpg (31KB, 710x710px) Image search: [Google]
searchengine.jpg
31KB, 710x710px
Hey /g/eniuses,

I'm wondering if there is a 'better' search engine than Google, in terms of breadth and depth of information accessed. There are times often enough when I am searching for a particular phrase, maybe lyrics to an obscure song, or a person's name when Google just doesn't come up with what I am looking for.

It's possible that the people I am looking for don't have an internet presence or the particular phrase isn't transcribed anywhere, but I figure a search engine as powerful as google should at least be able to millions of things I am very closely looking for but not exactly. (Yes, I have tried quotes, using key phrases, omitting others.) It just seems weird that for being the behemoth that it is, the search engine is strangely limited in some ways.

Anyway, I'm wondering if you all have recommendations for better search engines, or can explain to me why Google isn't somehow artificially limiting search results for no reason, or what resources you all use when trying to find difficult and obscure information.

Thanks!
>>
File: 1280px-Yandex_logo_ru.svg.png (36KB, 1280x509px) Image search: [Google]
1280px-Yandex_logo_ru.svg.png
36KB, 1280x509px
>>59997440
Yandex
>>
>>59998112
>Non-US company
>not respect your freedom
>ugly interface
fuck off mongoloid
>>
>>59998382
>wanting US companies
>thinking US companies give a shit about your freedom
You're an absolute retard.
>>
>>59997440
DuckDuckGo.
>>59998403
Yandex spy on their users.
>>
>>59998418
>Google doesn't
>>
>>59998418
DuckDuckGo sells your data too.
>>
>>59998403
Still better than your shithole coutry with noname companies cucked by your governmant.
>>
>>59997440
2006 google was infinitely better. Google is so obsessed with """"helping"""" you search that when you look for "how to close" it instead searches for "how to open" and when you look for "buy new windows" it looks for "buy osx" when you're actually looking for new windows. Nevermind their insistence on providing politically motivated results and censoring what they don't like, which further mitigates the quality of results they provide.

Bing is significantly worse because while they try to apply the same tricks, M$ uses pure rule-based systems when google uses a mix of rule-based and ML.

Other search engines are significantly worse because they take results from bing and google and mix them together which fucks everything up with almost only purely incorrect results.

Basically any search engine that stops being fancy will automatically be orders of magnitudes better than google. Just index pages in the classical way and crawl in the usual way using a usual indexing scheme and you have successfully overtaken google.
>>
>>59998432
[citation needed]
>>
>>59997440
Red pill is I go to your mum's house and browse there with her accounts while she sucking me off.
>>
Related question, since when does putting the search term under quotation marks stopped working, for looking up exact phrases? For example, the other day I wanted to look up any mentions of "e=" and "religion" together, but Google flat out ignores the "e=" bit.
>>
>>59998747
He only uses Yandex because of the beautiful women.
>>
>>59998112
TITS OR GTFO
>>
>>59998112
CYKA
>>
honestly, it would be cool to have a search engine that ignored results from social media, that includes news sites, wikipedia and blogs (whatever hosted in blogspot wordpress tumblr).
>>
>>59998879
kek
>>
>>59999063
(((tits)))
>>
>>59997440
startpage
>>
actually, why aren't there any projects trying to build distributed search engines, dns server, etc without any regulations ?
>>
>>60000362
There have been, but they're blacklisted by the media just like tor and i2p. Governments pay the media to shill against anything that isn't regulated.
>>
>>60000859
really ? can you name some ?
>>
>>59999171
[searchterm] -blogspot -facebook -wsj - wikipedia -tumblr

???
>>
>You are so unsecure about your privacy that you only use things that are completly safe
Oh man, are you guys doing something illegal or why do you care so much about this shit? The government will give a shit about what you do then.
>>
File: hack.png (6KB, 40x40px) Image search: [Google]
hack.png
6KB, 40x40px
>>59997440
https://www.wolframalpha.com/

Also give us an example of obscure search that Google fails to deliver and maybe I can teach you a trick or two.

I'm a search sniper by the way.
>>
>>59998817
>Basically any search engine that stops being fancy will automatically be orders of magnitudes better than google. Just index pages in the classical way and crawl in the usual way using a usual indexing scheme and you have successfully overtaken google.
You don't know what you're talking about. A dumb index search will just give you billions of Viagra website hits. The Internet is cock full of low quality crap filled with indexable keywords and SEO tecniques.
>>
>>59997440
You know how a lot of people in CS do projects to learn CS like develop their own OS or compiler? I've given some thought to doing something like this and it might be a more practical project to spend time on learning more about depths of CS. What do you guys think, working on a search engine project be a good idea and better use of time than something like a hobby OS or compiler?
I mean if you where trying to work through textbooks learning about data mining, ML, etc wouldn't this be a good hobby project to apply what you're learning?

>>60003253
This is true. But wouldn't it be better to apply more than just classical algorithms to this, like implement some of the best research and techniques from the fields of ML, AI, etc?

Also I have a question about the legality of this. Do most sites consider it a breach of TOS and is crawling and storing their sites in your database as illegal as pirating/copyright infringement?
>>
>>60002525
teach us some cool tricks
>>
>>60004734
Check this out:

https://www.exploit-db.com/google-hacking-database/
>>
>>60005688
Do you like the book by Johnny Long?
>>
>>60005721
I don't read books. I downloaded 10 GB of videos about this subject tho. They teach similar things.
>>
>>60003253
How to spot a retard.jpg
>>
Everyone complaining about who is stealing the most data: please use searx and gtfo
Thread posts: 33
Thread images: 3


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.