Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

more search engines treated as external sites #2586

Closed
anonymous-matomo-user opened this issue Jul 23, 2011 · 17 comments
Closed

more search engines treated as external sites #2586

anonymous-matomo-user opened this issue Jul 23, 2011 · 17 comments
Labels
Bug For errors / faults / flaws / inconsistencies etc.
Milestone

Comments

@anonymous-matomo-user
Copy link

Search engines wrongly considered external websites:

www.google.it/imgres?q=beato+vincenzo+romano&rlz=1W1ADSA_it&biw=1280&bih=556&tbm=isch&tbnid=GoEpBEHlKndZFM:&imgrefurl=h t t p://it.cathopedia.org/wiki/Beato_Vincenzo_Romano&docid=lwRZRPTQtuqNeM&w=200&h=204&ei=1_YqTsquFdOt8QPT7KGUDA&zoom=1&iact=rc&dur=78&page=1&tbnh=137&tbnw=127&start=0 (please remove spaces, I needed to insert them in order to avoid the bug be treated as spam)

search.findeer.com/cafarnaum looks for cafarnaum

http://scour.com/search/web/cafarnaum looks for cafarnaum

www.google.com/imgres?imgurl=http://it.cathopedia.org/w/images/b/ba/Zeffirelli_Ges%C3%B9_Nazareth_002.jpg&imgrefurl=h t t p://it.cathopedia.org/wiki/Ges%25C3%25B9_di_Nazareth_%28sceneggiato_televisivo%29&usg=__2nXAIxZzstkylCKLMYT6zyOU2BQ=&h=261&w=193&sz=12&hl=it&start=28&zoom=1&tbnid=BQIQJYLidRDj1M:&tbnh=127&tbnw=92&ei=YGgbTuaULdPA8QPBt_kW&itbs=1&iact=hc&vpx=746&vpy=90&dur=289&hovh=208&hovw=154&tx=114&ty=106&page=2&ndsp=29&ved=1t:429,r:4,s:28&biw=1280&bih=713 is a result of a google images search (please remove spaces, I needed to insert them in order to avoid the bug be treated as spam)

www.google.ch/imgres?q=gerico+giordania+il+paese&hl=it&safe=off&sa=G&biw=1024&bih=459&gbv=2&tbm=isch&tbnid=1RbnTzuwQ5xLdM:&imgrefurl=http://it.cathopedia.org/wiki/Gerico&docid=7h1k1qwY74vIWM&w=300&h=225&ei=QGApTtGcA8fEsgaFnKHnCw&zoom=1&iact=rc&dur=78&page=10&tbnh=135&tbnw=179&start=77&ndsp=9&ved=1t:429,r:3,s:77&tx=106&ty=71

www.google.it./m?q=search

www.url.org/?g=search&ft=2&f=1&l=it&a=1&r=dizionario+lingua+italiana+zanichelli&s=go&c=001&t=001&d=40737&q=dizionario+zanichelli&qs=presbitero

buscador.terra.cl/Results.aspx?source=Search the url doesn't have the search term, which is supposedly passed as a post type one

eu.ixquick.com like the preceeding

int.search-results.com/web?qsrc=1&o=16537&l=dis&atb=sysid%3D1%3Aappid%3D393%3Auid%3Dd15f554585901fc7%3Auc%3D1309549109%3Asrc%3Dhmp%3Ao%3D16537%3Aq%3Dbibbia%2520preghiera%2520padre%2520nostro&q=bibbia+cattolica+preghiera+padre+nostro

mundo.busca.uol.com.br/buscar.html?q=teologia+della+pazienza&ad=on

search.juno.com/search?action=search&source=nextpage&query=Nostra+Signora+di+Lourdes%2C+Tor+Marancia%2C+Roma&start=30&adpage=4&orig_source=startpage-free-c

searchservices.verizon.com/search/ws.portal?_nfpb=true&_pageLabel=google_results&q=Storie+di+Locri+E+Gerace&channel=MyVrzn&clientid=Cnsmr&PAGING_QUERY=true&start=10&pagenum=2

start.flashvideodownloader.org/result.php?q=congregazion+eper+il+clero+email%3A+&ie=UTF-8&cx=partner-pub-5087362176467115:lyglkqaff6i&cof=FORID:10&sa=Search#1311442684842&nq=congregazion%20eper%20il%20clero%20email%3A

www.metacrawler.com/info.metac.test.b8/search/web?fcoid=417&fcop=topnav&fpid=27&q=chiesa+catolica+in+egitto

www.webmii.es/Result.aspx?f=Claudio&l=Doglio&r=es

www.yatedo.fr/search/profil?q=Eucologia&btn_s=&c=all

@mattab
Copy link
Member

mattab commented Sep 16, 2011

Thanks for the suggestion, but would it be possible for you to propose the patch to track these search engines as explained in: http://piwik.org/faq/general/#faq_39
this would greatly help :) thanks!

@anonymous-matomo-user
Copy link
Author

I've sent some of the urls via email as in http://piwik.org/faq/general/#faq_39

Others url are supposingly needing two parameters. How do you manage them?

@sgiehl
Copy link
Member

sgiehl commented Sep 18, 2011

(In [5180]) refs #2586 added search.juno.com to google powered search engines

@sgiehl
Copy link
Member

sgiehl commented Sep 18, 2011

(In [5181]) refs #2586 added search engine www.url.org

@sgiehl
Copy link
Member

sgiehl commented Sep 18, 2011

(In [5182]) refs #2586 added searchresults.verizon.com to google powered search engines

@sgiehl
Copy link
Member

sgiehl commented Sep 18, 2011

(In [5183]) refs #2586 added search engine scour.com

@mattab
Copy link
Member

mattab commented Nov 25, 2011

Please submit the patch for the remaining missing search engines (as per faq) thanks for your feedback!

@anonymous-matomo-user
Copy link
Author

"www.google.it" => array( "google", "q", "m?q={k}"),

"mundo.busca.uol.com.br" => array( "mundo.busca.uol", "q", "buscar.html?q={k}"),

"www.yatedo.fr" => array( "yatedo", "q", "search/profil?q={k}"),

"www.metacrawler.com" => array( "metacrawler", "q", "info.metac.test.b8/search/web?q={k}"),

"int.search-results.com" => array( "search-result", "q", "web?q={k}"),

@anonymous-matomo-user
Copy link
Author

can be reopened

@sgiehl
Copy link
Member

sgiehl commented Jan 4, 2012

Ok. Let's have a look at the suggested entries:

  • google.it should already be discovered by the exisiting rules

@sgiehl
Copy link
Member

sgiehl commented Jan 4, 2012

(In [5649]) refs #2586 fixed detection of infospace powered search engines like metacrrawler

@sgiehl
Copy link
Member

sgiehl commented Jan 4, 2012

(In [5650]) refs #2586 added search engine busco.uol.com.br

@sgiehl
Copy link
Member

sgiehl commented Jan 4, 2012

(In [5651]) refs #2586 added missing int.search-results.com

@sgiehl
Copy link
Member

sgiehl commented Jan 4, 2012

(In [5652]) refs #2586 added yatedo.com / yatedo.fr

@robocoder
Copy link
Contributor

(In [5655]) refs #2586 - fix build

@mattab
Copy link
Member

mattab commented Jan 8, 2012

Thanks paolobenve & Steve, always good to keep this list up to date, it makes Piwik look sharper!

@sgiehl
Copy link
Member

sgiehl commented Jan 9, 2012

I'm closing this ticket now. The remaning urls aren't search engines, or we aren't able to determine the keyword.

@anonymous-matomo-user anonymous-matomo-user added this to the 1.7 Piwik 1.7 milestone Jul 8, 2014
This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug For errors / faults / flaws / inconsistencies etc.
Projects
None yet
Development

No branches or pull requests

4 participants