Subscribe: SearchTools Blog
http://www.searchtools.com/searchtools-rss.xml
Added By: Feedage Forager Feedage Grade B rated
Language: English
Tags:
enterprise search  enterprise  favorite links  google  links  new  results  search summit  search  site search  site  web 
Rate this Feed
Rate this feedRate this feedRate this feedRate this feedRate this feed
Rate this feed 1 starRate this feed 2 starRate this feed 3 starRate this feed 4 starRate this feed 5 star

Comments (0)

Feed Details and Statistics Feed Statistics
Preview: SearchTools Blog

SearchTools Blog



SearchTools Blog - LiveJournal.com



Last Build Date: Tue, 03 Jan 2012 01:55:38 GMT

 



searchtools in abeyance

Tue, 03 Jan 2012 01:55:38 GMT

Due to professional commitments and health issues, I'm not updating this blog.



Who owns Search in the Enterprise?

Thu, 13 Oct 2011 16:07:06 GMT

Sometimes it's IT, Communications, HR, everyone, or no one. None of these by themselves are sustainable for supporting information access in the long term, so we as professionals must find some way to do it better.

At Qualcomm, they have created Enterprise Centers of Excellence (CoE), including Search, Content Management, Collaboration and Desktop, combining leaders from IT, Engineering, Program Management, Finance, and more. The Search COE has succeeded in providing a central organizing point for search, muting search technology affinity wars, and increasing awareness of search capabilites leading to increased demand for specialized tools.

Mark Livingstone of Qualcomm and Miles Kehoe of New Idea Engineering will be presenting about this CoE experiences at the Enterprise Search Summit, and we'll have an open discussion at ESSF, moderated by Lynda Moulton, where experts and conference participants can share experiences, good and bad.

The Enterprise Search Summit Fall will be held November 1 - 3, 2011, in Washington DC -- I hope to learn from you there. For a $200 discount, use the code at the online registration page.
SPK11



Fall Enterprise Search Summit is coming soon

Tue, 04 Oct 2011 16:43:44 GMT

The Enterprise Search Summit will be in Washington, CD, from November 1 to 3, and it's looking good!  We're concentrating on strategies for making Enterprise Search work in the real world, with case studies of successful implementations and practical information about search-based applications and mobile search.

New to our lineup is Greg Nudelman, author of this year's best book, Designing Search: UX Strategies for eCommerce Success.  He's been involved in several mobile search interfaces, and will present Ubiquitous Enterprise Search: New Design Approaches for Mobile and Tablet -- this is going to be good!  Register for the conference before October 7 to get the early-bird discount.

BTW, I've been so busy with this conference and a large contract with a giant healthcare system's intranet search that I haven't been very responsive, and I apologize.  If you need something from me, please remind me by commenting here or sending email, don't be shy!



site search, relevance heuristics, open information extraction, rebel search, and more links

Tue, 16 Aug 2011 22:30:21 GMT

On-site search is about more than finding the right products | UTalkMarketing Good intro to ecommerce search based on significant online retail experience. tags: e-commerce site-search site search engine spellchecker user experience search log analysis The Six Commandments of Search Engine Implementation Good overview of enterprise search issues, contrasting unstructured search with RDBMS processes and discussing issues of granularity and simplicity. tags: database-search enterprise intranet search engine indexing [Heuristics for Relevance] Solr Powered ISFDB – Part #11: Using DisMax Traces the algorithmic relevance problems in search on a textual database of authors and books.  This chapter shows how to use the Solr DisMax functions to add extra relevance weight for matches on canonical names and alludes to adding weight for titles matches -- well-known heuristics for getting more useful search results. tags: relevance evaluation open-sourceReVerb - Open Information Extraction Software (open source text analytics) To extract assertions for question-answering systems, the ReVerb package implements the Open Information Extraction (Open IE) theory.  Without needing complex training examples and domain-specific words, this simply extracts relationships, such as "apple has phone".  It's open-source, along with the extracted sets from wikipedia and TReC's ClueWeb09 corpus. tags: natural language processing entity extraction open-source Rebel Search for SharePoint Competing with sharepoint search on near-real-time index updates, faceted refinements, faster crawling, default configuration is easier to maintain, more extensible in pipeline, etc. Price is about $14,000 per server, no need to buy additional SharePoint licenses. tags: enterprise search engine Windows .NET-C#-ASP facts Forrester on Enterprise Search Trends - summary by Bill Ives Talks about a 2011 analyst report on Enterprise Search engines and trends like pervasive search, search within business processes, pro-active search managers, using search to find workplace patterns, demand for transparency and user control, tags: overviews enterprise search search-based applications social-search engines Posted from Diigo. The rest of my favorite links are here.[...]



links: AT&T uses Solr, spellchecking, opensource Google Connectors, Classifying Searcher

Wed, 20 Apr 2011 22:30:24 GMT

Search as middle-ware at att.com, with Shantanu Deo - Coté's People Over Process » Informal conversation starts with AT&T's CMS and goes on to talk about using Solr for catalog search on the site.  It started as a bit of a skunkworks, and takes very little maintenance and few resources. tags: case-studies site-search database-search open-source Solr Spellchecker internals (now with tests!) « emmaespina Excellent introduction to the default Solr spelling checker processes.  The test information is particularly useful, including data sets with natural human-made spelling errors.  There's a follow-up that's also very useful. tags: spellchecker open-source testing research Using Google [Connectors] for Lucene < Real Story Group Blog Lucid Imagination is using the open source enterprise connector famework with their LucidWorks Enterprise distribution of Solr/Lucene.   tags: open-source connectors file-formats enterprise search engine search vendors Classifying Searchers - What Really Counts? - Enterprise Search Blog Thinking about relationships with search engine vendors, market niches, diverse information requirements, and many more issues in managing search.  tags: overviews search vendors enterprise search engines [...]



link: ZettaSearch - combined search and analytics (BI)

Mon, 18 Apr 2011 22:30:26 GMT

Posted from Diigo. The rest of my favorite links are here.



share this



links: Google Autocomplete & Instant, Exorbyte Long Tail of Ecommerce Search

Fri, 15 Apr 2011 22:30:19 GMT

Posted from Diigo. The rest of my favorite links are here.




last call for ESS 2011 Fall talks

Thu, 14 Apr 2011 23:36:22 GMT

This is it! Submit your ideas for presentations by the 16th -- they don't have to be polished abstracts, case studies and real-life examples are best. We particularly need more talks on open-source search, scaling way up, usability testing, and multimedia searching (images, audio, video).Enterprise Search Summit, Fall 2010: Call for PresentationsSearch technology can pay off in big ways, but only if the implementation is firmly focused on user information needs. When search crosses boundaries and provides access to relevant information regardless of domain, it helps the enterprise network become more than the sum of its parts, but getting to that level requires a significant investment. Specific solutions can pioneer the way, solving high profile problems with scalable tools, and then incrementally expanding to make additional silos searchable. Whether applied in an international corporation, government agency, or complex web catalog, search technology is just a means to the true goal of supporting actions and decisions.The Enterprise Search Summit is a forum to share lessons learned from successes, but also from failures; to recognize approaches which return good value for the investment of money and resources; and to evaluate promising new technologies. Presenters in this conference are implementing enterprise-level search in corporations, governmental departments, public catalogs, and even e-commerce sites.Pre-conference workshops will offer an introduction to all aspects of enterprise search, including the indexing pipeline, query processing, recall, precision and relevance, user interface, scaling both content and traffic, federated search, analytics and more.To Submit a PresentationEnterprise Search Summit, Fall 2011 invites search engine practitioner of all kinds to submit proposals. Submit online only..The deadline for submission is April 16, 2011; during the review process we may ask for presentation outlines and examples of previous sessions. We will send acceptance and declination notices by June.Avi Rappoport, (the new Program Chair of ESS Fall, 2011) specifically encourages search administrators to submit proposals for presentations based on case studies, research, and practical experience. Proposals can include all aspects of enterprise search: strategic, technical, cultural, financial, and managerial.We are interested in adding more interactivity and audience participation, so we may redesign some topics from pure lecture mode to round-tables, seminars, challenges, quizzes, and other forms of community discussion. We are very interested in proposals which deprecate PowerPoint in favor conversational approaches.New topics we would particularly like to see included: Scaling and distributing search to handle millions of documents and thousands of queries per minuteIndexing pipeline tools and processes, including both proprietary solutions and new initiatives such as OpenPipeline, Pypes, IUMA and ManifoldCF. Can SaaS cloud-hosted search work in the enterprise, or is the exposure too much for comfort? What’s scarier about search than CRM?What can new semantic technologies add to search? Where is the return on investment most clear?UX and usability, making best use of new interface elements such as autocomplete and AJAX. Mobile search - is it a UI problem only?Standard federated search languages such as RSW, OAI and OpenSearch, with special emphasis on security and authentication.Most effective ways of integrating search on images, audio, and video files.Who should own search in the enterprise? What is the right level of investment in search projects and ongoing maintenance? Thank you and we look forward to seeing your proposal! Enterprise Search Summit, Fall 2011, submit online.. Enterprise Search Summit 2011, Program chair Avi Rappoport[...]



links: Forbes re Unified Access, ecommerce search, autonomy idol search, users search skills limited

Thu, 14 Apr 2011 22:30:37 GMT

A Vision for Unifying Access to Data and Documents - Forbes [Attivio] Describes Attivio's combination of unstructured and structured data for a new kind of BI. ... entity data and the sentiment analysis and other analysis techniques become part of the meta data for the unstructured data. This allows the unstructured data to be presented as structured data. You can do relational joins between the structured and unstructured data to answer questions like: “Find all negative product reviews that mention both the iPad and our top-20 best sellers.” tags: BI enterprise site search engines search vendors MagnetStreet attracts sales with improved site search - Internet Retailer (SLI systems) Consumers using site search browse about 11 more pages per visit than shoppers who don’t, and remain on the e-commerce site  nearly 13 minutes longer. tags: e-commerce site-search search vendors search engiens Autonomy IDOL Universal Search product page "IDOL Universal Search provides users with a simple, personalized search experience tailored specifically to their unique requirements. In many typical environments, content may be spread across disparate internal data repositories and systems as well as external content sources and engines. Users are forced to conduct multiple, repetitive searches or risk missing crucial information. With Universal Search, a single query quickly federates unified results across systems in an intuitive, easy-to-navigate Web interface. Users get the big picture quickly through advanced visualization and dynamic categorization for a faster, more thorough exploration of results." tags: search vendors enterprise search engines federated Incompetent Research Skills Curb Users' Problem Solving (Jakob Nielsen's Alertbox) Results from a usability test - only 1% of the time did users change search strategy.  So search engines need to concentrate on improving reliability of simple search results. tags: research exploratory search user experience Posted from Diigo. The rest of my favorite links are here.[...]



notes: Interaction Models for Faceted Search, compound term processing

Wed, 13 Apr 2011 22:30:28 GMT

Posted from Diigo. The rest of my favorite links are here.




links: ElasticSearch + Cassandra / Lucandra, Solandra, Google Rich Snippets, Libraries

Mon, 11 Apr 2011 22:30:33 GMT

Posted from Diigo. The rest of my favorite links are here.




links for location-awareness research, Applied Relevance taxonomy expertise

Thu, 07 Apr 2011 22:30:24 GMT

Posted from Diigo. The rest of my favorite links are here.




YUI AutoComplete code, Google Commerce site search update

Sat, 02 Apr 2011 22:30:25 GMT

Posted from Diigo. The rest of my favorite links are here.




links: Google Custom Search API pricing and web search option, Autocompleter job description

Fri, 01 Apr 2011 22:30:21 GMT

Posted from Diigo. The rest of my favorite links are here.




links: "Good" Search Results, PHP/ir

Wed, 30 Mar 2011 22:30:23 GMT

Posted from Diigo. The rest of my favorite links are here.




links: migrating to Solr, Oracle Secure Search licensing

Tue, 29 Mar 2011 22:30:27 GMT

Posted from Diigo. The rest of my favorite links are here.




Enterprise Search Summit, Fall 2010: Call for Presentations

Mon, 28 Mar 2011 04:43:39 GMT

We've already received some good submissions via KMWorld, but this is for search in particularEnterprise Search Summit, Fall 2010: Call for PresentationsThe blurb: Search technology can pay off in big ways, but only if the implementation is firmly focused on user information needs. When search crosses boundaries and provides access to relevant information regardless of domain, it helps the enterprise network become more than the sum of its parts, but getting to that level requires a significant investment. Specific solutions can pioneer the way, solving high profile problems with scalable tools, and then incrementally expanding to make additional silos searchable. Whether applied in an international corporation, government agency, or complex web catalog, search technology is just a means to the true goal of supporting actions and decisions.The Enterprise Search Summit is a forum to share lessons learned from successes, but also from failures; to recognize approaches which return good value for the investment of money and resources; and to evaluate promising new technologies. Presenters in this conference are implementing enterprise-level search in corporations, governmental departments, public catalogs, and even e-commerce sites.Pre-conference workshops will offer an introduction to all aspects of enterprise search, including the indexing pipeline, query processing, recall, precision and relevance, user interface, scaling both content and traffic, federated search, analytics and more.To Submit a PresentationEnterprise Search Summit, Fall 2011 invites search engine practitioner of all kinds to submit proposals. Submit online only: http://www.enterprisesearchsummit.com/Fall2011/Proposal.aspx.The deadline for submission is April 16, 2011; during the review process we may ask for presentation outlines and examples of previous sessions. We will send acceptance and declination notices by June.Avi Rappoport, (the new Program Chair of ESS Fall, 2011) specifically encourages search administrators to submit proposals for presentations based on case studies, research, and practical experience. Proposals can include all aspects of enterprise search: strategic, technical, cultural, financial, and managerial.We are interested in adding more interactivity and audience participation, so we may redesign some topics from pure lecture mode to round-tables, seminars, challenges, quizzes, and other forms of community discussion. We are very interested in proposals which deprecate PowerPoint in favor conversational approaches.New topics we would particularly like to see included:Scaling and distributing search to handle millions of documents and thousands of queries per minuteIndexing pipeline tools and processes, including both proprietary solutions and new initiatives such as OpenPipeline, Pypes, IUMA and ManifoldCF. Can SaaS cloud-hosted search work in the enterprise, or is the exposure too much for comfort? What’s scarier about search than CRM?What can new semantic technologies add to search? Where is the return on investment most clear?UX and usability, making best use of new interface elements such as autocomplete and AJAX. Mobile search - is it a UI problem only?Standard federated search languages such as RSW, OAI and OpenSearch, with special emphasis on security and authentication.Most effective ways of integrating search on images, audio, and video files.Who should own search in the enterprise? What is the right level of investment in search projects and ongoing maintenance? Thank you and we look forward to seeing your proposal! Enterprise Search Summit 2011, Program chair Avi Rappoport[...]



links: Secure search, Sharepoint strategy, RAMP multimedia, Open Vocabularies Service, Solr & Lucene

Sat, 26 Mar 2011 22:30:23 GMT

Secure Search in Enterprise Webs: Tradeoffs in Efficient Implementation for Document Level Security Useful research - dependence of query processing time on result set size and visibility density for different classes of user. Scaled up to collections of tens of thousands of documents, our results suggest that query times will be unacceptable if exact counts of matching documents are required and also for users who can view only a small proportion of documents. We show that the time to conduct access checks is dramatically increased if requests must be sent off-server, even on a local network, and discuss methods for reducing the cost of security checks. We conclude that enterprises can effectively reduce DLS overheads by organizing documents in such a way that most access checking can be at collection rather than document level, by forgoing accurate match counts, by using caching, batching or hierarchical methods to cut costs of DLS checking and, if applicable, by using a single portal both to access and search documents. tags: enterprise intranets search engines security user experience Developing a SharePoint 2010 Strategy. . . or How Setting It Up and "Getting It Out There" Is Not a Strategy A synthesis of many cases where Sharepoint has been implemented so haphazardly that the results complex structures and frustrating user experiences.  Jeff Carr identifies key components for success with SharePoint: purpose, governance, people & objectives, requirements, IA, technology, and maintenance. tags: intranets enterprise CMS search engine information architecture RAMP Multimedia Search Service RAMP site search indexes text, images, audio, and video with its own transcript-creator and MetaPlayer interface.  It provides faceted search results, as well as federated and "blended" search, and has contextual content recommendations and interface widgets, search suggestions, spell correction, and keyword merchandising   The search is designed to work with the company's Publishing, Workflow, Video and Advertising modules, but may not depend on them. It's SaaS and the pricing model is mysterious but probably fairly high. tags: cloud-hosted multimedia site-search ecommerce enterprise search engine href="http://www.diigo.com/user/searchtools/search-vendors">search-vendors The Open Vocabularies Service - SKOS Editor, repository for Controlled Vocabularies A collection of controlled vocabularies, a Visual Vocabulary Editor, to visualize relationships between concepts in complex classification systems.  Import/export formats include SKOS, HTML, Excel tags: taxonomy tagging concept-matching Solr Powered ISFDB – Part #7: Simple UI Low-key post blogging changes to the default Solr user interface with.   tags: search engine user experience [...]



links on autocomplete for search

Fri, 25 Mar 2011 22:30:24 GMT

Searchers Punt Early [netflix movie searching] Walter Underwood of Netflix posts about the minimal text that people use to search for movie titles.  Examples include frank g[ehry], baron mu[nchausen], and apoc[alypto]. Adding auto-complete to the search field improved usability considerably. tags: autocomplete search log analysis analysis Typeahead Search With CouchDB | Couchbase Blog Algorithm and code for implementing autocomplete with CouchDB, based on frequency of words in the index. tags: autocomplete APIs JavaScript Organizing query completions for web search Creating autocomplete suggestions based on query logs, click-through patterns and reformulation, and clustering them with an appropriate label.  The result is compared to web search logs, but not tested with users or click-through analysis.(PDF available from author's page) tags: autocomplete clustering click-tracking search log analysis US patents on "autocomplete" A scary number of software patent applications on autocomplete. tags: autocomplete Advancing search query autocompletion services with more and better suggestions An academic approach to efficiency in generating autocompletion suggestions, and evaluating the results using only query logs as input.  I think that more traditional relevance testing with user assessments and click-tracking is much more likely to be meaningful.  (PDF is available at the authors' publications pages) tags: autocomplete relevance evaluation search log analysis Posted from Diigo. The rest of my favorite links are here.[...]



links: RSS & Social Media usability, Hash URLs, Adobe Flash to HTML

Fri, 25 Mar 2011 18:28:48 GMT




Avi is Program Chair for ESS Fall

Wed, 23 Mar 2011 02:19:40 GMT

I'm happy to announce that I'm the new Program Chair for the Enterprise Search Summit Fall conference. It's really exciting to work on this!

I'll be posting a call for presentations soon, so please think about what you might want to talk about.

Enterprise Search Summit Fall 2011
November 1-3, in Washington DC, with KMWorld, Taxonomy Boot Camp, and SharePoint Summit

Any suggestions?



Judge rules against Google Books settlement

Wed, 23 Mar 2011 00:48:03 GMT

I think this is a Really Good Thing, and the Judge agrees, saying that the settlement went way too far planning

"to implement a forward-looking business arrangement that would grant Google significant rights to exploit entire books, without permission of the copyright owners. Indeed, the ASA would give Google a significant advantage over competitors, rewarding it for engaging in wholesale copying of copyrighted works without permission, while releasing claims well beyond those presented in the case.”

"OPINION: In the end, I conclude that the ASA is not fair, adequate, and reasonable."

Read the whole thing: http://docs.justia.com/cases/federal/district-courts/new-york/nysdce/1:2005cv08136/273913/971/

justia Google Books category: http://dockets.justia.com/docket/new-york/nysdce/1:2005cv08136/273913/ (general)



links: Boilerplate code library, enterprise relevance, HTML5

Tue, 08 Mar 2011 22:30:32 GMT

boilerpipe - removes clutter around web page content (java code library) The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page. The library already provides specific strategies for common tasks (for example: news article extraction) and may also be easily extended for individual problem settings. Extracting content is very fast (milliseconds), just needs the input document (no global or site-level information required) and is usually quite accurate. Boilerpipe is a Java library written by Christian Kohlschütter. It is released under the Apache License 2.0. The algorithms used by the library are based on (and extending) some concepts of the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlschütter et al., presented at WSDM 2010 -- The Third ACM International Conference on Web Search and Data Mining New York City, NY USA. Click here to read the paper and the presentation slides. A video of the presentation is freely available on Videolectures.net (turn speaker balance to the left to improve audio quality). Commercial support is available through Kohlschütter Search Intelligence. tags: analysis APIs indexing What makes relevance such a challenge in the enterprise? (sharepoint & fast search blog) Nice overview of why internal search is often worse than web search: mainly that there's little meaningful linking within an intranet, little incentive to make a site easily searchable, and security issues with access control.  The post recommends realistic expectations, not indexing low-value content, looking at third-party relevance tools, offering scope or zoned search, and tagging content. tags: enterprise search engines intranets overviews relevance HTML5 specification, w3 Complex and difficult to read, though I can tell they're trying to make it easier. tags: site-search web-search research HTML5 - A Step Forward Towards Semantic Web Nice introduction to the new structural tags in HTML5: section, article, aside, header, hgroup, footer, and nav, and new content tags: figure, video, audio, canvas. tags: semantic search web-search Posted from Diigo. The rest of my favorite links are here.[...]



notes on Google CSE maximum results, Yahoo BOSS (Build Your Own Search Server), clouds from Attivio

Wed, 02 Mar 2011 22:30:36 GMT

Posted from Diigo. The rest of my favorite links are here.




New InfoDocket information service

Mon, 28 Feb 2011 22:30:18 GMT

Posted from Diigo. The rest of my favorite links are here.