Back to Table of contents

Primeur weekly 2013-09-23

Exascale supercomputing

Supercomputers for all: The next frontier for Europe's high performance computing ...

The Cloud

DataDirect Networks ushers in a new era of scale, speed and data insight for web-scale and Big Data applications ...

Cybersecurity researcher joins the ranks of the 'brilliant' ...

IBM commits $1 billion to fuel Linux and open source innovation on Power systems ...

Siemens plans North American HR technology Cloud migration to Oracle Fusion HCM with help from KPMG ...

HP announces Cloud-driven enhancements to application and catabase archiving solution ...

EuroFlash

AMS-IX starts new regional Internet exchange hub in Mombasa, Kenya, in collaboration with KIXP ...

Altair announces most successful UK technology event to date ...

Graphene photodetector integrated into computer chip ...

PRACE pre-commercial procurement of whole system design for energy efficient HPC ...

Severo Ochoa (SO) programme at Barcelona Supercomputing Center ...

Quantum entanglement only dependent upon area ...

University of Leicester hosts national supercomputing meeting, showcasing science from the smallest and largest scales ...

USFlash

Six vendors complete test event to become part of OpenFabrics Alliance Interoperability Logo Programme ...

OpenFabrics Alliance announces formation of OpenFramework Working Group ...

DataDirect Networks and ASG partner to offer a private Cloud solution for high performance, high availability archive and collaboration ...

DataDirect Networks to power business-critical broadcast system servicing for Starz ...

Connecting the future ...

$50 million NSF grant to advance cyberinfrastructure for Big Data in life sciences ...

Supercomputing the transition from ordinary to extraordinary forms of matter ...

Cray adds large memory and shared memory solutions to its line of Cray CS300 cluster supercomputers ...

On the road to fault-tolerant quantum computing ...

Bold idea for 'Big Data' ...

NASA uses supercomputer to create future climate projections for the United States ...

Supercomputer sheds weight, gains mobility, and becomes "Server-on-the-Go" ...

Graphene could yield cheaper optical chips ...

Scaling up personalized query results for next generation of search engines ...

Scaling up personalized query results for next generation of search engines


North Carolina State University
18 Sep 2013 Raleigh - North Carolina State University researchers have developed a way for search engines to provide users with more accurate, personalized search results. The challenge in the past has been how to scale this approach up so that it doesn't consume massive computer resources. Now the researchers have devised a technique for implementing personalized searches that is more than 100 times more efficient than previous approaches.

At issue is how search engines handle complex or confusing queries. For example, if a user is searching for faculty members who do research on financial informatics, that user wants a list of relevant web pages from faculty, not the pages of graduate students mentioning faculty or news stories that use those terms. That's a complex search.

"Similarly, when searches are ambiguous with multiple possible interpretations, traditional search engines use impersonal techniques. For example, if a user searches for the term 'jaguar speed', the user could be looking for information on the Jaguar supercomputer, the jungle cat or the car", stated Dr. Kemafor Anyanwu, an assistant professor of computer science at North Carolina (NC) State and senior author of a paper on the research. "At any given time, the same person may want information on any of those things, so profiling the user isn't necessarily very helpful."

Dr. Anyanwu's team has come up with a way to address the personalized search problem by looking at a user's "ambient query context", meaning they look at a user's most recent searches to help interpret the current search. Specifically, they look beyond the words used in a search to associated concepts to determine the context of a search. So, if a user's previous search contained the word "conservation" it would be associated with concepts likes "animals" or "wildlife" and even "zoos". Then, a subsequent search for "jaguar speed" would push results about the jungle cat higher up in the results - and not the automobile or supercomputer. And the more recently a concept has been associated with a search, the more weight it is given when ranking results of a new search.

Search engines have also tried to identify patterns in user clicking behaviour on search results to identify the most probable user intent for a search. However, such techniques are impersonal and are applied on a global basis. So, if the most frequent click pattern for a set of keywords is in a particular context, then that context becomes the context associated with queries for most or all users - even if your recent search history indicates that your query context is about jungle cats.

"What we are doing is different", Dr. Anyanwu stated. "We are identifying the context of search terms for individual users in real time and using that to determine a user's intention for a specific query at a specific time. This allows us to deal more effectively with more complex searches than traditional search engines. Such searches are becoming more prevalent as people now use the Web as a key knowledge base supporting different types of tasks."

While Dr. Anyanwu and her team developed a context-aware personalized search technique over a year ago, the challenge has been how to scale this approach up. "Because running an ambient context programme for every user would take an enormous amount of computing resources, and that is not feasible", Dr. Anyanwu stated.

However, Dr. Anyanwu's research team has now come up with a technique that includes new ways to represent data, new ways to index that data so that it can be accessed efficiently, and a new computing architecture for organizing those indexes. The new technique makes a significant difference.

"Our new indexing and search computing architecture allows us to support personalized search for about 2,900 concurrent users using an 8GB machine, whereas an earlier approach supported only 17 concurrent users. This makes the concept more practical, and moves us closer to the next generation of search engines", Dr. Anyanwu stated.

The paper, "Personalizing Search: A Case for Scaling Concurrency in Multi-Tenant Semantic Web Search Systems", will be presented at the 2013 IEEE International Conference on Big Data being held October 6-9 in Santa Clara, California. Lead author of the paper is Dr. Haizhou Fu, a former Ph.D. student at NC State. The paper was co-authored by Hyeongsik Kim, a Ph.D. student at NC State. The research was supported by the National Science Foundation.
Source: North Carolina State University

Back to Table of contents

Primeur weekly 2013-09-23

Exascale supercomputing

Supercomputers for all: The next frontier for Europe's high performance computing ...

The Cloud

DataDirect Networks ushers in a new era of scale, speed and data insight for web-scale and Big Data applications ...

Cybersecurity researcher joins the ranks of the 'brilliant' ...

IBM commits $1 billion to fuel Linux and open source innovation on Power systems ...

Siemens plans North American HR technology Cloud migration to Oracle Fusion HCM with help from KPMG ...

HP announces Cloud-driven enhancements to application and catabase archiving solution ...

EuroFlash

AMS-IX starts new regional Internet exchange hub in Mombasa, Kenya, in collaboration with KIXP ...

Altair announces most successful UK technology event to date ...

Graphene photodetector integrated into computer chip ...

PRACE pre-commercial procurement of whole system design for energy efficient HPC ...

Severo Ochoa (SO) programme at Barcelona Supercomputing Center ...

Quantum entanglement only dependent upon area ...

University of Leicester hosts national supercomputing meeting, showcasing science from the smallest and largest scales ...

USFlash

Six vendors complete test event to become part of OpenFabrics Alliance Interoperability Logo Programme ...

OpenFabrics Alliance announces formation of OpenFramework Working Group ...

DataDirect Networks and ASG partner to offer a private Cloud solution for high performance, high availability archive and collaboration ...

DataDirect Networks to power business-critical broadcast system servicing for Starz ...

Connecting the future ...

$50 million NSF grant to advance cyberinfrastructure for Big Data in life sciences ...

Supercomputing the transition from ordinary to extraordinary forms of matter ...

Cray adds large memory and shared memory solutions to its line of Cray CS300 cluster supercomputers ...

On the road to fault-tolerant quantum computing ...

Bold idea for 'Big Data' ...

NASA uses supercomputer to create future climate projections for the United States ...

Supercomputer sheds weight, gains mobility, and becomes "Server-on-the-Go" ...

Graphene could yield cheaper optical chips ...

Scaling up personalized query results for next generation of search engines ...