Back to Table of contents

Primeur weekly 2016-09-26

Focus

OpenPOWER and IBM at the verge of the era of cognitive computing ...

Exascale supercomputing

Supercomputers receive funding to help predict and modify new materials ...

Focus on Europe

Calling on experts to propose tutorials and workshops for ISC 2017 ...

Adept project concludes with open-source release of energy measurement tools ...

Specific Grant Agreement One signed by the European Commission and the Human Brain Project ...

University of Amsterdam coordinates cognition research in Human Brain Project ...

New EU publication on Science, research and innovation performance of the European Union ...

Middleware

Quantum names Molly Rector Vice President of Marketing ...

IBM Research and MIT collaborate to advance frontiers of artificial intelligence in real-world audio-visual comprehension technologies ...

Hardware

UC San Diego Extension and San Diego Supercomputer Center launch Modern Data Science Academy ...

TSMC and Synopsys collaboration delivers innovative technologies for the High Performance Compute (HPC) Platform ...

Cadence and TSMC advance 7nm FinFET designs for mobile and HPC platforms ...

Volkswagen innovates manufacturing with move to Verne Global ...

Kinetica unveils GPU-accelerated database for analyzing streaming data with enhanced performance, visualization and high availability ...

CORNAMI announces $3 million funding led by Impact Venture Capital ...

Mellanox Technologies appoints Alinka Flaminia as Senior Vice President and General Counsel ...

DataCore and Supermicro offer ultra-high performance enterprise class Hyper-converged solutions ...

TYAN HPC platforms add support for NVIDIA Tesla P100, P40 and P4 GPUs ...

CENATE adds powerful testbed from NVIDIA ...

Reconfigurable chaos-based microchips offer possible solution to Moore's law ...

New Hikari supercomputer starts solar HVDC ...

India's North East region's fastest supercomputer launched by honourable Union Minister for HRD at IIT Guwahati ...

DDN leverages Lustre leadership to deliver new scalability and management capabilities and open source contributions ...

Applications

Termination of lethal arrhythmia with light ...

Artificial intelligence reveals mechanism behind brain tumour ...

Tracking down the origin of mercury contamination in human hair ...

Next generation of statistical tools to be developed for the Big Data age ...

British Artificial Intelligence firm first in Europe to use advanced deep learning supercomputer ...

Denver International Airport selects Panasonic Weather Solutions for ground operations weather forecasting ...

Ansys and TSMC empower chip manufacturers to design cutting-edge multi-die chip-package systems ...

New study shows nickel graphene can be tuned for optimal fracture strength ...

Meet Rutgers' RADICAL supercomputing guru ...

NASA Career Award winner uses Blue Waters supercomputer to mine crop yield data ...

Southeast students and faculty optimize knowledge at Supercomputing Symposium ...

The Cloud

Oracle unveils its next generation Cloud strategy: Intelligent Applications ...

Oracle announces the industry's most comprehensive offering for analytics in the Cloud ...

IBM Power Systems and Red Hat extend collaboration for next-generation Cloud platforms ...

Trifacta v4 extends enterprise data wrangling to any user, any data, any Cloud ...

Salesforce introduces Salesforce Einstein - artificial intelligence for everyone ...

Trifacta v4 extends enterprise data wrangling to any user, any data, any Cloud


20 Sep 2016 San Francisco - Trifacta, a global expert in data wrangling, has released Trifacta v4. The latest release expands upon Trifacta's award-winning approach to data wrangling, with capabilities specifically designed to work for more users, more diverse data sources and within more Cloud environments.

"We're seeing tremendous demand for solutions that can put data preparation capabilities into the hands of business users, where the requirements and desires of analytic outcomes are best understood. Trifacta has established itself in the fast-growing self-service data preparation market and is continuing to build meaningful differentiation into their product as evidenced by the v4 release. Making the process of data wrangling easier and faster for a wider set of sources and deployment environments is critical to enterprise adoption", stated Stewart Bond, research director, IDC.

Trifacta v4 features the general availability of Builder, a new menu-driven workflow to guide users through data wrangling steps. The latest release includes the general availability of the Photon Compute Engine, improving the scale of data that users are able to wrangle on-the-fly, directly within the Trifacta application. Photon provides an optimized engine for datasets that do not require parallel processing within Trifacta's Intelligent Execution architecture. The v4 release also expands support for customers deploying Trifacta in Cloud environments such as Amazon Web Services, Google Cloud Platform and Microsoft Azure, while extending the ability of users to directly connect to a variety of enterprise data sources, including Microsoft SQL Server, MySQL, Oracle, PostgreSQL and Teradata.

"At Nordea Bank, we are constantly striving to improve the timeliness, accuracy and level of trust in our data to internal and external stakeholders. Trifacta v4 will enable us to involve our business subject matter experts more efficiently than ever before. This has allowed us to fundamentally reduce time to market and cost of managing data while demonstrably increasing the quality of our data products", stated Alasdair Anderson, executive vice president of data engineering, Nordea Bank.

Novelties in Trifacta v4 include:

  • Enhanced User Experience

A core focus area of the v4 release is to enrich Trifacta's unique data wrangling user experience by offering a new workflow for building data preparation steps. The addition of Builder to the Trifacta interface augments the ability of users to wrangle data without the need to utilize scripts. Builder is designed to guide users through complex data wrangling tasks, providing greater ease-of-use whether simply selecting a suggested transform or using drop-down menu options to build wrangling steps from scratch. With Builder, the process of preparing data is dramatically simplified by intelligently breaking down the steps of each wrangling task to enhance how non-technical users handle common and complex data.

"At Sanofi, a key corporate strategy is improving our processing of data across technical groups to provide more concise treatment, improve operational efficiency and reduce security risks. Trifacta is a core part of our success because it gives the Infrastructure Management Team the ability to manage large, diverse data sets and wrangle them into the formats we need for analysis. We’re excited about the release of v4 and especially how Builder will enable a broader set of users within Sanofi to intuitively prepare data in a simple, guided workflow. We hope to see more groups and departments use Trifacta moving forward for their data wrangling as we move to make it a service on our data analytics platforms", stated Jason Stoute, senior manager of infrastructure architecture, Sanofi.

The v4 release also expands upon Trifacta's blend of data visualization and machine learning to guide users through common data wrangling tasks. With pattern profiling, users visualize common and anomalous text patterns that are automatically detected within each column. The addition of fuzzy join allows users to blend together disparate data sources with similar values but non-exact matches. v4 also features the debut of column lineage, a breakthrough visual technique to expose the lineage of how each attribute or column within a dataset originated. With operationalization, v4 allows end users to set and manage end-to-end data wrangling workflows in a completely self-service process.

  • Improved Performance & Scale

The latest version delivers greater performance and scale for working with data directly within the Trifacta application, and an optimized in-memory data processing engine for data sets that do not require parallel processing. The general availability of the Photon Compute Engine enables users to wrangle a 100x larger volume of data on-the-fly, directly within the application, while still maintaining the fluid experience and immediate feedback, both of which are core to Trifacta's user experience.

For files, Photon enables users to transform entire data sets completely on-the-fly within the application, and also integrates seamlessly with Trifacta's Intelligent Execution architecture, complementing existing data processing engines Spark and MapReduce. Photon was specifically built to underpin Trifacta and provides unmatched performance and scale for data wrangling use cases when compared to other interactive computing engines. As part of v4, Trifacta has also enhanced support for executing transformations at scale, leveraging the Spark data processing framework by adding support for Spark 2.0.

"As an analyst, I spend much of my time exploring and refining data sets, running analysis, and examining the outcome to find the best solution to the business problem in front of me. The workflow is extremely important to my process. Delays and interruption can lead to hours of lost time on a project. With Trifacta, the data wrangling process is seamless, making it much easier for me to be productive and efficient. The addition of Photon improves upon what is already a great user experience by allowing us to interactively work with greater volumes of data while maintaining the same fluid workflow", stated Mike Riegling, supply chain data analyst, PepsiCo.

  • Extended Cloud Deployment and Data Source Connectivity

With v4, customers benefit from expanded support for deploying Trifacta in the Cloud through integrations with Amazon Web Services, Google Cloud Platform and Microsoft Azure. For Amazon Web Services, Trifacta provides integration with Amazon S3 and Redshift as input and output sources and deployment on EC2. Trifacta v4 also supports the Google Cloud Platform ecosystem with support for Google Cloud Storage and BigQuery as input and output sources, data processing via Google Dataflow and deployment on Google Compute Engine. The Microsoft Azure Cloud platform is also supported in v4. Trifacta adds support for deployment on Microsoft Azure HDI and can integrate data from Azure Blob Storage.

"We're seeing tremendous growth in the enterprise adoption of Microsoft Azure for critical analytics and business intelligence processes. A challenge customers mention to us is the need for a more effective process for cleaning and joining together diverse data. With Trifacta's added support and integration with Microsoft Azure Storage and Microsoft HDInsight as part of their v4 release, customers will now be able to accelerate these analytics processes with an industry-leading data wrangling solution for the Cloud", stated Tiffany Wissner, head of Big Data marketing, Microsoft.

Trifacta has also expanded support for creating live connections to common relational sources such as Microsoft SQL Server, MySQL, Oracle, PostgreSQL and Teradata. Unlike approaches that force customers to make copies of data prior to preparation, Trifacta creates a live connection, streaming in live data from external sources to incorporate directly into the wrangling process. v4 also includes the initial release of Trifacta's connectivity API giving customers and partners the ability to seamlessly integrate Trifacta with external data and services.

"Trifacta v4 represents our most significant release since the launch of the company. From the beginning, our goal has been to provide a self-service data preparation solution that helps customers connect their Big Data strategy to business value. As the leader in data wrangling, we're excited about the innovations v4 will deliver to the more than 3,500 companies using our products today", stated Adam Wilson, CEO, Trifacta.

To meet Trifacta team members and see a live demo of the v4 release, you can visit Trifacta at Strata + Hadoop World in New York at booth 539 from September 26-29, 2016.

To learn more about Trifacta v4, you can register for Trifacta's upcoming webinar to hear product management present an in-depth review and live demo of the v4 release.
Source: Trifacta

Back to Table of contents

Primeur weekly 2016-09-26

Focus

OpenPOWER and IBM at the verge of the era of cognitive computing ...

Exascale supercomputing

Supercomputers receive funding to help predict and modify new materials ...

Focus on Europe

Calling on experts to propose tutorials and workshops for ISC 2017 ...

Adept project concludes with open-source release of energy measurement tools ...

Specific Grant Agreement One signed by the European Commission and the Human Brain Project ...

University of Amsterdam coordinates cognition research in Human Brain Project ...

New EU publication on Science, research and innovation performance of the European Union ...

Middleware

Quantum names Molly Rector Vice President of Marketing ...

IBM Research and MIT collaborate to advance frontiers of artificial intelligence in real-world audio-visual comprehension technologies ...

Hardware

UC San Diego Extension and San Diego Supercomputer Center launch Modern Data Science Academy ...

TSMC and Synopsys collaboration delivers innovative technologies for the High Performance Compute (HPC) Platform ...

Cadence and TSMC advance 7nm FinFET designs for mobile and HPC platforms ...

Volkswagen innovates manufacturing with move to Verne Global ...

Kinetica unveils GPU-accelerated database for analyzing streaming data with enhanced performance, visualization and high availability ...

CORNAMI announces $3 million funding led by Impact Venture Capital ...

Mellanox Technologies appoints Alinka Flaminia as Senior Vice President and General Counsel ...

DataCore and Supermicro offer ultra-high performance enterprise class Hyper-converged solutions ...

TYAN HPC platforms add support for NVIDIA Tesla P100, P40 and P4 GPUs ...

CENATE adds powerful testbed from NVIDIA ...

Reconfigurable chaos-based microchips offer possible solution to Moore's law ...

New Hikari supercomputer starts solar HVDC ...

India's North East region's fastest supercomputer launched by honourable Union Minister for HRD at IIT Guwahati ...

DDN leverages Lustre leadership to deliver new scalability and management capabilities and open source contributions ...

Applications

Termination of lethal arrhythmia with light ...

Artificial intelligence reveals mechanism behind brain tumour ...

Tracking down the origin of mercury contamination in human hair ...

Next generation of statistical tools to be developed for the Big Data age ...

British Artificial Intelligence firm first in Europe to use advanced deep learning supercomputer ...

Denver International Airport selects Panasonic Weather Solutions for ground operations weather forecasting ...

Ansys and TSMC empower chip manufacturers to design cutting-edge multi-die chip-package systems ...

New study shows nickel graphene can be tuned for optimal fracture strength ...

Meet Rutgers' RADICAL supercomputing guru ...

NASA Career Award winner uses Blue Waters supercomputer to mine crop yield data ...

Southeast students and faculty optimize knowledge at Supercomputing Symposium ...

The Cloud

Oracle unveils its next generation Cloud strategy: Intelligent Applications ...

Oracle announces the industry's most comprehensive offering for analytics in the Cloud ...

IBM Power Systems and Red Hat extend collaboration for next-generation Cloud platforms ...

Trifacta v4 extends enterprise data wrangling to any user, any data, any Cloud ...

Salesforce introduces Salesforce Einstein - artificial intelligence for everyone ...