The free Data Job Fair brings together the people from all kinds of data-related jobs and the compaines looking for them.
Sample areas of expertise
DW developer | Data modeller | DW architect | BI specialist | Datamining expert | Big Data engineer | Data scientist
Data Cinema
As a warm-up exercise we will show short movies on the world of data and data analysis. The program will consist of – among others – several spectacular data visualization clips and TED lectures. The final selection will be defined by votes of the audience.
Data Careers talks
There are many different paths leading to the world of data. We will invite several data professionals from different companies, who will tell us how they found their own way.
HR Pitch Competition
We are running a HR pitch competition for the exhibitors. 5 out of these companies will have the opportunity to present their companies and try to persuade audience about why they are the best place to work for. The best pitch title will be decided by the audience.
Online Data Job Board
The open positions will be available online as well, so everyone who cannot attend the event has the chance to check out them later.
Detailed program, list of exhibitors and speakers available here: datajobfair.hu
This event is FREE.
This talk will go through the history and current state of processing engines for Hadoop, in particular, focussing SQL engines on Hadoop. We will, then, dive deep into one of the SQL processing engines for Hadoop - Cloudera Impala.
The Cloudera Impala project is pioneering the next generation of Hadoop capabilities: the convergence of fast SQL queries with the capacity, scalability, and flexibility of a Hadoop cluster. WithImpala, the Hadoop community now has an open-sourced codebase that helps users query data stored in HDFS and Apache HBase in real time, using familiar SQL syntax. In contrast with other SQL-on-Hadoop initiatives, Impala's operations are fast enough to do interactively on native Hadoop data rather than in long-running batch jobs. Now you have the freedom to discover relationships and explore what-if scenarios on Big Data datasets. By taking advantage of Hadoop's infrastructure, Impala lets you avoid traditional data warehouse obstacles like rigid schema design and the cost of expensive ETL jobs.In the Flink runtime layer both batch and streaming jobs are executed as a common data flow graph thus unifying batch and stream processing in an elegant way. Flink provides a more straight-forward and transparent approach than the lambda architecture or other state of the art solutions. Flink also provides exactly-once processing guarantees for streaming programs with a combination of upstream backup and consistent user state snapshots.
The highly efficient runtime layer offers competitive performance compared to current streaming solutions with a rich and expressive API. This talk will focus on the API and runtime features of Flink Streaming in comparison with current industry standard streaming solutions.
A workshopon való részvétel előfeltételei:
A workshopra olyanok jelentkezőket várunk, akik érdeklődnek a data science iránt. A workshopon történő részvétel előképzettséget nem igényel.