
Samsung R&D Institute Poland

Senior Data Engineer

  • Warsaw Spire, plac Europejski 1, 00-844 Warszawa, Poland

    Warszawa, Mazowieckie
  • Valid for one more month
    15 February 2020
  • Full-time
  • Specialist
Senior Data Engineer
Ref. no.: BXS/LR/SDE

About our Team

We are co-creators of Bixby – a next-generation service that changes the way users interact with their devices. Our laboratory is a place where engineers, researchers and expert linguists collaborate on innovative products for the multilingual European market. We are currently developing speech recognition and speech synthesis software.


Role and Responsibilities

  • Design and implementation of data workflows for continuous, scalable data ingestion, integration, validation and delivery of data products for ML algorithm development.
  • Design and evaluation of data workflow architectures, mentoring junior staff.
  • Development of automation solutions for “human-in-the-loop” NLP data production and quality assurance, e.g. anonymization, annotation, speech transcription and translation.
  • Development of ML algorithms to increase the efficiency of the NLP data production process.
  • Collaboration with external companies, language experts and other R&D centers.

Technologies in use

  • Python
  • PostgreSQL
  • Jenkins
  • Airflow
  • Bash
  • Perl
  • Neo4j
  • Lucene
 

Skills and Qualifications

  • MSc or PhD in Computer Science, Signal Processing, Electronic Engineering or equivalent.
  • Several years of hands-on experience in software engineering, Python and Linux.
  • Experience in translating overall business needs into development and execution.
  • Understanding of data modelling, architecture and workflow management solutions.
  • Experience in building data ingestion pipelines from multiple sources.
  • Experience in building data processing pipelines that handle multiple data formats.
  • Experience with scalable, distributed data infrastructures using SQL and NoSQL databases.
  • Experience in developing solutions that rely on machine learning algorithms.
  • Experience with continuous delivery tools like Jenkins.
  • Experience with code versioning tools, such as Git.
  • Ability to write test-driven reusable code that is easy to maintain and well documented.
  • Ability to work effectively in a multi-disciplinary and multi-cultural team.
  • Ability to communicate effectively across different teams and delegate tasks to junior staff.
  • Knowledge of English at a level that enables reading and writing technical documentation.

Nice to have

  • Database Reliability Engineering knowledge/experience.
  • Service Reliability Engineering knowledge/experience.
  • Experience with scientific programming languages (Scala, R, Python)
  • Scientific writing experience (publications, blogging)
  • Experience/knowledge of ASR frameworks (Kaldi, wav2letter, etc.)
  • Experience/knowledge of audio signal processing techniques
  • Experience with data workflow management tools, e.g. Airflow, NiFi, Luigi, Prefect
  • Experience with graph database solutions, e.g. Neo4j or ArangoDB
  • Experience with full-text search engines, e.g. Elasticsearch, Solr, Lucene
 

We offer

Team

  • Friendly working atmosphere
  • Wide range of training and strong support in developing algorithmic skills
  • Opportunity to work on multiple projects
  • Working with the latest technologies on the market
  • Monthly team-building budget
  • Opportunity to attend local and foreign conferences
  • Work starts between 7 a.m. and 10 a.m.

Benefits

  • Private medical care (family members can be added for free)
  • Multisport card
  • Life insurance
  • Lunch card
  • Variety of discounts (Samsung products, theaters, restaurants)
  • Unlimited free access to Copernicus Science Center for you and your friends
  • Opportunity to test new Samsung products


Equipment

  • PC workstation/Laptop + 2 external monitors
  • OS: Linux, Windows

 

Location

  • Office in Warsaw Spire, near a metro station

Interested?

Please note that we will contact only selected candidates.



The controller of your personal data is SAMSUNG ELECTRONICS POLSKA Sp. z o.o., with its registered office in Warsaw, address: ul. Postępu 14, 02-676 Warszawa.

Archived job posting