Data Engineer

Predictice is hiring!

About

Predictice, the first predictive justice solution 🚀

Predictice, a startup founded in January 2016, simplifies the search and analysis of legal information through artificial intelligence. Our engine enables thousands of legal professionals to improve their performance, retain their clients and increase their revenue.

Hundreds of law firms and companies are now clients, including several CAC 40 organizations. Our clients are spread across France, and we work closely with the bar associations of several major cities.

The project is strongly supported by public authorities (Ministère de la Justice, BPI France) as well as by the start-up ecosystem: Predictice is an alumnus of Télécom ParisTech, NUMA (season 11) and Village by CA.

We are a determined team that wants to put its talent to work in an intense human adventure: bringing together the worlds of data and law.

Here are our values. If you recognize yourself in them, go for it:

  • Kindness (be humble, the rest will follow)
  • Dedication (work isn't everything in life, but still)
  • Anticipation (being one step ahead is the minimum when you're building the justice of tomorrow)
  • Knowledge sharing (engineers and lawyers have plenty to tell each other, you'll see)

Job Description

What’s the job?

Predictice is looking for a full-time Data Engineer to join the team in Paris. You will work on a team that creates and improves services and tools for processing legal documents in order to automatically extract and structure information. Working on an agile team means taking on all kinds of new challenges and keeping up to date with cutting-edge frameworks and architectures. You should have 2+ years of data engineering experience, but above all you are motivated, eager to learn and good at overcoming technical difficulties. The tools and frameworks we use can be found here.

What you will do

  • Design, prototype, implement, test and debug software, primarily in Scala and Python
  • Build distributed pipelines to collect, clean and move data from and to Data Lakes
  • Spot, evaluate and integrate new data sources for internal or product services
  • Build and maintain scheduled ELT pipelines
  • Ensure the data is ready for use
  • Build and fine-tune data pipeline architectures for maximum scalability
  • Work with your team, product, marketing, and other engineers to move features through the development life-cycle

Preferred Experience

What we are looking for (musts)

  • Strong computer science fundamentals such as algorithms and data structures
  • Experience developing data pipeline infrastructure (Cron, Airflow, Luigi, Kedro, etc.)
  • Experience with Apache Spark and the Hadoop ecosystem
  • Knowledge of Scala and Python, or at least fluency in one of the two with the desire and ability to learn the other
  • Excellent problem-solving ability and initiative
  • Careful attention to detail in writing, reviewing and testing code

Nice to have

  • Experience with ELK (Elasticsearch & friends)
  • Experience with containers and orchestration tools
  • Experience with AWS ETL services or other IaaS
  • Comfortable in English (for reading documentation and watching presentations)

Benefits

  • Corporate culture & strong human values
  • Great atmosphere & ideal working environment at WeWork
  • Opportunity to become a key member of a fast-growing startup
  • Super good health insurance (Alan.eu)
  • Stock option plan (BSPCE)

Recruitment Process

  • General interviews (CEO, CTO and/or team member)
  • Take-home problem-solving exercise
  • Technical interview

Additional Information

  • Contract Type: Full-Time
  • Start Date: 08 March 2018
  • Location: Paris, France (75009)
  • Education Level: Master's Degree
  • Experience: > 2 years