Data Engineer
Come work on fantastically high-scale systems with us! Blis is an award-winning, global leader and technology innovator in big data analytics and advertising. We help brands such as McDonald's, Samsung, and Mercedes Benz to understand and effectively reach their best audiences.
In doing this, we are industry champions in our commitment to the ethical use of data and believe people should have control over their data and privacy. With offices across four continents, we are a truly international company with a global culture and scale. We’re headquartered in the UK, financially successful and stable, and looking forward to continued growth. We’d love it if you could join us on the journey!
We are looking for solid and experienced Data Engineers to work on building out secure, automated, scalable pipelines on GCP. We receive over 350 GB of data an hour and respond to 400,000 decision requests each second, with petabytes of analytical data to work with.
We tackle challenges across almost every major discipline of data science, including classification, clustering, optimisation, and data mining. You will be responsible for building stable, production-level pipelines that maximise the efficiency of cloud compute, ensuring that data is properly enabled for operational and scientific use.
This is a growing team with big responsibilities and exciting challenges ahead of it, as we look to reach the next 10x level of scale and intelligence. Our employees are passionate about teamwork and technology and we are looking for someone who wants to make a difference within a growing, successful company.
At Blis, Data Engineers are a combination of software engineers, cloud engineers, and data processing engineers. They actively design and build production pipeline code, typically in Python, whilst having practical experience in ensuring, policing, and measuring good data governance, quality, and efficient consumption. To run an efficient landscape, we are ideally looking for candidates who are comfortable with event-driven automation across all aspects of our operational pipelines.
As Blis data engineers, we seek to understand the data and the problem definition and to find efficient solutions, so critical thinking is key to efficient pipelines and effective reuse. This must include designing pipelines with the correct controls and recovery points, not only for function and scale. Across the team, everyone supports each other through mentoring, brainstorming, and pairing up. They have a passion for delivering products that delight and astound our customers and that have a long-lasting impact on the business. They do this while also optimising themselves and the team for long-lasting agility, which is often synonymous with practicing Good Engineering. They are almost always adherents of Lean Development and work well in environments with significant amounts of freedom and ambitious goals.
Shift: 12 pm - 8 pm
Key responsibilities
- Design, build, monitor, and support large-scale data processing pipelines.
- Support, mentor, and pair with other members of the team to advance our team’s capabilities and capacity.
- Help Blis explore and exploit new data streams to innovate and support commercial and technical growth.
- Work closely with Product and be comfortable with taking, making, and delivering against fast-paced decisions to delight our customers. The ideal candidate will be comfortable with fast feature delivery followed by a robustly engineered follow-up.
Skills and experience
- 5+ years of direct experience delivering robust, performant data pipelines within the constraints of direct SLAs and commercial financial footprints.
- Proven experience in architecting, developing, and maintaining Apache Druid and Imply platforms, with a focus on DevOps practices and large-scale system re-architecture
- Mastery of building pipelines in GCP, maximising the use of native and native-supporting technologies, e.g. Apache Airflow
- Mastery of Python for data and computational tasks with fluency in data cleansing, validation and composition techniques.
- Hands-on implementation and architectural familiarity with all forms of data sourcing, i.e. streaming data, relational and non-relational databases, and distributed processing technologies (e.g. Spark)
- Fluency with all appropriate Python libraries typical of data science, e.g. pandas, scikit-learn, scipy, numpy, MLlib and/or other machine learning and statistical libraries
- Advanced knowledge of cloud-based services, specifically GCP
- Excellent working understanding of server-side Linux
- A professional approach to managing tasks and reporting progress on them, ensuring appropriate levels of documentation, testing, and assurance around your solutions.
Desired
- Experience optimizing both code and config in Spark, Hive, or similar tools
- Practical experience working with relational databases, including advanced operations such as partitioning and indexing
- Knowledge and experience with tools like AWS Athena or Google BigQuery to solve data-centric problems
- Understanding and ability to innovate, apply, and optimize complex algorithms and statistical techniques to large data structures
- Experience with Python Notebooks, such as Jupyter, Zeppelin, or Google Datalab to analyze, prototype, and visualize data and algorithmic output
About us
Blis is the geo-powered advertising tech stack. We’ve built a radically different omnichannel advertising solution structured on geography, not identity. Audience Explorer is our powerful audience planning platform delivering actionable intelligence & insight to advertisers.
With Blis, advertisers can plan unified audiences with data from premium partners, connected by geo. Buy audiences using smart cookieless technology that can double performance and halve costs. Measure the audience, not just the channel, with patent-pending omnichannel measurement technology.
Established in the UK in 2004, Blis now operates in more than 40 offices across five continents, working with the world’s largest and most successful companies, as well as every major media agency.
As an equal opportunity employer, we treat all our employees and job applicants fairly and equally. We oppose all forms of unlawful and unfair discrimination and take all reasonable steps to create a work environment in which all employees are treated with respect and dignity. We don't condone or tolerate any form of harassment, by employees or by others who do business with us.
Our values
Brave
We're leaders not followers
An innovation and growth mindset helps us solve everyday challenges and achieve breakthroughs. Our passion drives us to innovate. We don’t see barriers, just possibilities.
We take ownership and hold ourselves accountable for outcomes, good and bad – and we don’t pass the buck.
Love our clients
We're client obsessed
We do what we say and build trusted relationships with our partners for the long term. We act with integrity. We put our clients at the centre of our business. We obsess over the best insights, ideas and solutions to deliver WOW and work with honesty and accountability to get it done.
Inclusive
We're one team
We are empathetic and embrace diversity. Everyone has a voice and can bring their authentic self to work. We care about and support each other – with humility and good humour. Mutual respect and wellbeing are key. We strive to eliminate bias and be open and transparent.
Solutions driven
We're action oriented
Speed matters in business, so we're solution-driven and action-oriented. We value simplification and calculated risk taking. We are lean, agile and resourceful self-starters. We collaborate and break silos, working thoughtfully and with urgency to solve problems, while learning from mistakes and celebrating wins.
- Department: Engineering
- Locations: Mumbai
- Remote status: Hybrid Remote