Job Description: The Data Engineering team is one of the core technology teams of Lumiq.ai and is responsible for creating all the Data related products and platforms which scale for any amount of data, users, and processing. The team also interacts with our customers to work out solutions, create technical architectures and deliver the products and solutions. If you are someone who is always pondering how to make things better, how technologies can interact, how various tools, technologies, and concepts can help a customer or how a customer can use our products, then Lumiq is the place of opportunities. Who are you? Enthusiast is your middle name. You know what’s new in Big Data technologies and how things are moving Apache is your toolbox and you have been a contributor to open source projects or have discussed the problems with the community on several occasions You use cloud for more than just provisioning a Virtual Machine Vim is friendly to you and you know how to exit Nano You check logs before screaming about an error You are a solid engineer who writes modular code and commits in GIT You are a doer who doesn’t say “no” without first understanding You understand the value of documentation of your work You are familiar with Machine Learning Ecosystem and how you can help your fellow Data Scientists to explore data and create production-ready ML pipelines Eligibility At least 2 years of Data Engineering Experience Have interacted with Customers Must Have Skills: Amazon Web Services (AWS) - EMR, Glue, S3, RDS, EC2, Lambda, SQS, SES Apache Spark Python Scala PostgreSQL Git Linux Good to have Skills: Apache NiFi Apache Kafka Apache Hive Docker Amazon Certification
Description Deep experience and understanding of Apache Hadoop and surrounding technologies required; Experience with Spark, Impala, Hive, Flume, Parquet and MapReduce. Strong understanding of development languages to include: Java, Python, Scala, Shell Scripting Expertise in Apache Spark 2. x framework principals and usages. Should be proficient in developing Spark Batch and Streaming job in Python, Scala or Java. Should have proven experience in performance tuning of Spark applications both from application code and configuration perspective. Should be proficient in Kafka and integration with Spark. Should be proficient in Spark SQL and data warehousing techniques using Hive. Should be very proficient in Unix shell scripting and in operating on Linux. Should have knowledge about any cloud based infrastructure. Good experience in tuning Spark applications and performance improvements. Strong understanding of data profiling concepts and ability to operationalize analyses into design and development activities Experience with best practices of software development; Version control systems, automated builds, etc. Experienced in and able to lead the following phases of the Software Development Life Cycle on any project (feasibility planning, analysis, development, integration, test and implementation) Capable of working within the team or as an individual Experience to create technical documentation
The hunt is for a strong Java Resources and team players with the ability to manage effective relationships with a wide range of stakeholders (customers & team members alike). Incumbent will demonstrate personal commitment and accountability to ensure standards are continuously sustained and improved both within the internal teams, and with partner organizations and suppliers. Skills : Java Experience: 7 to 9 Years Designation : Lead Engineer Location: Pune (Hinjewadi Phase -2) Position: Permanent Notice Period: 2 Months 1. Role Description: Extensive Java(1.8+) and J2EE development experience. Good knowledge of Design patterns(Creational/behavioural and architectural). In depth knowledge and experience of working with Spring(Boot, Core, MVC, Security, Batch, Cloud), Hibernate, Maven, Gradle etc. Proficient in Databases like Mysql, Oracle, Postgres, MongoDB Experience of JMS queues(ActiveMQ/RabbitMQ/Kafka) Proficient in writing unit and integration test cases. Should have working knowledge of linux in order to be able to deploy monitor and maintain a application. Should have knowledge about source control and deployment tools like GIT, Jenkins, bitbucket etc. Knowledge on micro services and full stack architectures is an additional plus. Should have knowledge in performance engineering and be able to do required optimizations. Ability to perform code reviews/ensure best practices. Alongside the candidate should posses excellent communication skills, and should be able to mentor a team when required. Those who are Interested can share their resume on firstname.lastname@example.org
We are young and passionate team building our own product. You will enjoy working with us.