See all the jobs at Indix here:
Software Engineer - Data Science
Pre-requisites:
- Bachelor's degree (BS or equivalent) in a quantitative field (e.g. Mathematics, Statistics, Engineering, Physics, Econometrics, Computer Science)
- 3+ years of hands-on experience writing high-performance code, with exposure to multi-threading, distributed computing, scalability, redundancy, and code optimization
- Hands-on experience in machine learning, algorithm development, data modeling, investigating patterns in data, and data visualization
- Experience handling more than 1 TB of structured, unstructured, or semi-structured data is a plus
- Strong academic and technical background in a quantitative discipline and a good track record of applying advanced classes of models to difficult problems
- Strong written and verbal English communication skills
- Self-motivated and persistent, with a “Never Give Up” attitude
- Passion for innovation and adaptability to a lean startup culture
- Comfortable with ambiguity; enjoys the challenge of creating new solutions for difficult big data problems and can work with very large, incomplete data sets
- Good appreciation for design thinking and strong data visualization skills
- Able to work with minimal supervision, both independently and as a member of a team
Job Description
- Retrieving or acquiring new data sets, then parsing, filtering, curating, and organizing large volumes of data using tools and other automated means
- Mining for data patterns in very large data sets and representing data visually
- Interacting with data dynamically and analyzing it using a variety of tools or by building custom tools
- Building and deploying state-of-the-art, data-driven predictive models and algorithms to solve analytical problems
- Building search indexes and relevance tuning algorithms
- Internet Search
- Data mining for patterns or statistics
- Insights will guide product direction
Technical skill set
- Strong programming background in C++ or Java, backed by professional work experience
- Programming experience with relational platforms such as SQL and non-relational technologies such as Hadoop, MapReduce, Hive, Mahout, and Pig
- Experience with information retrieval libraries such as Lucene/Solr