Offres d'emploi Big Data Administrator

À propos du poste Big Data Administrator

Responsible for the operation, maintenance, configuration, and development of Ligadata systems, VMWare Greenplum data warehouse, and ETL pipelines in Informatica for telecom operators.

Principal Duties And Responsibilities

  • Be responsible for maintaining large-scale data pipelines and systems.
  • Manage and optimize ETL pipelines in Informatica, Apache Nifi ,Kafka ensuring efficient data movement across platforms.
  • Troubleshoot and resolve issues related to data warehouse pipelines , ensuring data integrity and performance.
  • Develop and implement pipelines that extract, transform, and load data into an information product that helps the organization reach its strategic goals
  • Focus on ingesting, storing, processing, and analyzing large datasets
  • Create scalable, high-performance web services for tracking data
  • Translate complex technical and functional requirements into detailed designs
  • Investigate alternatives for data storing and processing to ensure implementation of the most streamlined solutions
  • Serve as a mentor for junior staff members by conducting technical training sessions and reviewing project outputs

Responsibilities

  • Develop and maintain data pipelines using ETL processes
  • Work closely with data science team to implement data analytics pipelines
  • Help define data governance policies and support data-versioning processes
  • Maintain security and data privacy, working closely with data protection officer
  • Analyze vast number of data stores to uncover insights
  • Identify and automate the collection of valuable data sources.
  • Preprocess structured and unstructured data.
  • Analyze large datasets to uncover trends and patterns.
  • Develop predictive models and machine learning algorithms.
  • Implement ensemble modeling techniques.
  • Present data-driven insights using advanced visualization tools.
  • Propose actionable solutions and strategies to address business challenges.
  • Collaborate closely with engineering and product development teams.

Education And Qualification

  • Degree in computer science, mathematics, engineering, data science or equivalent
  • Liga data or Cloudera Administration Experience is a must
  • Awareness of big data components.
  • Expertise in Data Science and Statistics.
  • Skills in Data Analytics and Data Analysis.
  • Proficiency in Data Visualization.
  • Strong problem-solving and critical thinking abilities.
  • Background in machine learning and artificial intelligence.
  • Experience with Python, Spark, and Hive, Scoop, Flume
  • Understanding of data-warehousing and data-modeling techniques
  • Knowledge of industry-wide visualization and analytics tools (ex: Tableau, R)
  • Strong data engineering skills
  • Experience with streaming frameworks such as Kafka,streamset
  • Knowledge of Core Java, Linux, SQL, python and any scripting language

Experience

  • 7+ Years of Relevant Experience

Special Skills

  • Independent problem solver, self-driven and possess great tact
  • Pleasant personality, excellent communications and presentation skills
  • Enjoys new challenges and interest in technologies and services
  • Good command of spoken and written English
  • Ability to work under pressure and flexible, willing extend working hours to exceptionally complete projects
  • Ability to work in a team, professionally and personally with different reporting schemes
  • Effective time management