About this role
Min. Experience Required:  Fresh graduates are encouraged to apply, as we value and support the growth of fresh talent.  Preference will be given to candidates who possess hands-on experience with AI tools and Machine Learning/ Deep Learning techniques.  Good level of understanding and hands-on experience of SQL, Python, Big Data and other databases/data sources. DESCRIPTION OF RESPONSIBILITIES:  Conceive, plan, develop, and deliver data solutions using Python and open-source tools (e.g., Apache Airflow, Apache Kafka, Apache Spark) to create and optimize robust data pipelines for various day-to-day operational tasks.  Develop and maintain data set processes for modeling, mining, and production by leveraging Python libraries such as Pandas, NumPy, and Scikit-learn, as well as AI frameworks like TensorFlow and PyTorch for advanced analytics.  Integrate cutting-edge data management techniques and open-source technologies (e.g., Docker for containerization and Kubernetes for orchestration) into existing infrastructures to enhance scalability and efficiency.  Connect securely to diverse data sources, understand their limitations, and ingest data into various storage systems, data lakes and Processing frameworks (using tools like Apache Hive, Apche Nifi, Hbase, Hadoop, Apche Spark, Apache Pig, Apache Flink), ensuring proper content, format, and integrity throughout the data lifecycle.  Collaborate with cross-functional teams and departments to gather requirements, provide consulting, and deliver workshops, ensuring seamless integration of AI and big data solutions.  Employ a data-driven approach to accelerate performance by utilizing AI tools and libraries such as Keras, TensorFlow, and PyTorch, integrating these into existing workflows.  Forecast and generate predictive reporting using machine learning models and Python-based frameworks to provide real-time insights and drive strategic decision-making.  Optimize and automate day to day processes and operations using AI tools.  Optimize and automate day-to-day processes using AI tools and open-source software, enhancing operational efficiency and reducing manual overhead.  Preference will be given to candidates with NLP and large language model (LLM) experience. Experience with tools and libraries such as Hugging Face Transformers, spaCy, NLTK, and familiarity with LLMs  Analysis and data modeling of raw data related to different stakeholders within organization  Working with cross-functional teams and departments for requirement gathering.  Developing solutions for effective data modeling and data pipelining.  Integrate new data management techniques, technologies, and software engineering tools into existing structures and platforms  Implementing database and web application technologies to design, develop, and evaluate innovative AI based solutions.  Automating periodical reports and statistical analysis  Merge BI platforms with enterprise systems and applications Functional Skills / Knowledge Areas  Excellent Interpersonal skills  Self-Motivated: Proactive, independent, and able to prioritize tasks in a fast-paced environment.  Exceptional communication skills and highly collaborative Team Player with ability to work under intense deadlines Qualification:  BS/BE in Electrical /Computer / Computer Science /Computer Engineering/Software Engineering/IT/Artificial Intelligence/Data Science, recognized by HEC or from PEC in case of engineering.  HEC verified degrees as well as previous experience certificates (in the original) at the time of interview. Location of Job: Islamabad Other Details: Contractual position (renewable).
