
Md Abdul Latif
Development
Dhaka, Bangladesh
Skills
Data Science
About
MD. ABDUL LATIF's skills align with Database Specialists (Information and Communication Technology). MD. also has skills associated with Programmers (Information and Communication Technology). MD. ABDUL LATIF has 30 years of work experience, with 12 years of management experience, including a mid-level position.
View more
Work Experience
Freelancer
May 2018 - Present
- Several big data machine learning initiatives involving the design, development and development of predictive modelling solutions: * Large Scale data analysis on YouTube Dataset using Spark and Hadoop in java and solved the following problems: * Top 20 videos with largest number of views. * Top 20 videos with largest number of likes. * Top 20 videos with largest number of dislikes. * Top 10 viewed categories. * Most watched categories per year * Analysis on all relevant Marketing Customer Data and develop focused customer retention programs Using only Scala map-reduce Spark API * Analysing Consumer Behaviour Using MBA Association Rule Mining. * Movie Recommender System. * H & M Personalized Fashion Recommendations. Development Tools: PySpark, Hadoop, Matplotlib, Pandas, NumPy, Seaborn, etc. * Water Quality Prediction. Classifier: AdaBoostClassifier, BaggingClassifier, GradientBoostingClassifier, DecisionTreeClassifier, KNeighborsClassifier. * Pneumonia Detection Using CNN. * Predictors of mental health illness. Classifier: Logistic Regression, Decision tree, Random forests, Bagging, Boosting, Stacking. Development Tools: Python, Sklearn, TensorFlow, h2o, keras, pandas, NumPy, matplotlib, seaborn, etc. * Spark-Scala application that creates a machine learning model for Predicting the arrival delay of commercial flights. Machin Learning Models are given bellow. * 1. Linear Regression * 2. Random Forest Trees * 3. Gradient-Boosted Trees * An Apache-Spark application with Java that infers qualitative data regarding the car accidents. * Number of lethal accidents per week throughout the entire dataset. * Number of accidents and average number of lethal accidents per week. * Number of accidents and percentage of deaths per contributing factor. * Analysis of Data Using Hadoop, MapReduce, Java, Hive, Pig. * Number of Accidents Per Month * Number of Accidents vs Hour of Day * Number of Accidents Per Day of the Week * Side of the Road Percentage * Number of Accidents Per State * Top 10 Accident Prone States and so on * Health care products Analysed using Big Data Technologies. * Numbers of each product sold: *Hadoop MapReduce, Java. * Top 10 products for each rating: * Hadoop MapReduce, Java. * Customer's products list with count of products purchased: * Hadoop MapReduce, Java. * Total Ratings count in the entire dataset: * Apache Pig. * Verified Products along with their minimum and maximum ratings: * Hive. * Verified/non-verified purchase of the overall products: * MongoDB. * Big Data Hadoop project for analysis of superstore sales data to find insights and interacts with Drop-Down Navigation Menu, developed in Java. Dependencies: Cloudera Hadoop. * Building and Mining Data Warehouse from UK car accidents data in flat files using Microsoft SQL Server and SSIS. * Design and development of a data warehouse of orders, sales, deliveries and payments for a commercial company using SQL Server, MySQL, CSV files, Talend Open Studio and Excel. * Drug Classification with Algorithms-Azure Databricks: Spark, Python, etc. * Prediction of tip or no-tip (1/0) for a taxi trip-Azure Databricks: Spark, Scala, etc. * Designed and developed credit scoring algorithms to predict which customer would possibly default for payment in the future. This project analysed customer's payment patterns and narrow down to cases where they are most likely to default. Decision trees and Random Forests classifier are used. * Designed and developed the solution to predict whether flight departures were delayed or not (binary classification), based on airport (origin and destination) and weather (e.g., windspeed, humidity, temperature) features. Logistic Regression, Gradient Boosting Tree and Random Forest Classifier are used. To improve performance of models' hyper-parameter tuning with cross-validation is used. * Amazon products Review twitter Sentiment Analysis. * Designed and development of models in this project include logistic and linear regression, random forests, and gradient-boosted trees (GBTs) for Prediction of the tip amount for a data set and prediction of tip or no tip for that dataset. To improve performance of models used cross-validation and hyper-parameter sweeping. * Environment: Hadoop Cluster, Spark, Pyspark etc. * Data Visualization: Matplotlib, Pandas etc. * The developed solution is to predict a loan default by users for large dataset provided by Lending Club. The models of this application are Random Forest and GBTs. * Building a Linear Regression Model for predicting house prices. * Predicting heart disease using logistic regression using a heart disease dataset to predict the occurrence of disease based on various attributes. * Environment: Hadoop Cluster, Spark, Java etc. * Data Visualization: JfreeChart. * Developed NLP models for Topic Extraction, Sentiment Analysis * Breast cancer analysis using a logistic regression model using a hospital's breast cancer dataset, where model helps to predict whether a breast lump is benign or malignant. * Topic Modelling using LDA. * Environment: Hadoop Cluster, Spark, Scala etc. * Data Visualization: Vegas, Breeze. * Bank Marketing Project using Scikit-learn: Logistic Regression, KNeighborsClassifier, DecisionTreeClassifier, RandomForestClassifier, GaussianNB, XGBClassifier, GradientBoostingClassifier, SVC. * Twitter Sentiment Analysis using Scikit-learn: Logistic Regression, SVC, MLPClassifier. * Twitter Sentiment Analysis using LSTM and BERT. * Fake News Detection. * A-B-testing. * Hypothesis testing. * Colour Detection. * Handwritten Digit Recognition. * Traffic Signs Recognition. * Detecting Parkinson's Disease with XGBoost. * Chatbot Project. * Python: NumPy, Pandas, Scikit-learn, matplotlib, TensorFlow, Keras, PyTorch, Tensor, OpenCV, etc. * Big Data Analysis, finding the patterns in crimes using Hadoop, Hive, Pig and Excel. * Determination of average and top 20 records pattern of datasets using Hadoop, MapReduce and Java. * Determination of top ten most viewed movies with their movies name using Hadoop, MapReduce and Java. * The project is to know how the genres have ranked by Average Rating, for each profession and age group. * In this project the following results from titanic dataset are determined using Pig MapReduce. * The average age of the people (both male and female) who died in the tragedy and * How many persons survived or died. * Analysis of Datasets for play-by-play baseball statistics using Pig MapReduce. * A Project with a Kafka producer who fetches data from twitter and sends it to a topic by an application in java and stores data into Hadoop (HDFS) using flume and analysed for different results. * Covid19-Tweet-Data-Collection of real time Tweets of current affairs of covid-19 using Kafka high throughput producer by an application in java & stored into Elasticsearch using Logstash, Kibana and analysed for different results.
Manager
MIS & SD
January 2009 - December 2018
- Bangladesh. Responsibilities: Data Analysis, Data Engineering, DBA, Database Application Development, System Analysis design, monitoring etc: Major Works: Some Examples of SSIS, SSAS & SSRS Projects: SSIS: Design and development of CentralDW Datawarehouse: Database: SQL Server 2014 and 2016, SSDT 2017 SSAS: Multidimensional Database: Analytical solutions that enables users need to view aggregations at different levels, i.e., to view business results for item basis and then drill down to see details for item types, then item categories, then item Sections, then Item subsections, then Item Heads and individual items, analyse in excel, MDX to query the cube and retrieve data. The cubes for profit and gross profit margin, KPIs, etc. Database: SQL Server 2016 and SSDT 2017 Tabular Data Model projects that can summarize data at multiple levels. Specifically, dates must be organized in temporal hierarchies, such as years, semesters, quarters, and months. Use of DAX, Cost, Profit, Margin, KPIs etc. SSRS: SQL Server Reporting Services is used for creating, publishing, and managing reports. Various kinds of reports are designed and developed for the organizational requirements, some of them enhanced to group and aggregate data, support interactive drill-down, in some cases charts and graphs are used for data visualizations. Asset Tracking System. (Web based) Module Description: Basic Layout, Asset Details, Operation Subsidiary, Statement Layouts, Documentation Layout, Servicing Layout, etc. Software and Platform: Visual Studio 2015, SQL Server 2014. Other Skill sets used: C#, ASP. Net, MVC, JavaScript, jQuery, Crystal Report, etc. Knitting Management System. (Web based) Module Description: Program start-up & Details, NPT Status, Fabric Inspection, Roll Wise Production Up gradation, Yarn Demand Notification, Shift Handover, Fabric Rejection, etc. Software and Platform: Visual Studio 2015, SQL Server 2014. Other Skill sets used: C#, ASP. Net, Crystal Report. Visual SourceSafe. Supply Chain Management System. (Web based) Module Description: Basic Arrangement, Item Basic, Item Subsidiary, Operation Basic, Operation Core, Enterprise Operation, Requisition operation, Material General Operation, Borrowing Operation, Transfer Operation, Stock Balance, Supplier Item Operation, Wastage Management etc. Software and Platform: Visual Studio 2010, SQL Server 2014. Other Skill sets used: C#, ASP. Net, JavaScript, jQuery, Ajax, CSS, Crystal Report, etc. Human Resources Management System. (Web based) Module Description: Attendance, Leave Management, Payroll, Training Management, Support and Logistic, Compliance Management, Recruitment etc. Software and Platform: Visual Studio 2010, SQL Server 2014. Other Skill sets used: C#, ASP. Net, Crystal Report. Visual SourceSafe. Integrated Production Management System. (Desktop Based) Module Description: Production, Inventory, Purchase, Stock Management etc. Software and Platform: Visual Studio 2008, SQL Server 2005. Other Skill sets used: C#, Crystal Report. Readymade Garments Store Management. (Desktop Based) Software and Platform: Visual Studio 2008, SQL Server 2005. Other Skill sets used: C#, Crystal Report. Fabric Store Management System. (Desktop Based) Software and Platform: Visual Basic 6.0, SQL Server 2000, Crystal Report. Integrated Inventory Management System. (Desktop Based) Module Description: Dyes & Chemical, Yarns and Grey Fabric, Finished Fabrics, Mechanical, Electrical, Stationary, Hardware, Medicine, Sewing and Printing martials, Construction, Furniture, Lab Item etc. Software and Platform: Visual Basic 6.0, SQL Server 2000, Crystal Report. And Other Desktop based Software's such as Cutting Status Management System, Dyeing Recipe Operation System, Accessories Store Management System, Swatch Management System, Order Sheet Processing System and so on. Software and Platform: Visual Basic 6.0, SQL Server 2000, Crystal Report.
Consultant
Store Inventory System
January 2007 - December 2008
- Accounting System. Module: Trial Balance, Ledger Maintain, Balance Sheet, Profit and Loss statements etc. Store Inventory System. Module Description: Inventory, Purchase, Stock Management. Software and Platform: Visual Basic 6.0, SQL Server 7.0, Crystal Report.
System Manager
Meghna Knit Composite Ltd
January 2006 - December 2007
- Noman Group - Dhaka, Bangladesh. Responsibilities: Project Planning, System Analysis, Design, Development, Testing, Implementation etc. Designing and Implementing Databases, Backup, Maintenance of Database etc. Major Work: Store Inventory Management System. Module Description: Inventory, Purchase, Stock Management. Payroll Software. Software and Platform: Visual Basic 6.0, SQL Server 7.0, Crystal Report.
Senior Programmer
Systech Computers Ltd
January 2004 - December 2005
- Major Work: Enterprise resource planning (ERP)
Freelancer
Nanakhi Casting Ind. Ltd
January 2003 - December 2004
- Major Work: Accounting System, Pay Roll System, Store Inventory.
Programmer
Sinha Rotor Spinning Ltd, Shaurav Engineering, Long Way Corporation
January 1998 - December 2003
- Bangladesh. Major Work:
Freelancer
HISHAB (Accounting Software)
January 1994 - December 1998
- Major Work: Store Inventory, World Wide Sales Analysis System, Student Bill Processing System, Financial Deposit Scheme Management System, etc.