Documentation
< All Topics
Print

3.3 Model Training

Traditionally, we split data into training and validation sets in 80:20 ratio. Before we start the model training, make sure you have ingested data into Momentum and created training and validation sets. If you have not done so, here are links that will helps you to prepare the data. 

How to ingest data Momentum 

How to create training and test sets using Transformer 

How to create a data processing pipeline 

To explain the process of training a model, we will use deep learning or artificial neural network based multi-layer perceptron classifier that predicts if a machine will fail given its operating condition. We have ingested the AI4I 2020 Predictive Maintenance Dataset (https://archive.ics.uci.edu/ml/datasets/AI4I+2020+Predictive+Maintenance+Dataset), and created 80% training and 20% test sets.  

To train a model, e.g., multi-layer perceptron classifier, here are the steps: 

  1. Expand “ML Model” from under the “Machine Learning” section of the left side menu panel, and click “ML Home” to launch the ML home page. 
  1. Click “Create New Model”.  
  1. From the “Supervised Learning: Classification” drop down, select “Deep Learning/ Artificial Neural Network/Multilayer Perceptron Classifier.  
  1. Fill out the form as described below, and shown in Figure 3.1a and 3.1b for example: 
  1. Model Name: give a meaningful name to identify this model, for example Machine_Failure_Prediction_Model 
  1. Give a version number, just in case you need to build multiple versions of this model. 
  1. In the Feature Field text area, supply a comma separated list of all features you want the model to learn from. In this example, we are using the following features: 

Air_temperature,Process_temperature,Rotational_speed,Torque,Tool_wear_in_min 

Listing 3.1: Feature list 

  1. Categorical/Non-numerical fields: comma separated list of all features that are not numeric. We will not have such field, so we will leave it empty. 
  1. OneHot Encodable Field: Categorical and non-numeric fields should be encoded. Leave it empty in this case. 
  1. Target Field that needs to be predicted. Machine_failure is our target field. 
  1. Number of Classes: The machine failure data has only two classes – 0 means no failure and 1 means failure. We will fill 2 in this field. 
  1. Scale Features: It is generally a good idea to scale the features, we will select “yes”. 
  1. Feature excluded from scaling: We will leave this field empty. 
  1. Number of hidden layers: This is to configure the neural network. We will start with 3 hidden layers. 
  1. Number of nodes in each hidden layer: We will use varying number of nodes in each layer. For example, 19,11,9 to indicate we want to use 19 nodes in the first hidden layer, 11 in the second and 9 in the last hidden layer. If you want to use the same number of neurons in each hidden layer, use a single number. For example, if we enter 16 in this field, all hidden layers will have 16 neurons. 
  1. Max Number of Iteration: We are starting with 1000. If the algorithm converges before it reaches the max 1000 iterations, the training will automatically stop to avoid unnecessary computation and time.  
  1. Training: Test ratio to split the training data internally into this ratio. We are using 0.8:0.2 for 80% and 20% split. 
  1. Max Core: This is to parallelize the training by using multiple CPU cores of the cluster. We are going to use 16 cores as our dataset is not large. For large dataset, using more or all available CPU cores of the cluster will speed up the training process. 
  1. Memory per Core: For most training 4GB per core should be sufficient but may be increased for a large and complex model. 
  1. The number of partitions is not used at this time and is reserved for the future. 

Figure 3.1a: Showing a part of the neural network configuration form. 

Figure 3.1b: Showing the remaining part of the neural networks form 

  1. After submitting the form, a rectangular config widget is created on the main body of the page. 
  1. Expand Transformer from the left menu panel and click on the training set transformer to bring it to the main page. 
  1. Click on the “Out” on the training set rectangle and click on the “In” of the model config rectangle. This will join the training dataset to the model config (shown in Figure 12 below) 
  1. Saving the configuration will take the screen to the ML home page. 
  1. Click “Run” located at the top menu to start the model training. 

Figure 3.2: Screen showing model configuration 

Table of Contents

Lester Firstenberger

Lester is recognized nationally as a regulatory attorney and expert in consumer finance, securitization, mortgage, and banking law.

Lester is recognized nationally as a regulatory attorney and expert in consumer finance, securitization, mortgage, and banking law. In a variety of capacities, over the past 30 years as an attorney, Mr. Firstenberger has represented the interests of numerous financial institutions in transactions valued in excess of one trillion dollars. He was appointed to and served a three-year term as a member of the Consumer Advisory Council of the Board of Governors of the Federal Reserve System. He has extensive governmental relations experience in the US and Canada at both the federal and state and provincial levels.

Shamshad (Sam) Ansari is an author, inventor, and thought leader in the fields of computer vision, machine learning, artificial intelligence, and cognitive science. He has extensive experience in high scale, distributed, and parallel computing. Sam currently serves as an Adjunct Professor at George Mason University, teaching graduate- level programs within the Data Analytics Engineering department of the Volgenau School of Engineering. His areas of instruction encompass machine learning, natural language processing, and computer vision, where he imparts his knowledge and expertise to aspiring professionals.

Having authored multiple publications on topics such as machine learning, RFID, and high-scale enterprise computing, Sam’s contributions extend beyond academia. Sam’s book, titled “Building Computer Vision Applications Using Artificial Neural Networks,” has garnered acclaim with two published editions. It received recognition as one of the top 10 books ever written on this subject by bookauthority.org, highlighting the significant impact and quality of Sam’s contributions to the field. He holds four US patents related to healthcare AI, showcasing his innovative mindset and practical application of technology.

Throughout his extensive 20+ years of experience in enterprise software development, Sam has been involved with several tech startups and early-stage companies. He has played pivotal roles in building and expanding tech teams from the ground up, contributing to their eventual acquisition by larger organizations. At the beginning of his career, he worked with esteemed institutions such as the US Department of Defense (DOD) and IBM, honing his skills and knowledge in the industry.

Currently, Sam serves as the President and CEO of Accure, Inc., an AI company that he founded. He is the creator, architect, and a significant contributor to Momentum AI, a no-code platform that encompasses data engineering, machine learning, AI, MLOps, data warehousing, and business intelligence. Throughout his career, Sam has made notable contributions in various domains including healthcare, retail, supply chain, banking and finance, and manufacturing. Demonstrating his leadership skills, he has successfully managed teams of software engineers, data scientists, and DevSecOps professionals, leading them to deliver exceptional results. Sam earned his bachelor’s degree in engineering from Birsa Institute of Technology (BIT) Sindri and subsequently a Master’s degree from the prestigious Indian Institute of Information Technology and Management Kerala (IIITM-K).

Moghisuddin Raza

Mogishuddin Raza is a technology leader. As the COO of Accure he is having global product delivery responsibility along with overall strategic and operational responsibility.

Mogishuddin Raza is a technology leader. As the COO of Accure he is having global product delivery responsibility along with overall strategic and operational responsibility.

Having extensive background in technology product development and integration, in particular to Enterprise storage, virtualization, cloud computing, high availability & business continuity technology/solutions, and Big Data & related technologies. Has been passionate and evangelizing the usage of Big data technologies using Momentum to implement advanced analytics (descriptive and predictive) to directly impact the business via an intuitive set of use cases.

Having approximately two decades of experience in high-tech industries which includes big MNCs corporate like EMC Corp and Hewlett-Packard to mid-size organization such as Netkraft, Trados Inc driving transformation in strategizing, planning and architecting product engineering, execution and delivery of high quality products releases within budget & time.

Skilled in all aspects of big MNCs as well as company startups and growth including: strategizing, business planning, market research, finance, product development and profit margins & revenue management. Excellent leadership and people motivation skills. Expert in managing cross-functional, cross cultural global team and building strategic partnership in the global virtual matrix team environment.

Overall, a senior software business professional, skilled in the management of people, resources and partnerships which enables building an eco system for a winning organization.



Rajesh Kumar Nedungadi

Scion of A Former Royal House of Kerala, India President Garuttman Group, USA. Rajesh is an entrepreneur & visionary specializing in International Business Strategy and Market Development with focus on Middle East & North America. With over 20 years’ experience in international trade, Rajesh is an expert on Business Strategy Development, Market Opportunity Development and International Sales & Marketing of Products and Services including the IT Industry. Rajesh is working as Managing Partner / Board Member of many companies including, Globistic Company USA, Castlewick Companies, USA.

Former Gartner Analyst having authored or co-authored over 280 research notes, on emerging technologies like AI, SD-WAN, 5G, mobile video, cloud CDN, IoT, SASE in Cybersecurity, 6G, etc. in the past decade. A frequent speaker at tech events, he is often quoted in leading institutions like CNN, Wall St. Journal, etc. He is a former CTO of one of the first video/WiFi smartphone firms. Currently also a Cybersecurity Advisor at Lionfish Tech Advisors, and on the advisory board for a few startups.

 Sharma helped contribute to fly-by-wire standards used in avionics and holds an Engineering degree in Computer Systems Engineering from Carleton University in Canada and completed graduate coursework in AI/ML from there.

His Engineering Thesis was on the application of Hopfield’s Neural Networks applied to Combinatorial Optimization.

Mike joined Accure as the Senior Vice President of Sales after leaving MarkLogic where he worked for 17 years and held several executive sales positions including his last role leading the global sales strategy. Prior to MarkLogic, Mike spent 10 years at Autonomy as VP of sales for their Business Process Management practice. He began his career in database technology with Informix in the Chicago area where he led Channel Sales for Central US.

When not driving Accure’s sales, Mike spends time with his lovely wife, Meg, two daughters, Leigh Rose and Georgia, and his puppy Kalua (“Lulu”).