Amir Jafari

Amir Jafari, Ph.D.

Data Scientist & AI Researcher

Google Cloud Faculty Expert  ·  IEEE Member  ·  10+ Years in AI Research

Deep Learning NLP & Generative AI Neural Networks Time Series Google Faculty Expert

About Me

Amir Jafari, Ph.D., is an AI Researcher and Google Cloud Faculty Expert. His research focuses on Generative AI, Multi-Agent Systems, Natural Language Processing (NLP), Natural Language Generation (NLG), and Predictive Modeling—areas where he has made sustained contributions over more than a decade.

His work spans a broad spectrum of artificial intelligence methodologies, including deep learning, large language model (LLM) design and orchestration, statistical modeling, signal processing, and optimization of recurrent neural networks.

In recent years, his primary research has centered on Generative AI architectures and autonomous multi-agent systems—developing frameworks in which collaborative agents address complex, real-world problems at scale—alongside continued advances in NLP/NLG and GPU-accelerated training of large-scale models.

Education

Ph.D. in Electrical and Computer Engineering Oklahoma State University  ·  2012 – 2016
M.Sc. in Mechatronics Engineering American University of Sharjah  ·  2009 – 2011

Contact

Research Interests

  • Natural Language Processing & Generation
  • Generative AI & Multi-Agent Systems
  • Large Language Models (LLMs)
  • Deep Learning & Neural Network Design
  • Machine Learning & Predictive Modeling
  • Time Series Analysis & Forecasting
  • Computer Vision & Image Processing
  • Recurrent Neural Networks (RNN / LSTM / GRU)
  • Statistical Modeling & Bayesian Theory
  • GPU-Accelerated Big Data Training

Awards & Recognition

  • Google Cloud Faculty Expert (2020)
  • IEEE Member (2016)
  • Guest Editor, Neural Computing and Applications
  • Peer Reviewer, Neural Computing & Applications
  • Nvidia Computer Vision Certification
  • DataCamp Data Science Certification

Citation Metrics

  • Total Citations: 336+
  • h-index: 8
  • i10-index: 7

Skills & Expertise

Technical competencies spanning AI research, software engineering, and cloud platforms.

AI / ML Domains

Machine Learning Deep Learning Neural Network Design Generative AI Multi-Agent Systems Natural Language Processing Natural Language Generation Large Language Models Computer Vision Predictive Modeling Time Series Analysis Bayesian Theory Stochastic Systems Clustering (SOM, K-Means) Optimization (LM, GD, SCG) System Dynamics Nonlinear Control

Frameworks & Libraries

Python TensorFlow 2.x PyTorch 2.x Keras 3.x LangChain LangGraph LangSmith Streamlit Linux / C / Bash

Cloud Platforms

Google Cloud (GCP) Amazon Web Services (AWS) Microsoft Azure GPU Computing Multi-core Processing

NLP / LLM Models

BERT / XLNET / Electra LLAMA 2 Falcon Dolly QLoRA Adapters Reformer Longformer Whisper Wav2Vec DeepSpeech UniSpeech

Research & Grants

Active research programs in AI, neural networks, and data-driven decision making.

Multi-Agent AI & LLMs

Research on large language model orchestration, multi-agent architectures, and Natural Language Generation (NLG) and Understanding (NLU) for complex real-world tasks.

LLMsMulti-AgentGenerative AI

Neural Network Design & Training

Novel training methods for recurrent neural networks, efficient optimization algorithms, and clustering methods for detecting network extrapolation in adaptive control systems.

RNNOptimizationNARX

Time Series & Forecasting

Deep learning approaches for temporal forecasting using LSTM, GRU, and Transformers, with comparative studies against classical ARMA/ARIMA baselines.

LSTMTransformersARIMA

Medical AI & Computer Vision

Application of CNNs and pre-trained architectures (DenseNet, ResNet) to medical image analysis, surgical outcome prediction, and disease detection.

CNNMedical AIGAN

Funded Projects

2024

TRAILS Seed Project — Trustworthy AI Systems

National Science Foundation (NSF)  ·  Co-Principal Investigator

June 2024 – Present  ·  Advancing research on trustworthiness of AI through novel algorithmic approaches and validation methodologies.

2025

Energy Technology and Decision Making Research Initiative

Alliance for a Sustainable Future  ·  Principal Investigator

May 2025 – Present  ·  Investigating intersection of energy technologies and decision-making frameworks for sustainable resource utilization.

Teaching

Graduate-level data science courses with a focus on theory, implementation, and real-world application.

DATS 6312 — Natural Language Processing

Encoder and decoder language models, transformers, BERT variants, fine-tuning strategies, and NLP pipelines for real-world applications.

NLPTransformersLLMs

DATS 6202 — Machine Learning

Supervised and unsupervised learning, regression, classification, ensemble methods, and model evaluation and selection.

Scikit-learnSupervisedUnsupervised

DATS 6203 — Deep Learning

Neural network architectures — MLP, CNN, RNN, LSTM, GRU, Transformers — with hands-on GPU training using TensorFlow and PyTorch.

PyTorchTensorFlowGPU

DATS 6103 — Data Mining

Knowledge discovery, pattern recognition, clustering, association rules, and data preprocessing at scale.

ClusteringPatternsPython

DATS 6401 — Data Visualization

Visual communication of data insights using Python libraries and interactive dashboards for exploratory analysis.

MatplotlibPlotlyDashboards

DATS 6450 — Cloud Computing

Cloud infrastructure for data science — GCP, AWS, and Azure — including distributed computing and big data processing frameworks.

GCPAWSAzure

DATS 6501 — Data Science Capstone

End-to-end applied data science projects integrating machine learning, data engineering, and communication skills.

Applied DSMentorshipResearch

Publications

Peer-reviewed journal articles, conference papers, and preprints.

Full Publication List on Google Scholar

For a complete, always up-to-date list of publications with citation counts and full metrics, visit my Google Scholar profile.

Open Google Scholar Profile
336+
Total Citations
8
h-index
7
i10-index
26
Publications

Most Cited Works

Loading…

All Publications

Sorted newest first. To add a paper, edit data/publications.json and push to GitHub.

Loading publications…

Selected Projects

Research and applied projects spanning AI, healthcare, finance, and engineering.

Large Language Modeling — Multi-Agent Systems

Ongoing research on Natural Language Generation (NLG) and Natural Language Understanding (NLU) using multi-agent architectures and state-of-the-art LLMs including LLAMA 2, Dolly, and QLoRA fine-tuning methods.

CurrentLLMsMulti-Agent

Automatic Speech Scoring

An automated speech scoring engine that classifies audio files using transcription and language models in series. Transcriber and language models are applied in a pipeline to assess speech quality. Applicable to education and the music industry.

Speech AINLP

Predicting S&P 500 with Recurrent Neural Networks

Developed a NARX (Nonlinear Auto Regressive with Exogenous inputs) model for S&P 500 prediction. Efficient RNN training methods based on novel error surface analysis were used. Results compared against classical ARMA and ARIMA baselines.

Finance AINARX

Intracranial Hemorrhage Detection

Processed over one million CAT scan images from raw DICOM files to detect and classify intracranial hemorrhage and bleeding types. Used pre-trained DenseNet, ResNet, and ImageNet architectures.

Medical AICNN

Take-Home STEM Labs (THL)

Developed a curated repository of take-home laboratory experiences for undergraduate STEM courses, enabling remote access where lab experiences were previously impractical. Materials cost less than a textbook and are available online.

EducationSTEM

Neural Network Observer for Elastic Drive Systems

Developed a Diagonal Recurrent Neural Network (DRNN) observer combined with elastic drive system dynamics to identify nonlinear friction characteristics (Coulomb and viscous friction torques). Significantly outperforms traditional linear observers.

Control SystemsRNN

Autonomous Ground Vehicle — Fuzzy Logic Navigation

Designed and built an autonomous ground vehicle from scratch, programmed with an Arduino embedded system. Used fuzzy logic algorithms for obstacle avoidance and path following in real-time environments.

RoboticsFuzzy Logic

Neural Model Reference Adaptive Control

Novel training methods for RNNs applied to system identification. A novelty sampling method using self-organizing maps collects data wisely; clustering detects extrapolation. Applied to magnetic levitation system control (simulated and experimental).

Adaptive ControlRNN

Open Source Packages

Python packages published on PyPI for the data science and deep learning community.

Time Series Toolbox

A comprehensive Python toolbox for time series analysis and forecasting, covering classical statistical models (ARMA, ARIMA) and modern deep learning architectures (LSTM, Transformers).

PyPITime SeriesForecasting
PyPI Docs

NN Design Demos

Python demonstrations paired with the Neural Network Design and Neural Network Design: Deep Learning textbooks. Interactive visualizations of neural network concepts.

PyPINeural NetworksEducation
PyPI

NNSOM

A novel clustering neural network package implementing Self-Organizing Maps (SOM) and advanced unsupervised learning methods for machine learning and deep learning projects.

PyPIClusteringSOM
PyPI Docs

Book: Neural Network Design — Deep Learning

An open-source textbook covering neural network design principles and deep learning methodologies. Features solved problems and practical exercises. Companion package available on PyPI as nndesigndemos.

GitHub Repository

Time Series Toolbox

timeseries-toolbox  ·  Python Package

A comprehensive Python toolbox for time series analysis and forecasting. Covers classical statistical models (ARMA, ARIMA, Box-Jenkins) and modern deep learning architectures (LSTM, GRU, Transformers, NARX). Designed for researchers and practitioners working on sequential data problems in finance, engineering, and science.

Install pip install timeseries-toolbox

Classical Statistical Models

ARMA, ARX, ARIMA, and Box-Jenkins methods for classical statistical time series analysis and forecasting.

ARIMAARMABox-Jenkins

Deep Learning Models

LSTM, GRU, and Transformer-based architectures for sequence modeling and multi-step ahead forecasting.

LSTMGRUTransformers

Recurrent Neural Networks

NARX, NAR, Focused Time Delay, and Elman network implementations for nonlinear time series identification.

NARXNARElman

NN Design Demos

nndesigndemos  ·  Python Package

A collection of Python demonstrations paired with the Neural Network Design and Neural Network Design: Deep Learning textbooks by Hagan, Demuth, Beale, and Jafari. Provides interactive visualizations and hands-on exercises to reinforce theoretical concepts. All classic MATLAB-based textbook demos have been reimplemented in Python.

Install pip install nndesigndemos

Textbook Companion

Interactive demos paired chapter-by-chapter with the Neural Network Design textbooks for structured, hands-on learning.

TextbookInteractive

Core NN Concepts

Visualizations covering perceptrons, backpropagation, training algorithms, recurrent networks, and deep learning architectures.

MLPBackpropTraining

Python Implementation

All classic MATLAB-based textbook demos reimplemented in Python for global accessibility and open-source contribution.

PythonOpen Source

NNSOM

NNSOM  ·  Python Package

A novel clustering neural network package for machine learning and deep learning projects. Implements Self-Organizing Maps (SOM) and advanced unsupervised learning methods designed to handle high-dimensional data with interpretable cluster representations. Suitable for exploratory data analysis, anomaly detection, and visualization.

Install pip install NNSOM

Self-Organizing Maps

Full SOM implementation with competitive learning, neighborhood functions, and topographic map visualization for high-dimensional data.

SOMUnsupervised

Advanced Clustering

Novel clustering methods with support for sequential and high-dimensional data, going beyond traditional K-Means approaches.

ClusteringK-Means

Visualization Tools

Built-in tools for visualizing cluster maps, U-matrices, distance matrices, and data distributions in trained SOMs.

VisualizationU-Matrix