Adventures in Machine Learning » TensorFlow on Feedspot

A2C Advantage Actor Critic in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In a previous post, I gave an introduction to Policy Gradient reinforcement learning. Policy gradient-based reinforcement learning relies on using neural networks to learn an action policy for the control of agents in an environment. This is opposed to controlling agents based on neural network estimations of a value-based function, such as the Q value in deep Q learning. However, there are problems with straight Monte-Carlo based methods of policy gradient learning as covered in the previously mentioned policy gradient post. In particular, one significant problem is a high variance in the lea ..read more

Visit website

Python TensorFlow Tutorial – Build a Neural Network

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

Updated for TensorFlow 2 Google’s TensorFlow has been a hot topic in deep learning recently. The open source software, designed to allow efficient computation of data flow graphs, is especially suited to deep learning tasks. It is designed to be executed on single or multiple CPUs and GPUs, making it a good option for complex deep learning tasks. In its most recent incarnation – version 1.0 – it can even be run on certain mobile operating systems. This introductory tutorial to TensorFlow will give an overview of some of the basic concepts of TensorFlow in Python.&n ..read more

Visit website

Bayes Theorem, maximum likelihood estimation and TensorFlow Probability

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

A growing trend in deep learning (and machine learning in general) is a probabilistic or Bayesian approach to the problem. Why is this? Simply put – a standard deep learning model produces a prediction, but with no statistically robust understanding of how confident the model is in the prediction. This is important in the understanding of the limitations of model predictions, and also if one wants to do probabilistic modeling of any kind. There are also other applications, such as probabilistic programming and being able to use domain knowledge, but more on that in another post. The ..read more

Visit website

Policy Gradient Reinforcement Learning in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In a series of recent posts, I have been reviewing the various Q based methods of deep reinforcement learning (see here, here, here, here and so on). Deep Q based reinforcement learning operates by training a neural network to learn the Q value for each action a of an agent which resides in a certain state s of the environment. The policy which guides the actions of the agent in this paradigm operates by a random selection of actions at the beginning of training (the epsilon greedy method), but then the agent will select actions based on the highest Q value predicted in eac ..read more

Visit website

Atari Space Invaders and Dueling Q RL in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In previous posts (here and here) I introduced Double Q learning and the Dueling Q architecture. These followed on from posts about deep Q learning, and showed how double Q and dueling Q learning is superior to vanilla deep Q learning. However, these posts only included examples of simplistic environments like the OpenAI Cartpole environment. These types of environments are good to learn on, but more complicated environments are both more interesting and fun. They also demonstrate better the complexities of implementing deep reinforcement learning in realistic cases. In this post, I’ll use sim ..read more

Visit website

Dueling Q networks in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In this post, we’ll be covering Dueling Q networks for reinforcement learning in TensorFlow 2. This reinforcement learning architecture is an improvement on the Double Q architecture, which has been covered here. In this tutorial, I’ll introduce the Dueling Q network architecture, it’s advantages and how to build one in TensorFlow 2. We’ll be running the code on the Open AI gym‘s CartPole environment so that readers can train the network quickly and easily. In future posts, I’ll be showing results on Atari environments which are more complicated. For an introduction to reinforcement learning ..read more

Visit website

Introduction to ResNet in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In previous tutorials, I’ve explained convolutional neural networks (CNN) and shown how to code them. The convolutional layer has proven to be a great success in the area of image recognition and processing in machine learning. However, state of the art techniques don’t involve just a few CNN layers. Rather, they can be very deep, consisting of 10s to >100 numbers of layers. One of the most successful CNN architectures developed has been the ResNet architecture. It was first introduced in 2015 (see this paper) and won the ILSVRC 2015 image classification task. The winning ResNet consisted o ..read more

Visit website

Double Q reinforcement learning in TensorFlow 2

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In previous posts (here and here), deep Q reinforcement learning was introduced. In these posts, examples were presented where neural networks were used to train an agent to act within an environment to maximize rewards. The neural network was trained using something called Q-learning. However, deep Q learning (DQN) has a flaw – it can be unstable due to biased estimates of future rewards, and this slows learning. In this post, I’ll introduce Double Q learning which can solve this bias problem and produce better Q-learning outcomes. We’ll be running a Double Q network on a modified version of ..read more

Visit website

Transfer learning in TensorFlow 2 tutorial

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

In this post, I’m going to cover the very important deep learning concept called transfer learning. Transfer learning is the process whereby one uses neural network models trained in a related domain to accelerate the development of accurate models in your more specific domain of interest. For instance, a deep learning practitioner can use one of the state-of-the-art image classification models, already trained, as a starting point for their own, more specialized, image classification task. In this tutorial, I’ll be showing you how to perform transfer learning using an advanced, pre-train ..read more

Visit website

An introduction to Global Average Pooling in convolutional neural networks

Adventures in Machine Learning » TensorFlow

by andrew.t1981@gmail.com

3y ago

For those familiar with convolutional neural networks (if you’re not, check out this post), you will know that, for many architectures, the final set of layers are often of the fully connected variety. This is like bolting a standard neural network classifier onto the end of an image processor. The convolutional neural network starts with a series of convolutional (and, potentially, pooling) layers which create feature maps which represent different components of the input images. The fully connected layers at the end then “interpret” the output of these features maps and make category predict ..read more

Visit website

Follow Adventures in Machine Learning » TensorFlow on FeedSpot