Posit AI Blog

Posit AI Blog https://blogs.rstudio.com/tensorflow/ News, concepts, and applications as regards deep learning, probabilistic computation, distributed computing and machine learning automation from R. Posit AI Blog https://blogs.rstudio.com/tensorflow/images/favicon.png https://blogs.rstudio.com/tensorflow/ Distill Wed, 30 Oct 2024 00:00:00 +0000 Introducing mall for R...and Python Edgar Ruiz https://blogs.rstudio.com/tensorflow/posts/2024-10-30-mall We are proud to introduce the {mall} package. With {mall}, you can use a local LLM to run NLP operations across a data frame. (sentiment, summarization, translation, etc). {mall} has been simultaneously released to CRAN and PyPi (as an extension to Polars). Python R LLM Polars Natural Language Processing Tabular Data https://blogs.rstudio.com/tensorflow/posts/2024-10-30-mall Wed, 30 Oct 2024 00:00:00 +0000 Introducing Keras 3 for R Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2024-05-21-keras3 We are thrilled to introduce {keras3}, the next version of the Keras R package. {keras3} is a ground-up rebuild of {keras}, maintaining the beloved features of the original while refining and simplifying the API based on valuable insights gathered over the past few years. TensorFlow/Keras R https://blogs.rstudio.com/tensorflow/posts/2024-05-21-keras3 Tue, 21 May 2024 00:00:00 +0000 News from the sparkly-verse Edgar Ruiz https://blogs.rstudio.com/tensorflow/posts/2024-04-22-sparklyr-updates Highlights to the most recent updates to `sparklyr` and friends Packages/Releases Spark R https://blogs.rstudio.com/tensorflow/posts/2024-04-22-sparklyr-updates Mon, 22 Apr 2024 00:00:00 +0000 Chat with AI in RStudio Edgar Ruiz https://blogs.rstudio.com/tensorflow/posts/2024-04-04-chat-with-llms-using-chattr Interact with Github Copilot and OpenAI's GPT (ChatGPT) models directly in RStudio. The `chattr` Shiny add-in makes it easy for you to interact with these and other Large Language Models (LLMs). Generenative Models Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2024-04-04-chat-with-llms-using-chattr Thu, 04 Apr 2024 00:00:00 +0000 Hugging Face Integrations Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-07-12-hugging-face-integrations Hugging Face rapidly became a very popular platform to build, share and collaborate on deep learning applications. We have worked on integrating the torch for R ecosystem with Hugging Face tools, allowing users to load and execute language models from their platform. Torch Releases R https://blogs.rstudio.com/tensorflow/posts/2023-07-12-hugging-face-integrations Wed, 12 Jul 2023 00:00:00 +0000 Understanding LoRA with a minimal example Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-06-22-understanding-lora LoRA (Low Rank Adaptation) is a new technique for fine-tuning deep learning models that works by reducing the number of trainable parameters and enables efficient task switching. In this blog post we will talk about the key ideas behind LoRA in a very minimal torch example. Torch Concepts R https://blogs.rstudio.com/tensorflow/posts/2023-06-22-understanding-lora Thu, 22 Jun 2023 00:00:00 +0000 GPT-2 from scratch with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-06-20-gpt2-torch Implementing a language model from scratch is, arguably, the best way to develop an accurate idea of how its engine works. Here, we use torch to code GPT-2, the immediate successor to the original GPT. In the end, you'll dispose of an R-native model that can make direct use of Hugging Face's pre-trained GPT-2 model weights. Torch R Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2023-06-20-gpt2-torch Tue, 20 Jun 2023 00:00:00 +0000 What are Large Language Models? What are they not? Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-06-20-llm-intro This is a high-level, introductory article about Large Language Models (LLMs), the core technology that enables the much-en-vogue chatbots as well as other Natural Language Processing (NLP) applications. It is directed at a general audience, possibly with some technical and/or scientific background, but no knowledge is assumed of either deep learning or NLP. Having looked at major model ingredients, training workflow, and mechanics of output generation, we also talk about what these models are not. Meta Concepts Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2023-06-20-llm-intro Tue, 20 Jun 2023 00:00:00 +0000 safetensors 0.1.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-06-15-safetensors Announcing safetensors, a new R package allowing for reading and writing files in the safetensors format. Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2023-06-15-safetensors Thu, 15 Jun 2023 00:00:00 +0000 torch 0.11.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-06-07-torch-0-11 torch v0.11.0 is now on CRAN. This release features much-enhanced support for executing JIT operations. We also amended loading of model parameters, and added a few quality-of-life improvements, like support for temporarily modifying the default torch device, support for specifying data types as strings, and many more. Torch Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2023-06-07-torch-0-11 Wed, 07 Jun 2023 00:00:00 +0000 LLaMA in R with Keras and TensorFlow Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2023-05-25-llama-tensorflow-keras Implementation and walk-through of LLaMA, a Large Language Model, in R, with TensorFlow and Keras. TensorFlow/Keras R Generative Models Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2023-05-25-llama-tensorflow-keras Thu, 25 May 2023 00:00:00 +0000 Group-equivariant neural networks with escnn Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-05-09-group-equivariant-cnn-3 Escnn, built on PyTorch, is a library that, in the spirit of Geometric Deep Learning, provides a high-level interface to designing and training group-equivariant neural networks. This post introduces important mathematical concepts, the library's key actors, and essential library use. Torch R Concepts Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2023-05-09-group-equivariant-cnn-3 Tue, 09 May 2023 00:00:00 +0000 luz 0.4.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-04-17-luz-0-4 luz v0.4.0 is now on CRAN. This release adds support for training models on ARM Mac GPUs, reduces the overhead of using luz, and makes it easier to checkpoint and resume failed runs. Torch Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2023-04-17-luz-0-4 Mon, 17 Apr 2023 00:00:00 +0000 torch 0.10.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2023-04-14-torch-0-10 torch v0.10.0 is now on CRAN. This version upgraded the underlying LibTorch to 1.13.1, and added support for Automatic Mixed Precision. As an experimental feature, we now also support pre-built binaries, so you can install torch without having to deal with the CUDA installation. Torch Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2023-04-14-torch-0-10 Fri, 14 Apr 2023 00:00:00 +0000 De-noising Diffusion with torch Daniel Falbel Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-04-13-denoising-diffusion Currently, in generative deep learning, no other approach seems to outperform the family of diffusion models. Would you like to try for yourself? If so, our torch implementation of de-noising diffusion provides an easy-to-use, easy-to-configure interface. Torch R Image Recognition & Image Processing Generative Models https://blogs.rstudio.com/tensorflow/posts/2023-04-13-denoising-diffusion Thu, 13 Apr 2023 00:00:00 +0000 Deep Learning and Scientific Computing with R torch: the book Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-04-05-deep-learning-scientific-computing-R-torch Please allow us to introduce Deep Learning and Scientific Computing with R torch. Released in e-book format today, and available freely online, this book starts out by introducing torch basics. From there, it moves on to various deep-learning use cases. Finally, it shows how to use torch for more general topics, such as matrix computations and the Fourier Transform. Torch R Meta Concepts Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2023-04-05-deep-learning-scientific-computing-R-torch Wed, 05 Apr 2023 00:00:00 +0000 Implementing rotation equivariance: Group-equivariant CNN from scratch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-03-27-group-equivariant-cnn-2 We code up a simple group-equivariant convolutional neural network (GCNN) that is equivariant to rotation. The world may be upside down, but the network will know. Torch R Spatial Data Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2023-03-27-group-equivariant-cnn-2 Mon, 27 Mar 2023 00:00:00 +0000 Upside down, a cat's still a cat: Evolving image recognition with Geometric Deep Learning Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-03-09-group-equivariant-cnn-1 In this first in a series of posts on group-equivariant convolutional neural networks (GCNNs), meet the main actors — groups — and concepts (equivariance). With GCNNs, we finally revisit the topic of Geometric Deep Learning, a principled, math-driven approach to neural networks that has consistently been rising in scope and impact. Torch R Spatial Data Concepts https://blogs.rstudio.com/tensorflow/posts/2023-03-09-group-equivariant-cnn-1 Thu, 09 Mar 2023 00:00:00 +0000 AO, NAO, ENSO: A wavelet analysis example Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2023-01-19-torchwavelets El Niño-Southern Oscillation (ENSO), North Atlantic Oscillation (NAO), and Arctic Oscillation (AO) are atmospheric phenomena of global impact that strongly affect people's lives. ENSO, first and foremost, brings with it floods, droughts, and ensuing poverty, in developing countries in the Southern Hemisphere. Here, we use the new torchwavelets package to comparatively inspect patterns in the three series. Torch R Packages/Releases Time Series https://blogs.rstudio.com/tensorflow/posts/2023-01-19-torchwavelets Thu, 19 Jan 2023 00:00:00 +0000 Wavelet Transform - with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-10-27-wavelets torch does not have built-in functionality to do wavelet analysis. But we can efficiently implement what we need, making use of the Fast Fourier Transform (FFT). This post is a very first introduction to wavelets, suitable for readers that have not encountered it before. At the same time, it provides useful starter code, showing an (extensible) way to perform wavelet analysis in torch. It is an excerpt from the corresponding chapter in the forthcoming book, Deep Learning and Scientific Computing with R torch, to be published by CRC Press. Torch R Concepts https://blogs.rstudio.com/tensorflow/posts/2022-10-27-wavelets Thu, 27 Oct 2022 00:00:00 +0000 torch 0.9.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2022-10-25-torch-0-9 torch v0.9.0 is now on CRAN. This version adds support for ARM systems running macOS, and brings significant performance improvements. Torch Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2022-10-25-torch-0-9 Tue, 25 Oct 2022 00:00:00 +0000 Discrete Fourier Transform - with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-10-20-dft About the Fourier Transform, it has been said that it is one of the greatest wonders of the universe. At the same time, it can be realized in a mere half-dozen lines of code. Even if in the end, you're just going to call torch's built-in functions directly, it helps to understand, and be able to reproduce in code, the ideas that underlie the magic. This post is an excerpt from the forthcoming book, Deep Learning and Scientific Computing with R torch, to be published by CRC Press. Torch R Concepts https://blogs.rstudio.com/tensorflow/posts/2022-10-20-dft Thu, 20 Oct 2022 00:00:00 +0000 Five ways to do least squares (with torch) Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-10-13-torch-linalg Get to know torch's linalg module, all while learning about different ways to do least-squares regression from scratch. This post is a condensed version of the corresponding chapter in the forthcoming book, Deep Learning and Scientific Computing with R torch, to be published by CRC Press. Torch R Concepts Tabular Data https://blogs.rstudio.com/tensorflow/posts/2022-10-13-torch-linalg Thu, 13 Oct 2022 00:00:00 +0000 Audio classification with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-10-06-audio-classification-torch Learn how to classify speech utterances with torch, making use of domain knowledge and deep learning. This post is a condensed version of the corresponding chapter in the forthcoming book, Deep Learning and Scientific Computing with R torch, to be published by CRC Press. Torch R Audio Processing https://blogs.rstudio.com/tensorflow/posts/2022-10-06-audio-classification-torch Thu, 06 Oct 2022 00:00:00 +0000 Introducing the text package Oscar Kjell Salvatore Giorgi H Andrew Schwartz https://blogs.rstudio.com/tensorflow/posts/2022-09-29-r-text The text package attempts to provide user-friendly access and pipelines to HuggingFace's transformer language models in R. Natural Language Processing R https://blogs.rstudio.com/tensorflow/posts/2022-09-29-r-text Tue, 04 Oct 2022 00:00:00 +0000 luz 0.3.0 Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2022-08-24-luz-0-3 luz version 0.3.0 is now on CRAN. luz is a high-level interface for torch. Torch Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2022-08-24-luz-0-3 Wed, 24 Aug 2022 00:00:00 +0000 TensorFlow and Keras 2.9 Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2022-06-09-tf-2-9 New TensorFlow and Keras releases bring improvements big and small. TensorFlow/Keras Packages/Releases R https://blogs.rstudio.com/tensorflow/posts/2022-06-09-tf-2-9 Thu, 09 Jun 2022 00:00:00 +0000 Deep Learning with R, 2nd Edition Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2022-05-31-deep-learning-with-R-2e Announcing the release of "Deep Learning with R, 2nd Edition," a book that shows you how to get started with deep learning in R. TensorFlow/Keras R https://blogs.rstudio.com/tensorflow/posts/2022-05-31-deep-learning-with-R-2e Tue, 31 May 2022 00:00:00 +0000 Community spotlight: Fun with torchopt Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-05-18-torchopt Today, we want to call attention to a highly useful package in the torch ecosystem: torchopt. It extends torch by providing a set of popular optimization algorithms not available in the base library. As this post will show, it is also fun to use! Torch R Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2022-05-18-torchopt Wed, 18 May 2022 00:00:00 +0000 torch outside the box Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2022-04-27-torch-outside-the-box Sometimes, a software's best feature is the one you've added yourself. This post shows by example why you may want to extend torch, and how to proceed. It also explains a bit of what is going on in the background. Torch R https://blogs.rstudio.com/tensorflow/posts/2022-04-27-torch-outside-the-box Wed, 27 Apr 2022 00:00:00 +0000 Pre-processing layers in keras: What they are and how to use them Sigrid Keydana Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2021-12-09-keras-preprocessing-layers For keras, the last two releases have brought important new functionality, in terms of both low-level infrastructure and workflow enhancements. This post focuses on an outstanding example of the latter category: a new family of layers designed to help with pre-processing, data-augmentation, and feature-engineering tasks. TensorFlow/Keras R https://blogs.rstudio.com/tensorflow/posts/2021-12-09-keras-preprocessing-layers Thu, 09 Dec 2021 00:00:00 +0000 Revisiting Keras for R Sigrid Keydana Tomasz Kalinowski https://blogs.rstudio.com/tensorflow/posts/2021-11-18-keras-updates It's been a while since this blog featured content about Keras for R, so you might've thought that the project was dormant. It's not! In fact, Keras for R is better than ever, with two recent releases adding powerful capabilities that considerably lighten previously tedious tasks. This post provides a high-level overview. Future posts will go into more detail on some of the most helpful new features, as well as dive into the powerful low-level enhancements that make the former possible. Packages/Releases TensorFlow/Keras R https://blogs.rstudio.com/tensorflow/posts/2021-11-18-keras-updates Thu, 18 Nov 2021 00:00:00 +0000 Train in R, run on Android: Image segmentation with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-10-29-segmentation-torch-android We train a model for image segmentation in R, using torch together with luz, its high-level interface. We then JIT-trace the model on example input, so as to obtain an optimized representation that can run with no R installed. Finally, we show the model being run on Android. Torch Image Recognition & Image Processing R https://blogs.rstudio.com/tensorflow/posts/2021-10-29-segmentation-torch-android Fri, 29 Oct 2021 00:00:00 +0000 Beyond alchemy: A first look at geometric deep learning Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-08-26-geometric-deep-learning Geometric deep learning is a "program" that aspires to situate deep learning architectures and techniques in a framework of mathematical priors. The priors, such as various types of invariance, first arise in some physical domain. A neural network that well matches the domain will preserve as many invariances as possible. In this post, we present a very conceptual, high-level overview, and highlight a few applications. Concepts Meta R https://blogs.rstudio.com/tensorflow/posts/2021-08-26-geometric-deep-learning Thu, 26 Aug 2021 00:00:00 +0000 torch: Just-in-time compilation (JIT) for R-less model deployment Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-08-10-jit-trace-module Using the torch just-in-time (JIT) compiler, it is possible to query a model trained in R from a different language, provided that language can make use of the low-level libtorch library. This post shows how. In addition, we try to untangle a bit of the terminological jumble surrounding the topic. Torch R https://blogs.rstudio.com/tensorflow/posts/2021-08-10-jit-trace-module Tue, 10 Aug 2021 00:00:00 +0000 Starting to think about AI Fairness Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-07-15-AI-fairness The topic of AI fairness metrics is as important to society as it is confusing. Confusing it is due to a number of reasons: terminological proliferation, abundance of formulae, and last not least the impression that everyone else seems to know what they're talking about. This text hopes to counteract some of that confusion by starting from a common-sense approach of contrasting two basic positions: On the one hand, the assumption that dataset features may be taken as reflecting the underlying concepts ML practitioners are interested in; on the other, that there inevitably is a gap between concept and measurement, a gap that may be bigger or smaller depending on what is being measured. In contrasting these fundamental views, we bring together concepts from ML, legal science, and political philosophy. R Concepts Meta AI & Society https://blogs.rstudio.com/tensorflow/posts/2021-07-15-AI-fairness Thu, 15 Jul 2021 00:00:00 +0000 sparklyr.sedona: A sparklyr extension for analyzing geospatial data Yitao Li https://blogs.rstudio.com/tensorflow/posts/2021-07-07-sparklyr-sedona We are excited to announce the availability of sparklyr.sedona, a sparklyr extension making geospatial functionalities of the Apache Sedona library easily accessible from R. R Packages/Releases Distributed Computing Spatial Data https://blogs.rstudio.com/tensorflow/posts/2021-07-07-sparklyr-sedona Wed, 07 Jul 2021 00:00:00 +0000 sparklyr 1.7: New data sources and spark_apply() capabilities, better interfaces for sparklyr extensions, and more! Yitao Li https://blogs.rstudio.com/tensorflow/posts/2021-07-06-sparklyr-1.7.0-released Sparklyr 1.7 delivers much-anticipated improvements, including R interfaces for image and binary data sources, several new spark_apply() capabilities, and better integration with sparklyr extensions. R Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2021-07-06-sparklyr-1.7.0-released Tue, 06 Jul 2021 00:00:00 +0000 Que haja luz: More light for torch! Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-06-17-luz Today, we're introducing luz, a high-level interface to torch that lets you train neural networks in a concise, declarative style. In some sense, it is to torch what Keras is to TensorFlow: It provides both a streamlined workflow and powerful ways for customization. Torch R Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2021-06-17-luz Thu, 17 Jun 2021 00:00:00 +0000 torch for optimization Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-04-22-torch-for-optimization Torch is not just for deep learning. Its L-BFGS optimizer, complete with Strong-Wolfe line search, is a powerful tool in unconstrained as well as constrained optimization. Torch R https://blogs.rstudio.com/tensorflow/posts/2021-04-22-torch-for-optimization Tue, 27 Apr 2021 00:00:00 +0000 sparklyr 1.6: weighted quantile summaries, power iteration clustering, spark_write_rds(), and more Yitao Li https://blogs.rstudio.com/tensorflow/posts/2021-03-25-sparklyr-1.6.0-released The sparklyr 1.6 release introduces weighted quantile summaries, an R interface to power iteration clustering, spark_write_rds(), as well as a number of dplyr-related improvements. R Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2021-03-25-sparklyr-1.6.0-released Thu, 25 Mar 2021 00:00:00 +0000 torch time series, final episode: Attention Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-03-19-forecasting-time-series-with-torch_4 We conclude our mini-series on time-series forecasting with torch by augmenting last time's sequence-to-sequence architecture with a technique both immensely popular in natural language processing and inspired by human (and animal) cognition: attention. Torch R Time Series https://blogs.rstudio.com/tensorflow/posts/2021-03-19-forecasting-time-series-with-torch_4 Fri, 19 Mar 2021 00:00:00 +0000 torch time series, take three: Sequence-to-sequence prediction Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-03-16-forecasting-time-series-with-torch_3 In our overview of techniques for time-series forecasting, we move on to sequence-to-sequence models. Architectures in this family are commonly used in natural language processing (NLP) tasks, such as machine translation. With NLP, however, significant pre-processing is required before proceeding to model definition and training. In staying with our familiar numerical series, we can fully concentrate on the concepts. Torch R Time Series https://blogs.rstudio.com/tensorflow/posts/2021-03-16-forecasting-time-series-with-torch_3 Tue, 16 Mar 2021 00:00:00 +0000 torch time series continued: A first go at multi-step prediction Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-03-11-forecasting-time-series-with-torch_2 We continue our exploration of time-series forecasting with torch, moving on to architectures designed for multi-step prediction. Here, we augment the "workhorse RNN" by a multi-layer perceptron (MLP) to extrapolate multiple timesteps into the future. Torch R Time Series https://blogs.rstudio.com/tensorflow/posts/2021-03-11-forecasting-time-series-with-torch_2 Thu, 11 Mar 2021 00:00:00 +0000 Introductory time-series forecasting with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-03-10-forecasting-time-series-with-torch_1 This post is an introduction to time-series forecasting with torch. Central topics are data input, and practical usage of RNNs (GRUs/LSTMs). Upcoming posts will build on this, and introduce increasingly involved architectures. Torch R Time Series https://blogs.rstudio.com/tensorflow/posts/2021-03-10-forecasting-time-series-with-torch_1 Wed, 10 Mar 2021 00:00:00 +0000 First mlverse survey results – software, applications, and beyond Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-02-17-survey Last month, we conducted our first survey on mlverse software, covering topics ranging from area of application through software usage to user wishes and suggestions. In addition, the survey asked about thoughts on social impacts of AI/ML. This post presents the results, and tries to address some of the things that came up. Meta R https://blogs.rstudio.com/tensorflow/posts/2021-02-17-survey Wed, 17 Feb 2021 00:00:00 +0000 torch, tidymodels, and high-energy physics Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-02-11-tabnet Today we introduce tabnet, a torch implementation of "TabNet: Attentive Interpretable Tabular Learning" that is fully integrated with the tidymodels framework. Per se, already, tabnet was designed to require very little data pre-processing; thanks to tidymodels, hyperparameter tuning (so often cumbersome in deep learning) becomes convenient and even, fun! Torch R Tabular Data https://blogs.rstudio.com/tensorflow/posts/2021-02-11-tabnet Thu, 11 Feb 2021 00:00:00 +0000 Simple audio classification with torch Athos Damiani https://blogs.rstudio.com/tensorflow/posts/2021-02-04-simple-audio-classification-with-torch This article translates Daniel Falbel's post on "Simple Audio Classification" from TensorFlow/Keras to torch/torchaudio. Torch R Audio Processing https://blogs.rstudio.com/tensorflow/posts/2021-02-04-simple-audio-classification-with-torch Thu, 04 Feb 2021 00:00:00 +0000 Forecasting El Niño-Southern Oscillation (ENSO) Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2021-02-02-enso-prediction El Niño-Southern Oscillation (ENSO) is an atmospheric phenomenon, located in the tropical Pacific, that greatly affects ecosystems as well as human well-being on a large portion of the globe. We use the convLSTM introduced in a prior post to predict the Niño 3.4 Index from spatially-ordered sequences of sea surface temperatures. Torch R Image Recognition & Image Processing Time Series Spatial Data https://blogs.rstudio.com/tensorflow/posts/2021-02-02-enso-prediction Tue, 02 Feb 2021 00:00:00 +0000 Convolutional LSTM for spatial forecasting Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-12-17-torch-convlstm In forecasting spatially-determined phenomena (the weather, say, or the next frame in a movie), we want to model temporal evolution, ideally using recurrence relations. At the same time, we'd like to efficiently extract spatial features, something that is normally done with convolutional filters. Ideally then, we'd have at our disposal an architecture that is both recurrent and convolutional. In this post, we build a convolutional LSTM with torch. Torch R Image Recognition & Image Processing Time Series Spatial Data https://blogs.rstudio.com/tensorflow/posts/2020-12-17-torch-convlstm Thu, 17 Dec 2020 00:00:00 +0000 torch 0.2.0 - Initial JIT support and many bug fixes Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2020-12-15-torch-0.2.0-released The torch 0.2.0 release includes many bug fixes and some nice new features like initial JIT support, multi-worker dataloaders, new optimizers and a new print method for nn_modules. Torch R Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2020-12-15-torch-0.2.0-released Tue, 15 Dec 2020 00:00:00 +0000 sparklyr 1.5: better dplyr interface, more sdf_* functions, and RDS-based serialization routines Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-12-14-sparklyr-1.5.0-released Unlike all three previous sparklyr releases, the recent release of sparklyr 1.5 placed much more emphasis on enhancing existing sparklyr features rather than creating new ones. As a result, many valuable suggestions from sparklyr users were taken into account and were successfully addressed in a long list of bug fixes and improvements. R Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2020-12-14-sparklyr-1.5.0-released Mon, 14 Dec 2020 00:00:00 +0000 Brain image segmentation with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-11-30-torch-brain-segmentation The need to segment images arises in various sciences and their applications, many of which are vital to human (and animal) life. In this introductory post, we train a U-Net to mark lesioned regions on MRI brain scans. Torch R Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2020-11-30-torch-brain-segmentation Mon, 30 Nov 2020 00:00:00 +0000 torch for tabular data Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-11-03-torch-tabular How not to die from poisonous mushrooms. Also: How to use torch for deep learning on tabular data, including a mix of categorical and numerical features. Torch R Tabular Data https://blogs.rstudio.com/tensorflow/posts/2020-11-03-torch-tabular Tue, 03 Nov 2020 00:00:00 +0000 Classifying images with torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-10-19-torch-image-classification We learn about transfer learning, input pipelines, and learning rate schedulers, all while using torch to tell apart species of beautiful birds. Torch R Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2020-10-19-torch-image-classification Mon, 19 Oct 2020 00:00:00 +0000 sparklyr.flint 0.2: ASOF Joins, OLS Regression, and additional summarizers Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-10-12-sparklyr-flint-0.2.0-released We are excited to announce a number of powerful, new functionalities and improvements which are now part of sparklyr.flint 0.2! R Packages/Releases Time Series https://blogs.rstudio.com/tensorflow/posts/2020-10-12-sparklyr-flint-0.2.0-released Mon, 12 Oct 2020 00:00:00 +0000 Optimizers in torch Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-10-09-torch-optim Today, we wrap up our mini-series on torch basics, adding to our toolset two abstractions: loss functions and optimizers. Torch R https://blogs.rstudio.com/tensorflow/posts/2020-10-09-torch-optim Fri, 09 Oct 2020 00:00:00 +0000 Using torch modules Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-10-07-torch-modules In this third installment of our mini-series introducing torch basics, we replace hand-coded matrix operations by modules, considerably simplifying our toy network's code. Torch R https://blogs.rstudio.com/tensorflow/posts/2020-10-07-torch-modules Wed, 07 Oct 2020 00:00:00 +0000 Introducing torch autograd Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-10-05-torch-network-with-autograd With torch, there is hardly ever a reason to code backpropagation from scratch. Its automatic differentiation feature, called autograd, keeps track of operations that need their gradients computed, as well as how to compute them. In this second post of a four-part series, we update our simple, hand-coded network to make use of autograd. Torch R https://blogs.rstudio.com/tensorflow/posts/2020-10-05-torch-network-with-autograd Mon, 05 Oct 2020 00:00:00 +0000 Getting familiar with torch tensors Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-10-01-torch-network-from-scratch In this first installment of a four-part miniseries, we present the main things you will want to know about torch tensors. As an illustrative example, we'll code a simple neural network from scratch. Torch R https://blogs.rstudio.com/tensorflow/posts/2020-10-01-torch-network-from-scratch Thu, 01 Oct 2020 00:00:00 +0000 sparklyr 1.4: Weighted Sampling, Tidyr Verbs, Robust Scaler, RAPIDS, and more Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-09-30-sparklyr-1.4.0-released Sparklyr 1.4 is now available! This release comes with delightful new features such as weighted sampling and tidyr verbs support for Spark dataframes, robust scaler for standardizing data based on median and interquartile range, spark_connect interface for RAPIDS GPU acceleration plugin, as well as a number of dplyr-related improvements. R Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2020-09-30-sparklyr-1.4.0-released Wed, 30 Sep 2020 00:00:00 +0000 Please allow me to introduce myself: Torch for R Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-09-29-introducing-torch-for-r Today, we are excited to introduce torch, an R package that allows you to use PyTorch-like functionality natively from R. No Python installation is required: torch is built directly on top of libtorch, a C++ library that provides the tensor-computation and automatic-differentiation capabilities essential to building neural networks. Packages/Releases Torch R https://blogs.rstudio.com/tensorflow/posts/2020-09-29-introducing-torch-for-r Tue, 29 Sep 2020 00:00:00 +0000 Introducing sparklyr.flint: A time-series extension for sparklyr Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-09-07-sparklyr-flint We are pleased to announce that sparklyr.flint, a sparklyr extension for analyzing time series at scale with Flint, is now available on CRAN. Flint is an open-source library for working with time-series in Apache Spark which supports aggregates and joins on time-series datasets. R Time Series https://blogs.rstudio.com/tensorflow/posts/2020-09-07-sparklyr-flint Mon, 07 Sep 2020 00:00:00 +0000 An introduction to weather forecasting with deep learning Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-09-01-weather-prediction A few weeks ago, we showed how to forecast chaotic dynamical systems with deep learning, augmented by a custom constraint derived from domain-specific insight. Global weather is a chaotic system, but of much higher complexity than many tasks commonly addressed with machine and/or deep learning. In this post, we provide a practical introduction featuring a simple deep learning baseline for atmospheric forecasting. While far away from being competitive, it serves to illustrate how more sophisticated and compute-intensive models may approach that formidable task by means of methods situated on the "black-box end" of the continuum. R TensorFlow/Keras Time Series Spatial Data https://blogs.rstudio.com/tensorflow/posts/2020-09-01-weather-prediction Tue, 01 Sep 2020 00:00:00 +0000 Training ImageNet with R Javier Luraschi https://blogs.rstudio.com/tensorflow/posts/2020-08-24-training-imagenet-with-r This post explores how to train large datasets with TensorFlow and R. Specifically, we present how to download and repartition ImageNet, followed by training ImageNet across multiple GPUs in distributed environments using TensorFlow and Apache Spark. R TensorFlow/Keras Distributed Computing Data Management https://blogs.rstudio.com/tensorflow/posts/2020-08-24-training-imagenet-with-r Mon, 24 Aug 2020 00:00:00 +0000 Deepfake detection challenge from R Turgut Abdullayev https://blogs.rstudio.com/tensorflow/posts/2020-08-18-deepfake A couple of months ago, Amazon, Facebook, Microsoft, and other contributors initiated a challenge consisting of telling apart real and AI-generated ("fake") videos. We show how to approach this challenge from R. Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2020-08-18-deepfake Tue, 18 Aug 2020 00:00:00 +0000 FNN-VAE for noisy time series forecasting Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-07-31-fnn-vae-for-noisy-timeseries In the last part of this mini-series on forecasting with false nearest neighbors (FNN) loss, we replace the LSTM autoencoder from the previous post by a convolutional VAE, resulting in equivalent prediction performance but significantly lower training time. In addition, we find that FNN regularization is of great help when an underlying deterministic process is obscured by substantial noise. R TensorFlow/Keras Time Series Generative Models https://blogs.rstudio.com/tensorflow/posts/2020-07-31-fnn-vae-for-noisy-timeseries Fri, 31 Jul 2020 00:00:00 +0000 State-of-the-art NLP models from R Turgut Abdullayev https://blogs.rstudio.com/tensorflow/posts/2020-07-30-state-of-the-art-nlp-models-from-r Nowadays, Microsoft, Google, Facebook, and OpenAI are sharing lots of state-of-the-art models in the field of Natural Language Processing. However, fewer materials exist how to use these models from R. In this post, we will show how R users can access and benefit from these models as well. Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2020-07-30-state-of-the-art-nlp-models-from-r Thu, 30 Jul 2020 00:00:00 +0000 Parallelized sampling using exponential variates Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-07-29-parallelized-sampling How can the seemingly iterative process of weighted sampling without replacement be transformed into something highly parallelizable? Turns out a well-known technique based on exponential variates accomplishes exactly that. Concepts Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2020-07-29-parallelized-sampling Wed, 29 Jul 2020 00:00:00 +0000 Time series prediction with FNN-LSTM Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-07-20-fnn-lstm In a recent post, we showed how an LSTM autoencoder, regularized by false nearest neighbors (FNN) loss, can be used to reconstruct the attractor of a nonlinear, chaotic dynamical system. Here, we explore how that same technique assists in prediction. Matched up with a comparable, capacity-wise, "vanilla LSTM", FNN-LSTM improves performance on a set of very different, real-world datasets, especially for the initial steps in a multi-step forecast. R TensorFlow/Keras Time Series Generative Models https://blogs.rstudio.com/tensorflow/posts/2020-07-20-fnn-lstm Mon, 20 Jul 2020 00:00:00 +0000 sparklyr 1.3: Higher-order Functions, Avro and Custom Serializers Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-07-16-sparklyr-1.3.0-released Sparklyr 1.3 is now available, featuring exciting new functionalities such as integration of Spark higher-order functions and data import/export in Avro and in user-defined serialization formats. Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2020-07-16-sparklyr-1.3.0-released Thu, 16 Jul 2020 00:00:00 +0000 Deep attractors: Where deep learning meets chaos Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-06-24-deep-attractors In nonlinear dynamics, when the state space is thought to be multidimensional but all we have for data is just a univariate time series, one may attempt to reconstruct the true space via delay coordinate embeddings. However, it is not clear a priori how to choose dimensionality and time lag of the reconstruction space. In this post, we show how to use an autoencoder architecture to circumvent the problem: Given just a scalar series of observations, the autoencoder directly learns to represent attractors of chaotic systems in adequate dimensionality. R TensorFlow/Keras Time Series Generative Models https://blogs.rstudio.com/tensorflow/posts/2020-06-24-deep-attractors Wed, 24 Jun 2020 00:00:00 +0000 Easy PixelCNN with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-05-29-pixelcnn PixelCNN is a deep learning architecture - or bundle of architectures - designed to generate highly realistic-looking images. To use it, no reverse-engineering of arXiv papers or search for reference implementations is required: TensorFlow Probability and its R wrapper, tfprobability, now include a PixelCNN distribution that can be used to train a straightforwardly-defined neural network in a parameterizable way. R Image Recognition & Image Processing TensorFlow/Keras Probabilistic ML/DL Generative Models https://blogs.rstudio.com/tensorflow/posts/2020-05-29-pixelcnn Fri, 29 May 2020 00:00:00 +0000 Hacking deep learning: model inversion attack by example Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-05-15-model-inversion-attacks Compared to other applications, deep learning models might not seem too likely as victims of privacy attacks. However, methods exist to determine whether an entity was used in the training set (an adversarial attack called member inference), and techniques subsumed under "model inversion" allow to reconstruct raw data input given just model output (and sometimes, context information). This post shows an end-to-end example of model inversion, and explores mitigation strategies using TensorFlow Privacy. R Privacy & Security TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2020-05-15-model-inversion-attacks Fri, 15 May 2020 00:00:00 +0000 Towards privacy: Encrypted deep learning with Syft and Keras Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-04-29-encrypted_keras_with_syft Deep learning need not be irreconcilable with privacy protection. Federated learning enables on-device, distributed model training; encryption keeps model and gradient updates private; differential privacy prevents the training data from leaking. As of today, private and secure deep learning is an emerging technology. In this post, we introduce Syft, an open-source framework that integrates with PyTorch as well as TensorFlow. In an example use case, we obtain private predictions from a Keras model. R Privacy & Security TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2020-04-29-encrypted_keras_with_syft Wed, 29 Apr 2020 00:00:00 +0000 sparklyr 1.2: Foreach, Spark 3.0 and Databricks Connect Yitao Li https://blogs.rstudio.com/tensorflow/posts/2020-04-21-sparklyr-1.2.0-released A new sparklyr release is now available. This sparklyr 1.2 release features new functionalities such as support for Databricks Connect, a Spark backend for the 'foreach' package, inter-op improvements for working with Spark 3.0 preview, as well as a number of bug fixes and improvements addressing user-visible pain points. R Packages/Releases Distributed Computing https://blogs.rstudio.com/tensorflow/posts/2020-04-21-sparklyr-1.2.0-released Tue, 21 Apr 2020 00:00:00 +0000 pins 0.4: Versioning Javier Luraschi https://blogs.rstudio.com/tensorflow/posts/2020-04-13-pins-04 A new release of pins is available on CRAN today. This release adds support to time travel across dataset versions, which improves collaboration and protects your code from breaking when remote resources change unexpectedly. R Packages/Releases Data Management https://blogs.rstudio.com/tensorflow/posts/2020-04-13-pins-04 Mon, 13 Apr 2020 00:00:00 +0000 A first look at federated learning with TensorFlow Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-04-08-tf-federated-intro The term "federated learning" was coined to describe a form of distributed model training where the data remains on client devices, i.e., is never shipped to the coordinating server. In this post, we introduce central concepts and run first experiments with TensorFlow Federated, using R. Privacy & Security TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2020-04-08-tf-federated-intro Wed, 08 Apr 2020 00:00:00 +0000 Introducing: The RStudio AI Blog The Multiverse Team https://blogs.rstudio.com/tensorflow/posts/2020-04-01-rstudio-ai-blog This blog just got a new title: RStudio AI Blog. We explain why. Meta https://blogs.rstudio.com/tensorflow/posts/2020-04-01-rstudio-ai-blog Mon, 30 Mar 2020 00:00:00 +0000 Infinite surprise - the iridescent personality of Kullback-Leibler divergence Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-02-19-kl-divergence Kullback-Leibler divergence is not just used to train variational autoencoders or Bayesian networks (and not just a hard-to-pronounce thing). It is a fundamental concept in information theory, put to use in a vast range of applications. Most interestingly, it's not always about constraint, regularization or compression. Quite on the contrary, sometimes it is about novelty, discovery and surprise. Probabilistic ML/DL Concepts https://blogs.rstudio.com/tensorflow/posts/2020-02-19-kl-divergence Wed, 19 Feb 2020 00:00:00 +0000 NumPy-style broadcasting for R TensorFlow users Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-01-24-numpy-broadcasting Broadcasting, as done by Python's scientific computing library NumPy, involves dynamically extending shapes so that arrays of different sizes may be passed to operations that expect conformity - such as adding or multiplying elementwise. In NumPy, the way broadcasting works is specified exactly; the same rules apply to TensorFlow operations. For anyone who finds herself, occasionally, consulting Python code, this post strives to explain. TensorFlow/Keras Concepts https://blogs.rstudio.com/tensorflow/posts/2020-01-24-numpy-broadcasting Fri, 24 Jan 2020 00:00:00 +0000 First experiments with TensorFlow mixed-precision training Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2020-01-13-mixed-precision-training TensorFlow 2.1, released last week, allows for mixed-precision training, making use of the Tensor Cores available in the most recent NVidia GPUs. In this post, we report first experimental results and provide some background on what this is all about. TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2020-01-13-mixed-precision-training Mon, 13 Jan 2020 00:00:00 +0000 Differential Privacy with TensorFlow Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-12-20-differential-privacy Differential Privacy guarantees that results of a database query are basically independent of the presence in the data of a single individual. Applied to machine learning, we expect that no single training example influences the parameters of the trained model in a substantial way. This post introduces TensorFlow Privacy, a library built on top of TensorFlow, that can be used to train differentially private deep learning models from R. Privacy & Security TensorFlow/Keras Time Series https://blogs.rstudio.com/tensorflow/posts/2019-12-20-differential-privacy Fri, 20 Dec 2019 00:00:00 +0000 tfhub: R interface to TensorFlow Hub Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2019-12-18-tfhub-0.7.0 TensorFlow Hub is a library for the publication, discovery, and consumption of reusable parts of machine learning models. A module is a self-contained piece of a TensorFlow graph, along with its weights and assets, that can be reused across different tasks in a process known as transfer learning. TensorFlow/Keras Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2019-12-18-tfhub-0.7.0 Wed, 18 Dec 2019 00:00:00 +0000 Gaussian Process Regression with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-12-10-variational-gaussian-process Continuing our tour of applications of TensorFlow Probability (TFP), after Bayesian Neural Networks, Hamiltonian Monte Carlo and State Space Models, here we show an example of Gaussian Process Regression. In fact, what we see is a rather "normal" Keras network, defined and trained in pretty much the usual way, with TFP's Variational Gaussian Process layer pulling off all the magic. Probabilistic ML/DL TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-12-10-variational-gaussian-process Tue, 10 Dec 2019 00:00:00 +0000 Getting started with Keras from R - the 2020 edition Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-11-27-gettingstarted-2020 Looking for materials to get started with deep learning from R? This post presents useful tutorials, guides, and background documentation on the new TensorFlow for R website. Advanced users will find pointers to applications of new release 2.0 (or upcoming 2.1!) features alluded to in the recent TensorFlow 2.0 post. Packages/Releases TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-11-27-gettingstarted-2020 Wed, 27 Nov 2019 00:00:00 +0000 Variational convnets with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-11-13-variational-convnet In a Bayesian neural network, layer weights are distributions, not tensors. Using tfprobability, the R wrapper to TensorFlow Probability, we can build regular Keras models that have probabilistic layers, and thus get uncertainty estimates "for free". In this post, we show how to define, train and obtain predictions from a probabilistic convolutional neural network. Probabilistic ML/DL Time Series TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-11-13-variational-convnet Wed, 13 Nov 2019 00:00:00 +0000 tfprobability 0.8 on CRAN: Now how can you use it? Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-11-07-tfp-cran Part of the r-tensorflow ecosystem, tfprobability is an R wrapper to TensorFlow Probability, the Python probabilistic programming framework developed by Google. We take the occasion of tfprobability's acceptance on CRAN to give a high-level introduction, highlighting interesting use cases and applications. Probabilistic ML/DL Packages/Releases TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-11-07-tfp-cran Thu, 07 Nov 2019 00:00:00 +0000 Innocent unicorns considered harmful? How to experiment with GPT-2 from R Sigrid Keydana Javier Luraschi https://blogs.rstudio.com/tensorflow/posts/2019-10-23-gpt-2 Is society ready to deal with challenges brought about by artificially-generated information - fake images, fake videos, fake text? While this post won't answer that question, it should help form an opinion on the threat exerted by fake text as of this writing, autumn 2019. We introduce gpt2, an R package that wraps OpenAI's public implementation of GPT-2, the language model that early this year surprised the NLP community with the unprecedented quality of its creations. Natural Language Processing Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2019-10-23-gpt-2 Wed, 23 Oct 2019 00:00:00 +0000 TensorFlow 2.0 is here - what changes for R users? Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-10-08-tf2-whatchanges TensorFlow 2.0 was finally released last week. As R users we have two kinds of questions. First, will my keras code still run? And second, what is it that changes? In this post, we answer both and, then, give a tour of exciting new developments in the r-tensorflow ecosystem. TensorFlow/Keras Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2019-10-08-tf2-whatchanges Tue, 08 Oct 2019 00:00:00 +0000 On leapfrogs, crashing satellites, and going nuts: A very first conceptual introduction to Hamiltonian Monte Carlo Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-10-03-intro-to-hmc TensorFlow Probability, and its R wrapper tfprobability, provide Markov Chain Monte Carlo (MCMC) methods that were used in a number of recent posts on this blog. These posts were directed to users already comfortable with the method, and terminology, per se, which readers mainly interested in deep learning won't necessarily be. Here we try to make up leeway, introducing Hamitonian Monte Carlo (HMC) as well as a few often-heard "buzzwords" accompanying it, always striving to keep in mind what it is all "for". Bayesian Modeling Concepts https://blogs.rstudio.com/tensorflow/posts/2019-10-03-intro-to-hmc Thu, 03 Oct 2019 00:00:00 +0000 BERT from R Turgut Abdullayev https://blogs.rstudio.com/tensorflow/posts/2019-09-30-bert-r A deep learning model - BERT from Google AI Research - has yielded state-of-the-art results in a wide variety of Natural Language Processing (NLP) tasks. In this tutorial, we will show how to load and train the BERT model from R, using Keras. Natural Language Processing TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-09-30-bert-r Mon, 30 Sep 2019 00:00:00 +0000 So, how come we can use TensorFlow from R? Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-08-29-using-tf-from-r Have you ever wondered why you can call TensorFlow - mostly known as a Python framework - from R? If not - that's how it should be, as the R packages keras and tensorflow aim to make this process as transparent as possible to the user. But for them to be those helpful genies, someone else first has to tame the Python. TensorFlow/Keras Meta Concepts https://blogs.rstudio.com/tensorflow/posts/2019-08-29-using-tf-from-r Thu, 29 Aug 2019 00:00:00 +0000 Image segmentation with U-Net Daniel Falbel Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-08-23-unet In image segmentation, every pixel of an image is assigned a class. Depending on the application, classes could be different cell types; or the task could be binary, as in "cancer cell yes or no?". Area of application notwithstanding, the established neural network architecture of choice is U-Net. In this post, we show how to preprocess data and train a U-Net model on the Kaggle Carvana image segmentation data. Image Recognition & Image Processing TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-08-23-unet Fri, 23 Aug 2019 00:00:00 +0000 Modeling censored data with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-07-31-censored-data In this post we use tfprobability, the R interface to TensorFlow Probability, to model censored data. Again, the exposition is inspired by the treatment of this topic in Richard McElreath's Statistical Rethinking. Instead of cute cats though, we model immaterial entities from the cold world of technology: This post explores durations of CRAN package checks, a dataset that comes with Max Kuhn's parsnip. Bayesian Modeling TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-07-31-censored-data Wed, 31 Jul 2019 00:00:00 +0000 TensorFlow feature columns: Transforming your data recipes-style Daniel Falbel Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-07-09-feature-columns TensorFlow feature columns provide useful functionality for preprocessing categorical data and chaining transformations, like bucketization or feature crossing. From R, we use them in popular "recipes" style, creating and subsequently refining a feature specification. In this post, we show how using feature specs frees cognitive resources and lets you focus on what you really want to accomplish. What's more, because of its elegance, feature-spec code reads nice and is fun to write as well. TensorFlow/Keras Tabular Data https://blogs.rstudio.com/tensorflow/posts/2019-07-09-feature-columns Tue, 09 Jul 2019 00:00:00 +0000 Dynamic linear models with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-06-25-dynamic_linear_models_tfprobability Previous posts featuring tfprobability - the R interface to TensorFlow Probability - have focused on enhancements to deep neural networks (e.g., introducing Bayesian uncertainty estimates) and fitting hierarchical models with Hamiltonian Monte Carlo. This time, we show how to fit time series using dynamic linear models (DLMs), yielding posterior predictive forecasts as well as the smoothed and filtered estimates from the Kálmán filter. Probabilistic ML/DL Time Series https://blogs.rstudio.com/tensorflow/posts/2019-06-25-dynamic_linear_models_tfprobability Mon, 24 Jun 2019 00:00:00 +0000 Adding uncertainty estimates to Keras models with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-06-05-uncertainty-estimates-tfprobability As of today, there is no mainstream road to obtaining uncertainty estimates from neural networks. All that can be said is that, normally, approaches tend to be Bayesian in spirit, involving some way of putting a prior over model weights. This holds true as well for the method presented in this post: We show how to use tfprobability, the R interface to TensorFlow Probability, to add uncertainty estimates to a Keras model in an elegant and conceptually plausible way. Probabilistic ML/DL TensorFlow/Keras Concepts https://blogs.rstudio.com/tensorflow/posts/2019-06-05-uncertainty-estimates-tfprobability Wed, 05 Jun 2019 00:00:00 +0000 Hierarchical partial pooling, continued: Varying slopes models with TensorFlow Probability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-05-24-varying-slopes This post builds on our recent introduction to multi-level modeling with tfprobability, the R wrapper to TensorFlow Probability. We show how to pool not just mean values ("intercepts"), but also relationships ("slopes"), thus enabling models to learn from data in an even broader way. Again, we use an example from Richard McElreath's "Statistical Rethinking"; the terminology as well as the way we present this topic are largely owed to this book. Bayesian Modeling TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-05-24-varying-slopes Fri, 24 May 2019 00:00:00 +0000 Tadpoles on TensorFlow: Hierarchical partial pooling with tfprobability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-05-06-tadpoles-on-tensorflow This post is a first introduction to MCMC modeling with tfprobability, the R interface to TensorFlow Probability (TFP). Our example is a multi-level model describing tadpole mortality, which may be known to the reader from Richard McElreath's wonderful "Statistical Rethinking". Bayesian Modeling TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-05-06-tadpoles-on-tensorflow Mon, 06 May 2019 00:00:00 +0000 Experimenting with autoregressive flows in TensorFlow Probability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-04-24-autoregressive-flows Continuing from the recent introduction to bijectors in TensorFlow Probability (TFP), this post brings autoregressivity to the table. Using TFP through the new R package tfprobability, we look at the implementation of masked autoregressive flows (MAF) and put them to use on two different datasets. Probabilistic ML/DL Generative Models TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2019-04-24-autoregressive-flows Wed, 24 Apr 2019 00:00:00 +0000 Auto-Keras: Tuning-free deep learning from R Juan Cruz Rodriguez https://blogs.rstudio.com/tensorflow/posts/2019-04-16-autokeras Sometimes in deep learning, architecture design and hyperparameter tuning pose substantial challenges. Using Auto-Keras, none of these is needed: We start a search procedure and extract the best-performing model. This post presents Auto-Keras in action on the well-known MNIST dataset. TensorFlow/Keras Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2019-04-16-autokeras Tue, 16 Apr 2019 00:00:00 +0000 Getting into the flow: Bijectors in TensorFlow Probability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-04-05-bijectors-flows Normalizing flows are one of the lesser known, yet fascinating and successful architectures in unsupervised deep learning. In this post we provide a basic introduction to flows using tfprobability, an R wrapper to TensorFlow Probability. Upcoming posts will build on this, using more complex flows on more complex data. Probabilistic ML/DL TensorFlow/Keras Concepts Generative Models https://blogs.rstudio.com/tensorflow/posts/2019-04-05-bijectors-flows Fri, 05 Apr 2019 00:00:00 +0000 Math, code, concepts: A third road to deep learning Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-03-15-concepts-way-to-dl Not everybody who wants to get into deep learning has a strong background in math or programming. This post elaborates on a concepts-driven, abstraction-based way to learn what it's all about. Meta Concepts https://blogs.rstudio.com/tensorflow/posts/2019-03-15-concepts-way-to-dl Fri, 15 Mar 2019 00:00:00 +0000 Audio classification with Keras: Looking closer at the non-deep learning parts Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-02-07-audio-background Sometimes, deep learning is seen - and welcomed - as a way to avoid laborious preprocessing of data. However, there are cases where preprocessing of sorts does not only help improve prediction, but constitutes a fascinating topic in itself. One such case is audio classification. In this post, we build on a previous post on this blog, this time focusing on explaining some of the non-deep learning background. We then link the concepts explained to updated for near-future releases TensorFlow code. TensorFlow/Keras Concepts Audio Processing https://blogs.rstudio.com/tensorflow/posts/2019-02-07-audio-background Thu, 07 Feb 2019 00:00:00 +0000 Discrete Representation Learning with VQ-VAE and TensorFlow Probability Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-01-24-vq-vae Mostly when thinking of Variational Autoencoders (VAEs), we picture the prior as an isotropic Gaussian. But this is by no means a necessity. The Vector Quantised Variational Autoencoder (VQ-VAE) described in van den Oord et al's "Neural Discrete Representation Learning" features a discrete latent space that allows to learn impressively concise latent representations. In this post, we combine elements of Keras, TensorFlow, and TensorFlow Probability to see if we can generate convincing letters resembling those in Kuzushiji-MNIST. TensorFlow/Keras Probabilistic ML/DL Generative Models https://blogs.rstudio.com/tensorflow/posts/2019-01-24-vq-vae Thu, 24 Jan 2019 00:00:00 +0000 Getting started with TensorFlow Probability from R Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2019-01-08-getting-started-with-tf-probability TensorFlow Probability offers a vast range of functionality ranging from distributions over probabilistic network layers to probabilistic inference. It works seamlessly with core TensorFlow and (TensorFlow) Keras. In this post, we provide a short introduction to the distributions layer and then, use it for sampling and calculating probabilities in a Variational Autoencoder. TensorFlow/Keras Probabilistic ML/DL Generative Models https://blogs.rstudio.com/tensorflow/posts/2019-01-08-getting-started-with-tf-probability Tue, 08 Jan 2019 00:00:00 +0000 Concepts in object detection Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-12-18-object-detection-concepts As shown in a previous post, naming and locating a single object in an image is a task that may be approached in a straightforward way. This is not the same with general object detection, though - naming and locating several objects at once, with no prior information about how many objects are supposed to be detected. In this post, we explain the steps involved in coding a basic single-shot object detector: Not unlike SSD (Single-shot Multibox Detector), but simplified and designed not for best performance, but comprehensibility. TensorFlow/Keras Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-12-18-object-detection-concepts Tue, 18 Dec 2018 00:00:00 +0000 Entity embeddings for fun and profit Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-11-26-embeddings-fun-and-profit Embedding layers are not just useful when working with language data. As "entity embeddings", they've recently become famous for applications on tabular, small-scale data. In this post, we exemplify two possible use cases, also drawing attention to what not to expect. TensorFlow/Keras Tabular Data https://blogs.rstudio.com/tensorflow/posts/2018-11-26-embeddings-fun-and-profit Mon, 26 Nov 2018 00:00:00 +0000 You sure? A Bayesian approach to obtaining uncertainty estimates from neural networks Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-11-12-uncertainty_estimates_dropout In deep learning, there is no obvious way of obtaining uncertainty estimates. In 2016, Gal and Ghahramani proposed a method that is both theoretically grounded and practical: use dropout at test time. In this post, we introduce a refined version of this method (Gal et al. 2017) that has the network itself learn how uncertain it is. Image Recognition & Image Processing Probabilistic ML/DL TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2018-11-12-uncertainty_estimates_dropout Mon, 12 Nov 2018 00:00:00 +0000 Naming and locating objects in images Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-11-05-naming-locating-objects Object detection (the act of classifying and localizing multiple objects in a scene) is one of the more difficult, but very relevant in practice deep learning tasks. We'll build up to it in several posts. Here we start with the simpler tasks of naming and locating a single object. TensorFlow/Keras Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-11-05-naming-locating-objects Mon, 05 Nov 2018 00:00:00 +0000 Representation learning with MMD-VAE Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-10-22-mmd-vae Like GANs, variational autoencoders (VAEs) are often used to generate images. However, VAEs add an additional promise: namely, to model an underlying latent space. Here, we first look at a typical implementation that maximizes the evidence lower bound. Then, we compare it to one of the more recent competitors, MMD-VAE, from the Info-VAE (information maximizing VAE) family. TensorFlow/Keras Unsupervised Learning Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-10-22-mmd-vae Mon, 22 Oct 2018 00:00:00 +0000 Winner takes all: A look at activations and cost functions Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-10-11-activations-intro Why do we use the activations we use, and how do they relate to the cost functions they tend to co-appear with? In this post we provide a conceptual introduction. TensorFlow/Keras Concepts https://blogs.rstudio.com/tensorflow/posts/2018-10-11-activations-intro Thu, 11 Oct 2018 00:00:00 +0000 More flexible models with TensorFlow eager execution and Keras Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-10-02-eager-wrapup Advanced applications like generative adversarial networks, neural style transfer, and the attention mechanism ubiquitous in natural language processing used to be not-so-simple to implement with the Keras declarative coding paradigm. Now, with the advent of TensorFlow eager execution, things have changed. This post explores using eager execution with R. TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2018-10-02-eager-wrapup Tue, 02 Oct 2018 00:00:00 +0000 Collaborative filtering with embeddings Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-09-26-embeddings-recommender Embeddings are not just for use in natural language processing. Here we apply embeddings to a common task in collaborative filtering - predicting user ratings - and on our way, strive for a better understanding of what an embedding layer really does. TensorFlow/Keras Tabular Data https://blogs.rstudio.com/tensorflow/posts/2018-09-26-embeddings-recommender Wed, 26 Sep 2018 00:00:00 +0000 Image-to-image translation with pix2pix Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-09-20-eager-pix2pix Conditional GANs (cGANs) may be used to generate one type of object based on another - e.g., a map based on a photo, or a color video based on black-and-white. Here, we show how to implement the pix2pix approach with Keras and eager execution. TensorFlow/Keras Image Recognition & Image Processing Generative Models https://blogs.rstudio.com/tensorflow/posts/2018-09-20-eager-pix2pix Thu, 20 Sep 2018 00:00:00 +0000 Attention-based Image Captioning with Keras Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-09-17-eager-captioning Image captioning is a challenging task at intersection of vision and language. Here, we demonstrate using Keras and eager execution to incorporate an attention mechanism that allows the network to concentrate on image features relevant to the current state of text generation. Natural Language Processing TensorFlow/Keras Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-09-17-eager-captioning Mon, 17 Sep 2018 00:00:00 +0000 Neural style transfer with eager execution and Keras Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-09-10-eager-style-transfer Continuing our series on combining Keras with TensorFlow eager execution, we show how to implement neural style transfer in a straightforward way. Based on this easy-to-adapt example, you can easily perform style transfer on your own images. TensorFlow/Keras Generative Models Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-09-10-eager-style-transfer Mon, 10 Sep 2018 00:00:00 +0000 Getting started with deep learning in R Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-09-07-getting-started Many fields are benefiting from the use of deep learning, and with the R keras, tensorflow and related packages, you can now easily do state of the art deep learning in R. In this post, we want to give some orientation as to how to best get started. TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2018-09-07-getting-started Fri, 07 Sep 2018 00:00:00 +0000 Generating images with Keras and TensorFlow eager execution Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-08-26-eager-dcgan Generative adversarial networks (GANs) are a popular deep learning approach to generating new entities (often but not always images). We show how to code them using Keras and TensorFlow eager execution. TensorFlow/Keras Generative Models Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-08-26-eager-dcgan Sun, 26 Aug 2018 00:00:00 +0000 Attention-based Neural Machine Translation with Keras Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-07-30-attention-layer As sequence to sequence prediction tasks get more involved, attention mechanisms have proven helpful. A prominent example is neural machine translation. Following a recent Google Colaboratory notebook, we show how to implement attention in R. Natural Language Processing TensorFlow/Keras https://blogs.rstudio.com/tensorflow/posts/2018-07-30-attention-layer Mon, 30 Jul 2018 00:00:00 +0000 Classifying physical activity from smartphone data Nick Strayer https://blogs.rstudio.com/tensorflow/posts/2018-07-17-activity-detection Using Keras to train a convolutional neural network to classify physical activity. The dataset was built from the recordings of 30 subjects performing basic activities and postural transitions while carrying a waist-mounted smartphone with embedded inertial sensors. https://blogs.rstudio.com/tensorflow/posts/2018-07-17-activity-detection Tue, 17 Jul 2018 00:00:00 +0000 Predicting Sunspot Frequency with Keras Matt Dancho Sigrid Keydana https://blogs.rstudio.com/tensorflow/posts/2018-06-25-sunspots-lstm In this post we will examine making time series predictions using the sunspots dataset that ships with base R. Sunspots are dark spots on the sun, associated with lower temperature. Our post will focus on both how to apply deep learning to time series forecasting, and how to properly apply cross validation in this domain. TensorFlow/Keras Time Series https://blogs.rstudio.com/tensorflow/posts/2018-06-25-sunspots-lstm Mon, 25 Jun 2018 00:00:00 +0000 Simple Audio Classification with Keras Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2018-06-06-simple-audio-classification-keras In this tutorial we will build a deep learning model to classify words. We will use the Speech Commands dataset which consists of 65,000 one-second audio files of people saying 30 different words. TensorFlow/Keras Audio Processing https://blogs.rstudio.com/tensorflow/posts/2018-06-06-simple-audio-classification-keras Wed, 06 Jun 2018 00:00:00 +0000 GPU Workstations in the Cloud with Paperspace J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2018-04-02-rstudio-gpu-paperspace If you don't have local access to a modern NVIDIA GPU, your best bet is typically to run GPU intensive training jobs in the cloud. Paperspace is a cloud service that provides access to a fully preconfigured Ubuntu 16.04 desktop environment equipped with a GPU. Cloud https://blogs.rstudio.com/tensorflow/posts/2018-04-02-rstudio-gpu-paperspace Mon, 02 Apr 2018 00:00:00 +0000 lime v0.4: The Kitten Picture Edition Thomas Lin Pedersen https://blogs.rstudio.com/tensorflow/posts/2018-03-09-lime-v04-the-kitten-picture-edition A new major release of lime has landed on CRAN. lime is an R port of the Python library of the same name by Marco Ribeiro that allows the user to pry open black box machine learning models and explain their outcomes on a per-observation basis Packages/Releases TensorFlow/Keras Explainability Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2018-03-09-lime-v04-the-kitten-picture-edition Fri, 09 Mar 2018 00:00:00 +0000 Deep Learning for Cancer Immunotherapy Leon Eyrich Jessen https://blogs.rstudio.com/tensorflow/posts/2018-01-29-dl-for-cancer-immunotherapy The aim of this post is to illustrate how deep learning is being applied in cancer immunotherapy (Immuno-oncology or Immunooncology) - a cancer treatment strategy, where the aim is to utilize the cancer patient's own immune system to fight the cancer. TensorFlow/Keras Tabular Data https://blogs.rstudio.com/tensorflow/posts/2018-01-29-dl-for-cancer-immunotherapy Mon, 29 Jan 2018 00:00:00 +0000 Predicting Fraud with Autoencoders and Keras Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2018-01-24-keras-fraud-autoencoder In this post we will train an autoencoder to detect credit card fraud. We will also demonstrate how to train Keras models in the cloud using CloudML. The basis of our model will be the Kaggle Credit Card Fraud Detection dataset. TensorFlow/Keras Unsupervised Learning Cloud https://blogs.rstudio.com/tensorflow/posts/2018-01-24-keras-fraud-autoencoder Thu, 25 Jan 2018 00:00:00 +0000 Analyzing rtweet Data with kerasformula Pete Mohanty https://blogs.rstudio.com/tensorflow/posts/2018-01-24-analyzing-rtweet-data-with-kerasformula The kerasformula package offers a high-level interface for the R interface to Keras. It’s main interface is the kms function, a regression-style interface to keras_model_sequential that uses formulas and sparse matrices. We use kerasformula to predict how popular tweets will be based on how often the tweet was retweeted and favorited. TensorFlow/Keras Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2018-01-24-analyzing-rtweet-data-with-kerasformula Wed, 24 Jan 2018 00:00:00 +0000 Deep Learning With Keras To Predict Customer Churn Matt Dancho https://blogs.rstudio.com/tensorflow/posts/2018-01-11-keras-customer-churn Using Keras to predict customer churn based on the IBM Watson Telco Customer Churn dataset. We also demonstrate using the lime package to help explain which features drive individual model predictions. In addition, we use three new packages to assist with Machine Learning: recipes for preprocessing, rsample for sampling data and yardstick for model metrics. TensorFlow/Keras Tabular Data Explainability https://blogs.rstudio.com/tensorflow/posts/2018-01-11-keras-customer-churn Thu, 11 Jan 2018 00:00:00 +0000 R Interface to Google CloudML J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2018-01-10-r-interface-to-cloudml We are excited to announce the availability of the cloudml package, which provides an R interface to Google Cloud Machine Learning Engine. CloudML provides a number of services including on-demand access to training on GPUs and hyperparameter tuning to optimize key attributes of model architectures. Cloud Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2018-01-10-r-interface-to-cloudml Wed, 10 Jan 2018 00:00:00 +0000 Classifying Duplicate Questions from Quora with Keras Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2018-01-09-keras-duplicate-questions-quora In this post we will use Keras to classify duplicated questions from Quora. Our implementation is inspired by the Siamese Recurrent Architecture, with modifications to the similarity measure and the embedding layers (the original paper uses pre-trained word vectors) TensorFlow/Keras Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2018-01-09-keras-duplicate-questions-quora Tue, 09 Jan 2018 00:00:00 +0000 Word Embeddings with Keras Daniel Falbel https://blogs.rstudio.com/tensorflow/posts/2017-12-22-word-embeddings-with-keras Word embedding is a method used to map words of a vocabulary to dense vectors of real numbers where semantically similar words are mapped to nearby points. In this example we'll use Keras to generate word embeddings for the Amazon Fine Foods Reviews dataset. TensorFlow/Keras Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2017-12-22-word-embeddings-with-keras Fri, 22 Dec 2017 00:00:00 +0000 Time Series Forecasting with Recurrent Neural Networks François Chollet J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-12-20-time-series-forecasting-with-recurrent-neural-networks In this post, we'll review three advanced techniques for improving the performance and generalization power of recurrent neural networks. We'll demonstrate all three concepts on a temperature-forecasting problem, where you have access to a time series of data points coming from sensors installed on the roof of a building. TensorFlow/Keras Time Series https://blogs.rstudio.com/tensorflow/posts/2017-12-20-time-series-forecasting-with-recurrent-neural-networks Wed, 20 Dec 2017 00:00:00 +0000 Image Classification on Small Datasets with Keras François Chollet J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-12-14-image-classification-on-small-datasets Having to train an image-classification model using very little data is a common situation, in this article we review three techniques for tackling this problem including feature extraction and fine tuning from a pretrained network. TensorFlow/Keras Image Recognition & Image Processing https://blogs.rstudio.com/tensorflow/posts/2017-12-14-image-classification-on-small-datasets Thu, 14 Dec 2017 00:00:00 +0000 Deep Learning for Text Classification with Keras François Chollet J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-12-07-text-classification-with-keras Two-class classification, or binary classification, may be the most widely applied kind of machine-learning problem. In this excerpt from the book Deep Learning with R, you'll learn to classify movie reviews as positive or negative, based on the text content of the reviews. TensorFlow/Keras Natural Language Processing https://blogs.rstudio.com/tensorflow/posts/2017-12-07-text-classification-with-keras Thu, 07 Dec 2017 00:00:00 +0000 tfruns: Tools for TensorFlow Training Runs J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-10-04-tfruns The tfruns package provides a suite of tools for tracking, visualizing, and managing TensorFlow training runs and experiments from R. Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2017-10-04-tfruns Wed, 04 Oct 2017 00:00:00 +0000 Keras for R J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-09-06-keras-for-r We are excited to announce that the keras package is now available on CRAN. The package provides an R interface to Keras, a high-level neural networks API developed with a focus on enabling fast experimentation. TensorFlow/Keras Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2017-09-06-keras-for-r Tue, 05 Sep 2017 00:00:00 +0000 TensorFlow Estimators Yuan Tang https://blogs.rstudio.com/tensorflow/posts/2017-08-31-tensorflow-estimators-for-r The tfestimators package is an R interface to TensorFlow Estimators, a high-level API that provides implementations of many different model types including linear models and deep neural networks. Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2017-08-31-tensorflow-estimators-for-r Thu, 31 Aug 2017 00:00:00 +0000 TensorFlow v1.3 Released J.J. Allaire https://blogs.rstudio.com/tensorflow/posts/2017-08-17-tensorflow-v13-released The final release of TensorFlow v1.3 is now available. This release marks the initial availability of several canned estimators including DNNClassifier and DNNRegressor. Packages/Releases https://blogs.rstudio.com/tensorflow/posts/2017-08-17-tensorflow-v13-released Thu, 17 Aug 2017 00:00:00 +0000