深度学习路线
目录
1. A Neural Algorithm of Artistic Style [中英文摘要]
2. A Neural Conversational Model [中英文摘要]
3. A character-level decoder without explicit segmentation for neural machine translation [中英文摘要]
4. A learned representation for artistic style [中英文摘要]
5. Actor-mimic deep multitask and transfer reinforcement learning [中英文摘要]
6. Adam: A method for stochastic optimization [中英文摘要]
7. Addressing the rare word problem in neural machine translation [中英文摘要]
8. Ask me anything: Dynamic memory networks for natural language processing [中英文摘要]
9. Asynchronous methods for deep reinforcement learning [中英文摘要]
10. Auto-encoding variational bayes [中英文摘要]
11. Baby talk: Understanding and generating simple image descriptions [中英文摘要]
12. Bag of tricks for efficient text classification [中英文摘要]
13. Batch normalization: Accelerating deep network training by reducing internal covariate shift [中英文摘要]
14. Beyond correlation filters: Learning continuous convolution operators for visual tracking [中英文摘要]
15. Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 [中英文摘要]
16. Building high-level features using large scale unsupervised learning [中英文摘要]
17. Character-Aware neural language models [中英文摘要]
18. Collective robot reinforcement learning with distributed asynchronous guided policy search [中英文摘要]
19. Colorful image colorization [中英文摘要]
20. Conditional image generation with PixelCNN decoders [中英文摘要]
21. Continuous control with deep reinforcement learning [中英文摘要]
22. Continuous deep q-learning with model-based acceleration [中英文摘要]
23. Controlling perceptual factors in neural style transfer [中英文摘要]
24. DRAW: A Recurrent Neural Network For Image Generation [中英文摘要]
25. Decoupled neural interfaces using synthetic gradients [中英文摘要]
26. Deep Learning of Representations for Unsupervised and Transfer Learning [中英文摘要]
27. Deep Neural Networks for Acoustic Modeling in the Presence of Noise [中英文摘要]
28. Deep captioning with multimodal recurrent neural networks (m-RNN) [中英文摘要]
29. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding [中英文摘要]
30. Deep fragment embeddings for bidirectional image sentence mapping [中英文摘要]
31. Deep learning [中英文摘要]
32. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates [中英文摘要]
33. Deep residual learning for image recognition [中英文摘要]
34. Deep speech 2: End-to-end speech recognition in English and Mandarin [中英文摘要]
35. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [中英文摘要]
36. Distilling the Knowledge in a Neural Network [中英文摘要]
37. Dueling Network Architectures for Deep Reinforcement Learning [中英文摘要]
38. Effective approaches to attention-based neural machine translation [中英文摘要]
39. End-to-end memory networks [中英文摘要]
40. End-to-end training of deep visuomotor policies [中英文摘要]
41. Every picture tells a story: Generating sentences from images [中英文摘要]
42. Evolving large-scale neural networks for vision-based reinforcement learning [中英文摘要]
43. Fast R-CNN [中英文摘要]
44. Fast and accurate recurrent neural network acoustic models for speech recognition [中英文摘要]
45. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [中英文摘要]
46. Fcnt [中英文摘要]
47. From captions to visual concepts and back [中英文摘要]
48. Fully Character-Level Neural Machine Translation without Explicit Segmentation [中英文摘要]
49. Fully Convolutional Networks for Semantic Segmentation [中英文摘要]
50. Fully-convolutional siamese networks for object tracking [中英文摘要]
51. Functional magnetic resonance imaging as experienced by stroke survivors [中英文摘要]
52. Generating Sequences With Recurrent Neural Networks [中英文摘要]
53. Generative visual manipulation on the natural image manifold [中英文摘要]
54. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation [中英文摘要]
55. Handbook of approximation algorithms and metaheuristics [中英文摘要]
56. Human-level control through deep reinforcement learning [中英文摘要]
57. Hybrid computing using a neural network with dynamic external memory [中英文摘要]
58. Improving neural networks by preventing co-adaptation of feature detectors [中英文摘要]
59. Inceptionism: Going deeper into neural networks, google research blog [中英文摘要]
60. Influência do glyphosate e 2,4-D sobre o desenvolvimento inicial de espécies florestais [中英文摘要]
61. Instance-Aware Semantic Segmentation via Multi-task Network Cascades [中英文摘要]
62. Instance-sensitive fully convolutional networks [中英文摘要]
63. Joint learning of words and meaning representations for open-text semantic parsing [中英文摘要]
64. Layer Normalization [中英文摘要]
65. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection [中英文摘要]
66. Learning phrase representations using RNN encoder-decoder for statistical machine translation [中英文摘要]
67. Learning to learn by gradient descent by gradient descent [中英文摘要]
68. Learning to navigate in complex environments [中英文摘要]
69. Learning to segment object candidates [中英文摘要]
70. Learning to track at 100 FPS with deep regression networks [中英文摘要]
71. Lifelong machine learning systems: Beyond learning algorithms [中英文摘要]
72. Lifted rule injection for relation embeddings [中英文摘要]
73. Literature after 9/11 [中英文摘要]
74. Long-Term Recurrent Convolutional Networks for Visual Recognition and Description [中英文摘要]
75. Low-Shot Visual Recognition by Shrinking and Hallucinating Features [中英文摘要]
76. Mask R-CNN [中英文摘要]
77. Mastering the game of Go with deep neural networks and tree search [中英文摘要]
78. Matching networks for one shot learning [中英文摘要]
79. Memory networks [中英文摘要]
80. Meta-Learning with Memory-Augmented Neural Networks [中英文摘要]
81. Mind's eye: A recurrent visual representation for image caption generation [中英文摘要]
82. Mining on Manifolds: Metric Learning Without Labels [中英文摘要]
83. Modeling and Propagating CNNs in a Tree Structure for Visual Tracking [中英文摘要]
84. Multitudes : Aux origines d'une revue radicale [中英文摘要]
85. Net2Net: Accelerating learning via knowledge transfer [中英文摘要]
86. Network morphism [中英文摘要]
87. Neural Turing Machines [中英文摘要]
88. Neural machine translation by jointly learning to align and translate [中英文摘要]
89. Neural machine translation of rare words with subword units [中英文摘要]
90. On the importance of initialization and momentum in deep learning [中英文摘要]
91. Perceptual losses for real-time style transfer and super-resolution [中英文摘要]
92. Pixel recurrent neural networks [中英文摘要]
93. Playing Atari with Deep Reinforcement Learning [中英文摘要]
94. Pointer networks [中英文摘要]
95. Policy distillation [中英文摘要]
96. Preparation of novel high copper ions removal membranes by embedding organosilane-functionalized multi-walled carbon nanotube [中英文摘要]
97. Program Induction [中英文摘要]
98. Progressive Neural Networks [中英文摘要]
99. Protective effect of rSm28GST-specific T cells in schistosomiasis: Role of gamma interferon [中英文摘要]
100. R-FCN: Object detection via region-based fully convolutional networks [中英文摘要]
101. Reinforcement Learning Neural Turing Machines - Revised [中英文摘要]
102. Rich feature hierarchies for accurate object detection and semantic segmentation [中英文摘要]
103. SSD: Single shot multibox detector [中英文摘要]
104. Science [中英文摘要]
105. Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks [中英文摘要]
106. Sequence to sequence learning with neural networks [中英文摘要]
107. Sequence to sequence learning with neural networks [中英文摘要]
108. Show and tell: A neural image caption generator [中英文摘要]
109. Show, attend and tell: Neural image caption generation with visual attention [中英文摘要]
110. Siamese Thesis [中英文摘要]
111. Sim-to-Real Robot Learning from Pixels with Progressive Nets [中英文摘要]
112. Sparse generative adversarial network [中英文摘要]
113. Spatial pyramid pooling in deep convolutional networks for visual recognition [中英文摘要]
114. Speech recognition with deep recurrent neural networks [中英文摘要]
115. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size [中英文摘要]
116. Squeezed Very Deep Convolutional Neural Networks for Text Classification [中英文摘要]
117. Studies on Ozone-oxidation of Dye in a Bubble Column Reactor at Different pH and Different Oxidation-reduction Potential. [中英文摘要]
118. Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours [中英文摘要]
119. Target-driven visual navigation in indoor scenes using deep reinforcement learning [中英文摘要]
120. Teaching machines to read and comprehend [中英文摘要]
121. Texture networks: Feed-forward synthesis of textures and stylized images [中英文摘要]
122. Toward Human Parity in Conversational Speech Recognition [中英文摘要]
123. Towards AI-complete question answering: A set of prerequisite toy tasks [中英文摘要]
124. Towards end-to-end speech recognition with recurrent neural networks [中英文摘要]
125. Transferring Rich Feature Hierarchies for Robust Visual Tracking [中英文摘要]
126. Unsupervised representation learning with deep convolutional generative adversarial networks [中英文摘要]
127. Very deep convolutional networks for large-scale image recognition [中英文摘要]
128. You only look once: Unified, real-time object detection [中英文摘要]
摘要
[1] A Neural Algorithm of Artistic Style (2016)
Abstract: In fine art, especially painting, humans have mastered the skill to create unique visual experiences through composing a complex interplay between the content and style of an image. Thus far the algorithmic basis of this process is unknown and there exists no artificial system with similar capabilities. However, in other key areas of visual perception such as object and face recognition near-human performance was recently demonstrated by a class of biologically inspired vision models called Deep Neural Networks. Here we introduce an artificial system based on a Deep Neural Network that creates artistic images of high perceptual quality. The system uses neural representations to separate and recombine content and style of arbitrary images, providing a neural algorithm for the creation of artistic images. Moreover, in light of the striking similarities between performance-optimised artificial neural networks and biological vision, our work offers a path forward to an algorithmic understanding of how humans create and perceive artistic imagery.
摘要: 在美术,特别是绘画中,人类已经掌握了通过组合图像内容与风格之间复杂的相互作用来创造独特视觉体验的技能。到目前为止,该过程的算法基础尚不清楚,也不存在具有类似能力的人工系统。但是,在物体识别和人脸识别等视觉感知的其他关键领域,一类被称为深度神经网络的受生物启发的视觉模型最近已展现出接近人类的表现。在这里,我们介绍一种基于深度神经网络的人工系统,该系统可以创建具有高感知质量的艺术图像。该系统使用神经表示来分离和重组任意图像的内容和风格,为艺术图像的创作提供了一种神经算法。此外,鉴于性能优化的人工神经网络与生物视觉之间的惊人相似性,我们的工作为从算法上理解人类如何创造和感知艺术图像提供了一条途径。
下载地址 | 返回目录 | [10.1167/16.12.326]
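下面给出一个极简的 NumPy 示意代码,说明上文所述“分离内容与风格”的核心思路之一:用特征图的 Gram 矩阵表示风格,用特征图本身表示内容。其中的特征图、层权重系数等均为假设的示例数据,并非论文的完整实现。

```python
import numpy as np

def gram_matrix(features):
    """风格表示:特征图各通道之间的相关性(C x C 的 Gram 矩阵)。
    features: 形状为 (C, H, W) 的某一层卷积特征图。"""
    c, h, w = features.shape
    f = features.reshape(c, h * w)
    return f @ f.T / (h * w)

def style_loss(gen_feat, style_feat):
    """生成图与风格图在同一层上 Gram 矩阵的均方差。"""
    return np.mean((gram_matrix(gen_feat) - gram_matrix(style_feat)) ** 2)

def content_loss(gen_feat, content_feat):
    """生成图与内容图在同一层上特征图的均方差。"""
    return np.mean((gen_feat - content_feat) ** 2)

# 假设的示例特征图(实际应取自预训练 CNN 的中间层)
rng = np.random.default_rng(0)
gen = rng.standard_normal((64, 32, 32))
style = rng.standard_normal((64, 32, 32))
content = rng.standard_normal((64, 32, 32))

alpha, beta = 1.0, 1e3  # 假设的内容/风格权重
total = alpha * content_loss(gen, content) + beta * style_loss(gen, style)
print(total)
```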
[2] A Neural Conversational Model (2015)
Abstract: Conversational modeling is an important task in natural language understanding and machine intelligence. Although previous approaches exist, they are often restricted to specific domains (e.g., booking an airline ticket) and require hand-crafted rules. In this paper, we present a simple approach for this task which uses the recently proposed sequence to sequence framework. Our model converses by predicting the next sentence given the previous sentence or sentences in a conversation. The strength of our model is that it can be trained end-to-end and thus requires much fewer hand-crafted rules. We find that this straightforward model can generate simple conversations given a large conversational training dataset. Our preliminary results suggest that, despite optimizing the wrong objective function, the model is able to converse well. It is able to extract knowledge from both a domain specific dataset, and from a large, noisy, and general domain dataset of movie subtitles. On a domain-specific IT helpdesk dataset, the model can find a solution to a technical problem via conversations. On a noisy open-domain movie transcript dataset, the model can perform simple forms of common sense reasoning. As expected, we also find that the lack of consistency is a common failure mode of our model.
摘要: 会话建模是自然语言理解和机器智能中的一项重要任务。尽管已有先前的方法,但它们通常局限于特定领域(例如预订机票),并且需要手工制定的规则。在本文中,我们针对该任务提出了一种简单的方法,它使用了最近提出的序列到序列(sequence to sequence)框架。我们的模型在对话中根据前一个或多个句子预测下一个句子,从而进行交谈。该模型的优势在于可以端到端训练,因此需要的手工规则要少得多。我们发现,在给定大规模会话训练数据集的情况下,这个简单的模型可以生成简单的对话。我们的初步结果表明,尽管优化的目标函数并不理想,该模型仍能较好地进行对话。它既能从特定领域的数据集中提取知识,也能从大型、嘈杂且通用的电影字幕数据集中提取知识。在特定领域的IT服务台数据集上,该模型可以通过对话找到技术问题的解决方案。在嘈杂的开放域电影对白数据集上,该模型可以执行简单形式的常识推理。不出所料,我们还发现缺乏一致性是该模型的常见失败模式。
[3] A character-level decoder without explicit segmentation for neural machine translation (2016)
Abstract: The existing machine translation systems, whether phrase-based or neural, have relied almost exclusively on word-level modelling with explicit segmentation. In this paper, we ask a fundamental question: can neural machine translation generate a character sequence without any explicit segmentation? To answer this question, we evaluate an attention-based encoder-decoder with a subword-level encoder and a character-level decoder on four language pairs - En-Cs, En-De, En-Ru and En-Fi - using the parallel corpora from WMT15. Our experiments show that the models with a character-level decoder outperform the ones with a subword-level decoder on all of the four language pairs. Furthermore, the ensembles of neural models with a character-level decoder outperform the state-of-the-art non-neural machine translation systems on En-Cs, En-De and En-Fi and perform comparably on En-Ru.
摘要: 现有的机器翻译系统,无论是基于短语的还是基于神经网络的,几乎都完全依赖带显式分割的词级建模。在本文中,我们提出一个基本问题:神经机器翻译能否在不进行任何显式分割的情况下生成字符序列?为了回答这个问题,我们在 En-Cs、En-De、En-Ru 和 En-Fi 四个语言对上,使用 WMT15 的平行语料,评估了一个带子词级编码器和字符级解码器的基于注意力的编码器-解码器。实验表明,在全部四个语言对上,带字符级解码器的模型均优于带子词级解码器的模型。此外,带字符级解码器的神经模型集成在 En-Cs、En-De 和 En-Fi 上优于最先进的非神经机器翻译系统,在 En-Ru 上表现相当。
下载地址 | 返回目录 | [10.18653/v1/p16-1160]
[4] A learned representation for artistic style (2019)
Abstract: The diversity of painting styles represents a rich visual vocabulary for the construction of an image. The degree to which one may learn and parsimoniously capture this visual vocabulary measures our understanding of the higher level features of paintings, if not images in general. In this work we investigate the construction of a single, scalable deep network that can parsimoniously capture the artistic style of a diversity of paintings. We demonstrate that such a network generalizes across a diversity of artistic styles by reducing a painting to a point in an embedding space. Importantly, this model permits a user to explore new painting styles by arbitrarily combining the styles learned from individual paintings. We hope that this work provides a useful step towards building rich models of paintings and offers a window on to the structure of the learned representation of artistic style.
摘要: 绘画风格的多样性代表了构建图像的丰富视觉词汇。人们能够在多大程度上学习并简约地掌握这一视觉词汇,衡量了我们对绘画(乃至一般图像)更高层次特征的理解。在这项工作中,我们研究如何构建一个单一的、可扩展的深度网络,以简约的方式捕捉多种绘画的艺术风格。我们证明,这种网络通过将一幅绘画归约为嵌入空间中的一个点,能够在多种艺术风格之间泛化。重要的是,该模型允许用户通过任意组合从单幅绘画中学到的风格来探索新的绘画风格。我们希望这项工作为建立丰富的绘画模型迈出有用的一步,并为理解艺术风格学习表示的结构提供一个窗口。
[5] Actor-mimic deep multitask and transfer reinforcement learning (2016)
Abstract: The ability to act in multiple environments and transfer previous knowledge to new situations can be considered a critical aspect of any intelligent agent. Towards this goal, we define a novel method of multitask and transfer learning that enables an autonomous agent to learn how to behave in multiple tasks simultaneously, and then generalize its knowledge to new domains. This method, termed “Actor-Mimic”, exploits the use of deep reinforcement learning and model compression techniques to train a single policy network that learns how to act in a set of distinct tasks by using the guidance of several expert teachers. We then show that the representations learnt by the deep policy network are capable of generalizing to new tasks with no prior expert guidance, speeding up learning in novel environments. Although our method can in general be applied to a wide range of problems, we use Atari games as a testing environment to demonstrate these methods.
摘要: 在多种环境中采取行动并将先前的知识转移到新情况中的能力可以被视为任何智能代理的关键方面。为了实现这一目标,我们定义了一种新颖的多任务和转移学习方法,使自治代理能够学习如何同时在多个任务中表现,然后将其知识推广到新领域。这种称为“演员模仿”的方法利用深度强化学习和模型压缩技术来训练单个策略网络,该策略网络通过在多位专家老师的指导下学习如何在一组不同的任务中采取行动。然后,我们表明,深度策略网络学习到的表示能够在没有专家指导的情况下推广到新任务,从而加快了在新颖环境中的学习速度。尽管我们的方法通常可以应用于广泛的问题,但我们使用Atari游戏作为测试环境来演示这些方法。
[6] Adam: A method for stochastic optimization (2015)
Abstract: We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm.
摘要: 我们介绍 Adam,这是一种基于低阶矩自适应估计的一阶梯度随机目标函数优化算法。该方法易于实现,计算效率高,内存需求小,对梯度的对角缩放具有不变性,并且非常适合数据和/或参数规模较大的问题。该方法也适用于非平稳目标以及梯度非常嘈杂和/或稀疏的问题。其超参数具有直观的解释,通常只需很少的调整。文中还讨论了与启发 Adam 的相关算法之间的一些联系。我们还分析了该算法的理论收敛性质,并给出了与在线凸优化框架下已知最佳结果相当的收敛速度遗憾界(regret bound)。实证结果表明,Adam 在实践中效果很好,并且优于其他随机优化方法。最后,我们讨论了 AdaMax,它是基于无穷范数的 Adam 变体。
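下面给出摘要中所述 Adam 更新规则的一个最小 NumPy 示意实现。仅为示意:超参数取常用默认值,示例中的目标函数与学习率均为假设,并非论文实验设置。

```python
import numpy as np

def adam_step(theta, g, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """对参数 theta 执行一步 Adam 更新。
    g: 当前梯度; m, v: 一阶/二阶矩的滑动估计; t: 步数(从 1 开始)。"""
    m = b1 * m + (1 - b1) * g            # 一阶矩估计
    v = b2 * v + (1 - b2) * g * g        # 二阶矩估计
    m_hat = m / (1 - b1 ** t)            # 偏差修正
    v_hat = v / (1 - b2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# 示例:最小化 f(x) = (x - 3)^2,梯度为 2(x - 3)
theta = np.array([0.0])
m = np.zeros_like(theta)
v = np.zeros_like(theta)
for t in range(1, 2001):
    g = 2 * (theta - 3.0)
    theta, m, v = adam_step(theta, g, m, v, t, lr=0.05)
print(theta)  # 应接近 3.0
```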
[7] Addressing the rare word problem in neural machine translation (2015)
Abstract: Neural Machine Translation (NMT) is a new approach to machine translation that has shown promising results that are comparable to traditional approaches. A significant weakness in conventional NMT systems is their inability to correctly translate very rare words: end-to-end NMTs tend to have relatively small vocabularies with a single unk symbol that represents every possible out-of-vocabulary (OOV) word. In this paper, we propose and implement an effective technique to address this problem. We train an NMT system on data that is augmented by the output of a word alignment algorithm, allowing the NMT system to emit, for each OOV word in the target sentence, the position of its corresponding word in the source sentence. This information is later utilized in a post-processing step that translates every OOV word using a dictionary. Our experiments on the WMT14 English to French translation task show that this method provides a substantial improvement of up to 2.8 BLEU points over an equivalent NMT system that does not use this technique. With 37.5 BLEU points, our NMT system is the first to surpass the best result achieved on a WMT14 contest task.
摘要: 神经机器翻译(NMT)是一种新的机器翻译方法,已显示出可与传统方法相媲美的可喜结果。传统 NMT 系统的一个显著弱点是无法正确翻译非常罕见的单词:端到端 NMT 的词汇表往往相对较小,并用单一的 unk 符号代表所有可能的词汇表外(OOV)单词。在本文中,我们提出并实现了一种解决该问题的有效技术。我们在经过词对齐算法输出增强的数据上训练 NMT 系统,使其能够为目标句子中的每个 OOV 单词给出其对应单词在源句子中的位置。随后在后处理步骤中利用这一信息,通过词典翻译每个 OOV 单词。我们在 WMT14 英语到法语翻译任务上的实验表明,与不使用该技术的同等 NMT 系统相比,该方法可带来最多 2.8 个 BLEU 点的显著提升。我们的 NMT 系统以 37.5 的 BLEU 分数首次超越了 WMT14 竞赛任务上的最佳结果。
下载地址 | 返回目录 | [10.3115/v1/p15-1002]
[8] Ask me anything: Dynamic memory networks for natural language processing (2016)
Abstract: Most tasks in natural language processing can be cast into question answering (QA) problems over language input. We introduce the dynamic memory network (DMN), a neural network architecture which processes input sequences and questions, forms episodic memories, and generates relevant answers. Questions trigger an iterative attention process which allows the model to condition its attention on the inputs and the result of previous iterations. These results are then reasoned over in a hierarchical recurrent sequence model to generate answers. The DMN can be trained end-to-end and obtains state-of-the-art results on several types of tasks and datasets: question answering (Facebook's bAbI dataset), text classification for sentiment analysis (Stanford Sentiment Treebank) and sequence modeling for part-of-speech tagging (WSJ-PTB). The training for these different tasks relies exclusively on trained word vector representations and input-question-answer triplets.
摘要: 自然语言处理中的大多数任务都可以转化为针对语言输入的问答(QA)问题。我们介绍动态记忆网络(DMN),这是一种神经网络架构,它处理输入序列和问题、形成情景记忆并生成相关答案。问题会触发一个迭代的注意力过程,使模型能够将注意力置于输入以及先前迭代的结果之上;然后由一个分层的循环序列模型对这些结果进行推理以生成答案。DMN 可以端到端训练,并在多种任务和数据集上取得了最新水平的结果:问答(Facebook 的 bAbI 数据集)、用于情感分析的文本分类(Stanford Sentiment Treebank)以及用于词性标注的序列建模(WSJ-PTB)。针对这些不同任务的训练完全依赖于训练好的词向量表示和“输入-问题-答案”三元组。
[9] Asynchronous methods for deep reinforcement learning (2016)
Abstract: We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers. We present asynchronous variants of four standard reinforcement learning algorithms and show that parallel actor-learners have a stabilizing effect on training allowing all four methods to successfully train neural network controllers. The best performing method, an asynchronous variant of actor-critic, surpasses the current state-of-the-art on the Atari domain while training for half the time on a single multi-core CPU instead of a GPU. Furthermore, we show that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.
摘要: 我们提出了一个概念上简单且轻量的深度强化学习框架,它使用异步梯度下降来优化深度神经网络控制器。我们给出了四种标准强化学习算法的异步变体,并表明并行的 actor-learner 对训练具有稳定作用,使全部四种方法都能成功训练神经网络控制器。其中表现最好的方法是 actor-critic 的异步变体,它在 Atari 领域超越了当前最先进的水平,而训练只需在单个多核 CPU(而非 GPU)上进行一半的时间。此外,我们还表明,异步 actor-critic 在各种连续运动控制问题以及使用视觉输入导航随机 3D 迷宫的新任务上都取得了成功。
[10] Auto-encoding variational bayes (2014)
Abstract: How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We introduce a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case. Our contribution is two-fold. First, we show that a reparameterization of the variational lower bound yields a lower bound estimator that can be straightforwardly optimized using standard stochastic gradient methods. Second, we show that for i.i.d. datasets with continuous latent variables per datapoint, posterior inference can be made especially efficient by fitting an approximate inference model (also called a recognition model) to the intractable posterior using the proposed lower bound estimator. Theoretical advantages are reflected in experimental results.
摘要: 在存在具有难解后验分布的连续潜变量和大型数据集的情况下,我们如何在有向概率模型中进行高效的推断和学习?我们引入了一种随机变分推断与学习算法,它可以扩展到大型数据集,并且在一些温和的可微性条件下,甚至适用于难解的情形。我们的贡献有两方面。首先,我们证明对变分下界进行重参数化可以得到一个下界估计量,它可以用标准随机梯度方法直接优化。其次,我们证明对于每个数据点都具有连续潜变量的 i.i.d. 数据集,通过使用所提出的下界估计量将一个近似推断模型(也称为识别模型)拟合到难解的后验,可以使后验推断变得特别高效。这些理论上的优势也体现在实验结果中。
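下面用 NumPy 给出摘要中“重参数化”技巧的一个最小示意:潜变量 z 不直接从 q(z|x) 采样,而是写成 z = mu + sigma * eps(eps 来自标准正态分布),从而让梯度可以穿过采样步骤。其中编码器输出的 mu、log_var 为假设的示例数据,并非完整的 VAE 实现。

```python
import numpy as np

rng = np.random.default_rng(0)

def reparameterize(mu, log_var):
    """z = mu + sigma * eps, eps ~ N(0, I);梯度可经 mu、log_var 反向传播。"""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_to_standard_normal(mu, log_var):
    """q(z|x)=N(mu, sigma^2) 与先验 N(0, I) 之间 KL 散度的解析式(按潜变量维度求和)。"""
    return -0.5 * np.sum(1 + log_var - mu ** 2 - np.exp(log_var), axis=-1)

# 假设编码器对一个 batch 输出了 4 维潜变量的均值与对数方差
mu = rng.standard_normal((8, 4))
log_var = rng.standard_normal((8, 4)) * 0.1
z = reparameterize(mu, log_var)          # 可送入解码器重构输入
print(z.shape, kl_to_standard_normal(mu, log_var).mean())
```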
[11] Baby talk: Understanding and generating simple image descriptions (2013)
Abstract: We present a system to automatically generate natural language descriptions from images. This system consists of two parts. The first part, content planning, smooths the output of computer vision-based detection and recognition algorithms with statistics mined from large pools of visually descriptive text to determine the best content words to use to describe an image. The second step, surface realization, chooses words to construct natural language sentences based on the predicted content and general statistics from natural language. We present multiple approaches for the surface realization step and evaluate each using automatic measures of similarity to human generated reference descriptions. We also collect forced choice human evaluations between descriptions from the proposed generation system and descriptions from competing approaches. The proposed system is very effective at producing relevant sentences for images. It also generates descriptions that are notably more true to the specific image content than previous work. © 2013 IEEE.
摘要: 我们提出了一种系统,可以根据图像自动生成自然语言描述。该系统由两部分组成。第一部分是内容计划,它使用从大量视觉描述性文本池中提取的统计数据来平滑基于计算机视觉的检测和识别算法的输出,以确定用于描述图像的最佳内容词。第二步,表面实现,根据自然语言的预测内容和一般统计信息,选择单词来构建自然语言句子。我们为表面实现步骤提供了多种方法,并使用与人类生成的参考描述相似的自动度量来评估每种方法。我们还从提议的生成系统的描述与竞争方法的描述之间收集了强制选择的人工评估。所提出的系统在产生图像的相关句子方面非常有效。与以前的工作相比,它还生成了对特定图像内容更真实的描述。© 2013 IEEE。
下载地址 | 返回目录 | [10.1109/TPAMI.2012.162]
[12] Bag of tricks for efficient text classification (2017)
Abstract: This paper explores a simple and efficient baseline for text classification. Our experiments show that our fast text classifier fastText is often on par with deep learning classifiers in terms of accuracy, and many orders of magnitude faster for training and evaluation. We can train fastText on more than one billion words in less than ten minutes using a standard multicore CPU, and classify half a million sentences among 312K classes in less than a minute.
摘要: 本文探讨了一种简单有效的文本分类基准。我们的实验表明,我们的快速文本分类器fastText在准确性方面经常与深度学习分类器相提并论,在训练和评估方面要快多个数量级。我们可以使用标准的多核CPU在不到十分钟的时间内训练fastText超过十亿个单词,并在一分钟之内对312K个类别中的50万个句子进行分类。
下载地址 | 返回目录 | [10.18653/v1/e17-2068]
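下面是摘要中“简单高效基线”思路的一个 NumPy 极简示意:把句子表示为词(或 n-gram)嵌入的平均值,再接一个线性分类器。这里的词表大小、嵌入维度与随机数据均为假设示例,并非 fastText 的实际实现。

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, embed_dim, num_classes = 1000, 16, 4

E = rng.standard_normal((vocab_size, embed_dim)) * 0.1   # 词 / n-gram 嵌入表
W = rng.standard_normal((embed_dim, num_classes)) * 0.1  # 线性分类层
b = np.zeros(num_classes)

def predict(token_ids):
    """句子向量 = 词嵌入的平均值,再做线性分类并 softmax。"""
    sent_vec = E[token_ids].mean(axis=0)
    logits = sent_vec @ W + b
    p = np.exp(logits - logits.max())
    return p / p.sum()

# 假设的一个已转成 id 序列的句子
sentence = np.array([12, 7, 256, 999, 3])
print(predict(sentence))  # 各类别的概率
```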
[13] Batch normalization: Accelerating deep network training by reducing internal covariate shift (2015)
Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization, and in some cases eliminates the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.82% top-5 test error, exceeding the accuracy of human raters.
摘要: 训练深度神经网络的复杂之处在于,随着前面各层参数的变化,每一层输入的分布在训练过程中也会发生变化。这就要求使用较低的学习率并进行仔细的参数初始化,从而减慢了训练速度,并使得训练带饱和非线性的模型变得非常困难。我们将这一现象称为内部协变量偏移,并通过对层输入进行归一化来解决该问题。我们方法的优势在于把归一化作为模型结构的一部分,并对每个训练小批量(mini-batch)执行归一化。批量归一化使我们可以使用高得多的学习率,并且不必那么在意初始化;在某些情况下还可以省去 Dropout。将批量归一化应用于最先进的图像分类模型,只需原来 1/14 的训练步数即可达到相同精度,并以显著优势超过原始模型。利用批量归一化网络的集成,我们改进了 ImageNet 分类上已发表的最佳结果:top-5 测试错误率达到 4.82%,超过了人类标注者的准确率。
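下面给出批量归一化前向计算的一个 NumPy 最小示意:对每个特征维度按小批量求均值和方差做标准化,再用可学习的 gamma、beta 做缩放平移。示例数据为假设,训练时的滑动统计量与反向传播未包含在内。

```python
import numpy as np

def batch_norm_forward(x, gamma, beta, eps=1e-5):
    """x: (N, D) 的一个小批量; gamma, beta: (D,) 的可学习缩放与平移参数。"""
    mu = x.mean(axis=0)                  # 按特征维度的批内均值
    var = x.var(axis=0)                  # 批内方差
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

rng = np.random.default_rng(0)
x = rng.standard_normal((32, 8)) * 5 + 2   # 假设的一层输入(被放大并偏移)
gamma, beta = np.ones(8), np.zeros(8)
y = batch_norm_forward(x, gamma, beta)
print(y.mean(axis=0).round(3), y.std(axis=0).round(3))  # 约为 0 和 1
```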
[14] Beyond correlation filters: Learning continuous convolution operators for visual tracking (2016)
Abstract: Discriminative Correlation Filters (DCF) have demonstrated excellent performance for visual object tracking. The key to their success is the ability to efficiently exploit available negative data by including all shifted versions of a training sample. However, the underlying DCF formulation is restricted to single-resolution feature maps, significantly limiting its potential. In this paper, we go beyond the conventional DCF framework and introduce a novel formulation for training continuous convolution filters. We employ an implicit interpolation model to pose the learning problem in the continuous spatial domain. Our proposed formulation enables efficient integration of multi-resolution deep feature maps, leading to superior results on three object tracking benchmarks: OTB-2015 (+5.1% in mean OP), Temple-Color (+4.6% in mean OP), and VOT2015 (20% relative reduction in failure rate). Additionally, our approach is capable of sub-pixel localization, crucial for the task of accurate feature point tracking. We also demonstrate the effectiveness of our learning formulation in extensive feature point tracking experiments.
摘要: 判别相关滤波器(DCF)在视觉目标跟踪方面表现出色。其成功的关键在于通过包含训练样本的所有平移版本来高效利用可用的负样本数据。但是,基础的 DCF 公式仅限于单分辨率特征图,极大地限制了其潜力。在本文中,我们超越了常规的 DCF 框架,提出了一种用于训练连续卷积滤波器的新颖公式。我们采用隐式插值模型,在连续空间域中提出学习问题。我们提出的公式可实现多分辨率深度特征图的高效融合,从而在三个目标跟踪基准上取得优异结果:OTB-2015(平均 OP 提升 5.1%)、Temple-Color(平均 OP 提升 4.6%)和 VOT2015(失败率相对降低 20%)。此外,我们的方法能够进行亚像素定位,这对精确特征点跟踪任务至关重要。我们还在大量特征点跟踪实验中证明了所提学习公式的有效性。
下载地址 | 返回目录 | [10.1007/978-3-319-46454-1_29]
[15] Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 (2016)
Abstract: We introduce a method to train Binarized Neural Networks (BNNs) - neural networks with binary weights and activations at run-time. At training-time the binary weights and activations are used for computing the parameters' gradients. During the forward pass, BNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations, which is expected to substantially improve power-efficiency. To validate the effectiveness of BNNs we conduct two sets of experiments on the Torch7 and Theano frameworks. On both, BNNs achieved nearly state-of-the-art results over the MNIST, CIFAR-10 and SVHN datasets. Last but not least, we wrote a binary matrix multiplication GPU kernel with which it is possible to run our MNIST BNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy. The code for training and running our BNNs is available on-line.
摘要: 我们提出了一种训练二值化神经网络(BNN)的方法,即在运行时权重和激活均为二值的神经网络。在训练时,二值化的权重和激活被用于计算参数梯度。在前向传播过程中,BNN 大幅减少内存占用和访问次数,并用按位运算代替大多数算术运算,有望显著提高能效。为了验证 BNN 的有效性,我们在 Torch7 和 Theano 框架上进行了两组实验。在这两个框架上,BNN 都在 MNIST、CIFAR-10 和 SVHN 数据集上取得了接近最新水平的结果。最后但同样重要的是,我们编写了一个二值矩阵乘法 GPU 内核,借助它可以使 MNIST BNN 的运行速度比未优化的 GPU 内核快 7 倍,而分类精度没有任何损失。用于训练和运行 BNN 的代码已在线提供。
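下面是摘要所述“权重/激活约束为 ±1”思路的一个 NumPy 最小示意:前向用 sign 函数二值化,反向用直通估计器(straight-through estimator)把梯度近似传回实值权重。示例数据为假设,并非论文的完整训练流程。

```python
import numpy as np

def binarize(w):
    """前向:把实值权重/激活二值化为 +1 或 -1。"""
    return np.where(w >= 0, 1.0, -1.0)

def ste_grad(w, grad_wb, clip=1.0):
    """反向:直通估计器,在 |w| <= clip 范围内把二值权重的梯度直接传给实值权重。"""
    return grad_wb * (np.abs(w) <= clip)

rng = np.random.default_rng(0)
w_real = rng.standard_normal(5) * 0.5     # 保留的实值“影子”权重
w_bin = binarize(w_real)                  # 前向计算使用的二值权重
grad_from_loss = rng.standard_normal(5)   # 假设由损失反传回 w_bin 的梯度
w_real -= 0.1 * ste_grad(w_real, grad_from_loss)  # 用 STE 更新实值权重
print(w_bin, w_real.round(3))
```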
[16] Building high-level features using large scale unsupervised learning (2013)
Abstract: We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a deep sparse autoencoder on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200×200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting from these learned features, we trained our network to recognize 22,000 object categories from ImageNet and achieve a leap of 70% relative improvement over the previous state-of-the-art. © 2013 IEEE.
摘要: 我们考虑仅从未标记数据构建高层的、特定类别的特征检测器的问题。例如,是否可以仅使用未标记的图像来学习人脸检测器?为了回答这个问题,我们在一个大型图像数据集上训练了一个深度稀疏自编码器(该模型有 10 亿个连接,数据集包含从互联网下载的 1000 万张 200×200 像素的图像)。我们在拥有 1,000 台机器(16,000 个核心)的集群上,使用模型并行和异步 SGD 训练该网络三天。与普遍的直觉相反,我们的实验结果表明,可以在不将图像标注为“包含/不包含人脸”的情况下训练出人脸检测器。对照实验表明,该特征检测器不仅对平移具有鲁棒性,而且对缩放和平面外旋转也具有鲁棒性。我们还发现,同一网络对猫脸和人体等其他高层概念也很敏感。从这些学到的特征出发,我们训练网络识别 ImageNet 中的 22,000 个物体类别,相对于此前的最新水平取得了 70% 的相对提升。© 2013 IEEE。
下载地址 | 返回目录 | [10.1109/ICASSP.2013.6639343]
[17] Character-Aware neural language models (2016)
Abstract: We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state-of-the-art despite having 60% fewer parameters. On languages with rich morphology (Arabic, Czech, French, German, Spanish, Russian), the model outperforms word-level/morpheme-level LSTM baselines, again with fewer parameters. The results suggest that on many languages, character inputs are sufficient for language modeling. Analysis of word representations obtained from the character composition part of the model reveals that the model is able to encode, from characters only, both semantic and orthographic information.
摘要: 我们描述了一个仅依赖字符级输入的简单神经语言模型,其预测仍然在词级别上进行。我们的模型对字符使用卷积神经网络(CNN)和 highway 网络,其输出被送入一个长短期记忆(LSTM)循环神经网络语言模型(RNN-LM)。在英语 Penn Treebank 上,尽管参数少了 60%,该模型仍与现有最新水平相当。在形态丰富的语言(阿拉伯语、捷克语、法语、德语、西班牙语、俄语)上,该模型优于词级/词素级 LSTM 基线,而且参数同样更少。结果表明,在许多语言上,字符输入足以进行语言建模。对从模型的字符组合部分获得的词表示的分析表明,该模型能够仅从字符中同时编码语义和正字法信息。
[18] Collective robot reinforcement learning with distributed asynchronous guided policy search (2017)
Abstract: Policy search methods and, more broadly, reinforcement learning can enable robots to learn highly complex and general skills that may allow them to function amid the complexity and diversity of the real world. However, training a policy that generalizes well across a wide range of real-world conditions requires far greater quantity and diversity of experience than is practical to collect with a single robot. Fortunately, it is possible for multiple robots to share their experience with one another, and thereby, learn a policy collectively. In this work, we explore distributed and asynchronous policy learning as a means to achieve generalization and improved training times on challenging, real-world manipulation tasks. We propose a distributed and asynchronous version of guided policy search and use it to demonstrate collective policy learning on a vision-based door opening task using four robots. We describe how both policy learning and data collection can be conducted in parallel across multiple robots, and present a detailed empirical evaluation of our system. Our results indicate that distributed learning significantly improves training time, and that parallelizing policy learning and data collection substantially improves utilization. We also demonstrate that we can achieve substantial generalization on a challenging real-world door opening task.
摘要: 策略搜索方法,以及更广泛的强化学习,可以使机器人学习高度复杂且通用的技能,从而使它们能够在真实世界的复杂性和多样性中发挥作用。但是,要训练一个能在广泛的真实条件下良好泛化的策略,所需经验的数量和多样性远远超出单个机器人实际能够收集的范围。幸运的是,多个机器人可以彼此共享经验,从而共同学习一个策略。在这项工作中,我们探索分布式异步策略学习,以此在具有挑战性的真实操作任务上实现泛化并缩短训练时间。我们提出了引导策略搜索(guided policy search)的一种分布式异步版本,并用它在基于视觉的开门任务上演示了四台机器人的集体策略学习。我们描述了如何在多台机器人上并行地进行策略学习和数据收集,并给出了对系统的详细实证评估。结果表明,分布式学习显著缩短了训练时间,而策略学习与数据收集的并行化大幅提高了利用率。我们还证明,在具有挑战性的真实开门任务上可以实现可观的泛化。
下载地址 | 返回目录 | [10.1109/IROS.2017.8202141]
[19] Colorful image colorization (2016)
Abstract: Given a grayscale photograph as input, this paper attacks the problem of hallucinating a plausible color version of the photograph. This problem is clearly underconstrained, so previous approaches have either relied on significant user interaction or resulted in desaturated colorizations. We propose a fully automatic approach that produces vibrant and realistic colorizations. We embrace the underlying uncertainty of the problem by posing it as a classification task and use class-rebalancing at training time to increase the diversity of colors in the result. The system is implemented as a feed-forward pass in a CNN at test time and is trained on over a million color images. We evaluate our algorithm using a “colorization Turing test,” asking human participants to choose between a generated and ground truth color image. Our method successfully fools humans on 32{\%} of the trials, significantly higher than previous methods. Moreover, we show that colorization can be a powerful pretext task for self-supervised feature learning, acting as a cross-channel encoder. This approach results in state-of-the-art performance on several feature learning benchmarks.
摘要: 给定一张灰度照片作为输入,本文要解决的问题是为其“幻想”出一个看起来合理的彩色版本。这个问题显然是欠约束的,因此以前的方法要么依赖大量用户交互,要么产生欠饱和的着色结果。我们提出了一种全自动方法,可以产生生动逼真的着色。我们把着色视为一个分类任务,从而接受该问题内在的不确定性,并在训练时使用类别重平衡来增加结果中颜色的多样性。该系统在测试时作为 CNN 中的一次前馈计算实现,并在超过一百万张彩色图像上训练。我们使用“着色图灵测试”评估算法,让人类参与者在生成图像和真实彩色图像之间进行选择。我们的方法在 32% 的试验中成功骗过了人类,显著高于以前的方法。此外,我们表明着色可以作为跨通道编码器,成为自监督特征学习的一个强大前置任务(pretext task)。这种方法在多个特征学习基准上取得了最先进的性能。
下载地址 | 返回目录 | [10.1007/978-3-319-46487-9_40]
[20] Conditional image generation with PixelCNN decoders (2016)
Abstract: This work explores conditional image generation with a new image density model based on the PixelCNN architecture. The model can be conditioned on any vector, including descriptive labels or tags, or latent embeddings created by other networks. When conditioned on class labels from the ImageNet database, the model is able to generate diverse, realistic scenes representing distinct animals, objects, landscapes and structures. When conditioned on an embedding produced by a convolutional network given a single image of an unseen face, it generates a variety of new portraits of the same person with different facial expressions, poses and lighting conditions. We also show that conditional PixelCNN can serve as a powerful decoder in an image autoencoder. Additionally, the gated convolutional layers in the proposed model improve the log-likelihood of PixelCNN to match the state-of-the-art performance of PixelRNN on ImageNet, with greatly reduced computational cost.
摘要: 这项工作基于 PixelCNN 架构的一种新图像密度模型来探索条件图像生成。该模型可以以任意向量为条件,包括描述性标签或标记,或由其他网络产生的潜在嵌入。当以 ImageNet 数据库中的类别标签为条件时,该模型能够生成代表不同动物、物体、风景和建筑物的多样而逼真的场景。当以卷积网络根据一张未见过的人脸图像产生的嵌入为条件时,它会生成同一个人在不同表情、姿势和光照条件下的各种新肖像。我们还表明,条件 PixelCNN 可以作为图像自编码器中强大的解码器。此外,所提出模型中的门控卷积层提高了 PixelCNN 的对数似然,使其在 ImageNet 上与 PixelRNN 的最新性能相当,同时大大降低了计算成本。
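下面给出 PixelCNN 这类自回归像素模型所依赖的“掩码卷积”的一个 NumPy 最小示意:通过把卷积核中当前像素之后(光栅顺序)的位置清零,保证每个像素只依赖它之前的像素。卷积核大小与随机数据均为假设示例。

```python
import numpy as np

def causal_mask(k, include_center=False):
    """返回 k x k 的掩码:光栅顺序在中心像素之前的位置为 1,其余为 0。
    include_center=False 对应第一层(A 型掩码)。"""
    mask = np.zeros((k, k))
    c = k // 2
    mask[:c, :] = 1          # 中心行以上全部可见
    mask[c, :c] = 1          # 中心行中,中心左侧可见
    if include_center:
        mask[c, c] = 1       # 后续层(B 型掩码)允许看到自身位置
    return mask

kernel = np.random.default_rng(0).standard_normal((5, 5))
masked_kernel = kernel * causal_mask(5)   # 卷积前先对权重施加掩码
print(causal_mask(5))
```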
[21] Continuous control with deep reinforcement learning (2016)
Abstract: We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies “end-to-end”: directly from raw pixel inputs.
摘要: 我们把深度 Q 学习成功背后的思想推广到连续动作领域。我们提出了一种基于确定性策略梯度的 actor-critic、无模型算法,可以在连续动作空间上运行。使用相同的学习算法、网络结构和超参数,我们的算法稳健地解决了 20 多个模拟物理任务,包括倒立摆摆起(cartpole swing-up)、灵巧操纵、腿式运动和汽车驾驶等经典问题。我们的算法找到的策略,其性能可与能够完全访问环境动力学及其导数的规划算法所找到的策略相媲美。我们进一步证明,对于其中许多任务,该算法可以“端到端”地学习策略:直接从原始像素输入学习。
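摘要中这类深度 actor-critic 方法(如 DDPG)通常依赖目标网络的“软更新”来稳定训练;下面用 NumPy 给出这一思路的最小示意。参数用简单数组代替真实网络,tau 的取值为假设示例。

```python
import numpy as np

def soft_update(target_params, online_params, tau=0.001):
    """目标网络参数缓慢跟随在线网络:theta_target <- tau*theta + (1-tau)*theta_target。"""
    return [(1 - tau) * t + tau * o for t, o in zip(target_params, online_params)]

rng = np.random.default_rng(0)
online = [rng.standard_normal((4, 4)), rng.standard_normal(4)]   # 假设的在线网络参数
target = [p.copy() for p in online]                              # 初始化为相同参数

# 每个训练步之后对目标网络做一次软更新
online = [p + 0.1 * rng.standard_normal(p.shape) for p in online]  # 模拟一次梯度更新
target = soft_update(target, online, tau=0.001)
print(np.abs(target[0] - online[0]).mean())  # 目标网络只移动了很小一步
```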
[22] Continuous deep q-learning with model-based acceleration (2016)
Abstract: Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free algorithms, particularly when using high-dimensional function approximators, tends to limit their applicability to physical systems. In this paper, we explore algorithms and representations to reduce the sample complexity of deep reinforcement learning for continuous control tasks. We propose two complementary techniques for improving the efficiency of such algorithms. First, we derive a continuous variant of the Q-learning algorithm, which we call normalized advantage functions (NAF), as an alternative to the more commonly used policy gradient and actor-critic methods. NAF representation allows us to apply Q-learning with experience replay to continuous tasks, and substantially improves performance on a set of simulated robotic control tasks. To further improve the efficiency of our approach, we explore the use of learned models for accelerating model-free reinforcement learning. We show that iteratively refitted local linear models are especially effective for this, and demonstrate substantially faster learning on domains where such models are applicable.
摘要: 无模型强化学习已成功应用于一系列具有挑战性的问题,并且最近已扩展到处理大型神经网络策略和价值函数。但是,无模型算法的样本复杂度,尤其是在使用高维函数逼近器时,往往限制了它们在物理系统中的适用性。在本文中,我们探索能降低连续控制任务中深度强化学习样本复杂度的算法和表示形式。我们提出了两种互补的技术来提高此类算法的效率。首先,我们推导了 Q 学习算法的一个连续变体,称为归一化优势函数(NAF),作为更常用的策略梯度和 actor-critic 方法的替代方案。NAF 表示使我们能够将带经验回放的 Q 学习应用于连续任务,并显著提高了一组模拟机器人控制任务的性能。为了进一步提高方法的效率,我们探索了使用学习到的模型来加速无模型强化学习。我们证明,迭代重新拟合的局部线性模型对此特别有效,并在适用此类模型的领域上展示了明显更快的学习速度。
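摘要中的归一化优势函数(NAF)把 Q 函数分解为 Q(s,a) = V(s) + A(s,a),其中优势项是关于动作的二次型 A(s,a) = -1/2 (a - mu(s))^T P(s) (a - mu(s));下面用 NumPy 给出这一分解的最小示意。V、mu、L 通常由网络输出,这里用假设的数值代替。

```python
import numpy as np

def naf_q_value(a, v, mu, L):
    """NAF 的 Q 值:V(s) 加上以 mu(s) 为峰值的二次优势项。
    L: 下三角矩阵,P = L @ L.T 保证正定,因此 a = mu 时 Q 取最大值 V。"""
    P = L @ L.T
    diff = a - mu
    advantage = -0.5 * diff @ P @ diff
    return v + advantage

v = 1.5                          # 假设的状态价值 V(s)
mu = np.array([0.2, -0.1])       # 假设的最优动作 mu(s)
L = np.array([[1.0, 0.0],
              [0.3, 0.8]])       # 假设的下三角矩阵输出
print(naf_q_value(mu, v, mu, L))                    # 等于 V(s)
print(naf_q_value(np.array([1.0, 1.0]), v, mu, L))  # 偏离 mu 时 Q 变小
```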
[23] Controlling perceptual factors in neural style transfer (2017)
Abstract: Neural Style Transfer has shown very exciting results enabling new forms of image manipulation. Here we extend the existing method to introduce control over spatial location, colour information and across spatial scale12. We demonstrate how this enhances the method by allowing high-resolution controlled stylisation and helps to alleviate common failure cases such as applying ground textures to sky regions. Furthermore, by decomposing style into these perceptual factors we enable the combination of style information from multiple sources to generate new, perceptually appealing styles from existing ones. We also describe how these methods can be used to more efficiently produce large size, high-quality stylisation. Finally we show how the introduced control measures can be applied in recent methods for Fast Neural Style Transfer.
摘要: 神经风格迁移已展示出非常令人兴奋的结果,使新形式的图像处理成为可能。在这里,我们扩展了现有方法,引入对空间位置、颜色信息以及不同空间尺度的控制。我们展示了这如何通过支持高分辨率的可控风格化来增强该方法,并有助于缓解常见的失败情形,例如把地面纹理应用到天空区域。此外,通过把风格分解为这些感知因素,我们能够组合来自多个来源的风格信息,从现有风格中生成新的、具有感知吸引力的风格。我们还描述了如何用这些方法更高效地生成大尺寸、高质量的风格化结果。最后,我们展示了所引入的控制手段如何应用于近期的快速神经风格迁移方法。
下载地址 | 返回目录 | [10.1109/CVPR.2017.397]
[24] DRAW: A Recurrent Neural Network For Image Generation (2014)
Abstract: This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation. DRAW networks combine a novel spatial attention mechanism that mimics the foveation of the human eye, with a sequential variational auto-encoding framework that allows for the iterative construction of complex images. The system substantially improves on the state of the art for generative models on MNIST, and, when trained on the Street View House Numbers dataset, it generates images that cannot be distinguished from real data with the naked eye.
摘要: 本文介绍了用于图像生成的 Deep Recurrent Attentive Writer(DRAW)神经网络架构。DRAW 网络把一种模仿人眼注视(foveation)过程的新颖空间注意力机制,与一个允许迭代构建复杂图像的序列化变分自编码框架结合在一起。该系统大幅改进了 MNIST 上生成模型的最新水平;在 Street View House Numbers 数据集上训练后,其生成的图像肉眼无法与真实数据区分。
[25] Decoupled neural interfaces using synthetic gradients (2017)
Abstract: Training directed neural networks typically requires forward-propagating data through a computation graph, followed by backpropagating error signal, to produce weight updates. All layers, or more generally, modules, of the network are therefore locked, in the sense that they must wait for the remainder of the network to execute forwards and propagate error backwards before they can be updated. In this work we break this constraint by decoupling modules by introducing a model of the future computation of the network graph. These models predict what the result of the modelled subgraph will produce using only local information. In particular we focus on modelling error gradients: by using the modelled synthetic gradient in place of true backpropagated error gradients we decouple subgraphs, and can update them independently and asynchronously i.e. we realise decoupled neural interfaces. We show results for feed-forward models, where every layer is trained asynchronously, recurrent neural networks (RNNs) where predicting one's future gradient extends the time over which the RNN can effectively model, and also a hierarchical RNN system with ticking at different timescales. Finally, we demonstrate that in addition to predicting gradients, the same framework can be used to predict inputs, resulting in models which are decoupled in both the forward and backwards pass - amounting to independent networks which co-learn such that they can be composed into a single functioning corporation.
摘要: 训练有向神经网络通常需要先通过计算图前向传播数据,再反向传播误差信号,从而产生权重更新。因此,网络的所有层(或更一般地说,所有模块)都是被“锁住”的:它们必须等待网络的其余部分完成前向计算并把误差反向传播回来之后才能更新。在这项工作中,我们通过为网络图的未来计算引入一个模型来解耦各模块,从而打破这一约束。这些模型仅使用局部信息来预测被建模子图的计算结果。我们特别关注对误差梯度的建模:用建模得到的合成梯度(synthetic gradient)代替真实的反向传播误差梯度,我们把子图解耦,并可以独立、异步地更新它们,即实现了解耦神经接口。我们展示了前馈模型(其中每一层都异步训练)的结果,以及循环神经网络(RNN)的结果:预测自己未来的梯度可以延长 RNN 能够有效建模的时间跨度;还有一个在不同时间尺度上运作的分层 RNN 系统。最后,我们证明,除了预测梯度之外,同一框架还可以用于预测输入,从而得到在前向和反向传播中都解耦的模型,相当于多个独立的网络共同学习,从而可以组合成一个协同运作的整体。
[26] Deep Learning of Representations for Unsupervised and Transfer Learning (2011)
Abstract: Deep learning algorithms seek to exploit the unknown structure in the input distribution in order to discover good representations, often at multiple levels, with higher-level learned features defined in terms of lower-level features. The objective is to make these higher- level representations more abstract, with their individual features more invariant to most of the variations that are typically present in the training distribution, while collectively preserving as much as possible of the information in the input. Ideally, we would like these representations to disentangle the unknown factors of variation that underlie the training distribution. Such unsupervised learning of representations can be exploited usefully under the hypothesis that the input distribution P(x) is structurally related to some task of interest, say predicting P(y|x). This paper focusses on why unsupervised pre-training of representations can be useful, and how it can be exploited in the transfer learning scenario, where we care about predictions on examples that are not from the same distribution as the training distribution
摘要: 深度学习算法试图利用输入分布中的未知结构来发现良好的表示,这通常在多个层次上进行,其中较高层次的学习特征是用较低层次的特征来定义的。其目标是使这些较高层次的表示更加抽象,使其中的各个特征对训练分布中常见的大多数变化更加不变,同时整体上尽可能多地保留输入中的信息。理想情况下,我们希望这些表示能够解开训练分布背后的未知变化因素。在“输入分布 P(x) 与某个感兴趣的任务(比如预测 P(y|x))在结构上相关”这一假设下,这种无监督的表示学习可以被有效利用。本文着重讨论无监督的表示预训练为什么有用,以及如何在迁移学习场景中加以利用:在这种场景中,我们关心的是对与训练分布不同分布的样本做出预测。
[27] Deep Neural Networks for Acoustic Modeling in the Presence of Noise (2018)
Abstract: Systems using deep neural network (DNN) have shown promising results in automatic speech recognition (ASR), where one of the biggest challenges is the recognition in noisy speech signals. We have combined two famous architectures of deep learning, the convolutional neural networks (CNN) for acoustic approach and a recurrent architecture with connectionist temporal classification (CTC) for sequential modeling, in order to decode the frames in a sequence forming a word. Experimental results show that the proposed architecture achieves improved performance over classical models, such as hidden model Markov (HMM) for labeling in variable time sequences in BioChaves database.
摘要: 使用深度神经网络(DNN)的系统已在自动语音识别(ASR)中显示出令人鼓舞的结果,其中最大的挑战之一是对含噪语音信号的识别。我们结合了两种著名的深度学习架构,即用于声学建模的卷积神经网络(CNN)和用于序列建模的带连接时序分类(CTC)的循环架构,以便对构成单词的帧序列进行解码。实验结果表明,在 BioChaves 数据库的可变长度时间序列标注任务上,所提出的架构相比隐马尔可夫模型(HMM)等经典模型取得了更好的性能。
下载地址 | 返回目录 | [10.1109/TLA.2018.8358674]
[28] Deep captioning with multimodal recurrent neural networks (m-RNN) (2015)
Abstract: In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. Image captions are generated according to this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of our model is validated on four benchmark datasets (IAPR TC-12, Flickr 8K, Flickr 30K and MS COCO). Our model outperforms the state-of-the-art methods. In addition, we apply the m-RNN model to retrieval tasks for retrieving images or sentences, and achieves significant performance improvement over the state-of-the-art methods which directly optimize the ranking objective function for retrieval. The project page of this work is: www.stat.ucla.edu/~junhua.mao/m-RNN.html.
摘要: 在本文中,我们提出了一种用于生成新颖图像描述的多模态循环神经网络(m-RNN)模型。它直接对在给定前面单词和图像的条件下生成一个单词的概率分布进行建模,图像描述即根据该分布生成。该模型由两个子网络组成:用于句子的深度循环神经网络和用于图像的深度卷积网络。这两个子网络在一个多模态层中相互作用,构成整个 m-RNN 模型。我们在四个基准数据集(IAPR TC-12、Flickr 8K、Flickr 30K 和 MS COCO)上验证了模型的有效性,其表现优于最新方法。此外,我们将 m-RNN 模型应用于检索图像或句子的检索任务,相比直接优化检索排序目标函数的最新方法,取得了显著的性能提升。这项工作的项目页面为:www.stat.ucla.edu/~junhua.mao/m-RNN.html。
[29] Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding (2016)
Abstract: Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources. To address this limitation, we introduce “deep compression”, a three stage pipeline: pruning, trained quantization and Huffman coding, that work together to reduce the storage requirement of neural networks by 35× to 49× without affecting their accuracy. Our method first prunes the network by learning only the important connections. Next, we quantize the weights to enforce weight sharing, finally, we apply Huffman coding. After the first two steps we retrain the network to fine tune the remaining connections and the quantized centroids. Pruning, reduces the number of connections by 9× to 13×; Quantization then reduces the number of bits that represent each connection from 32 to 5. On the ImageNet dataset, our method reduced the storage required by AlexNet by 35×, from 240MB to 6.9MB, without loss of accuracy. Our method reduced the size of VGG-16 by 49× from 552MB to 11.3MB, again with no loss of accuracy. This allows fitting the model into on-chip SRAM cache rather than off-chip DRAM memory. Our compression method also facilitates the use of complex neural networks in mobile applications where application size and download bandwidth are constrained. Benchmarked on CPU, GPU and mobile GPU, compressed network has 3× to 4× layerwise speedup and 3× to 7× better energy efficiency.
摘要: 神经网络既是计算密集型又是内存密集型的,因此很难部署在硬件资源有限的嵌入式系统上。为了解决这一限制,我们引入了“深度压缩”:一个由修剪、训练量化和霍夫曼编码组成的三阶段流水线,三者协同工作,可在不影响精度的情况下将神经网络的存储需求减少 35 倍到 49 倍。我们的方法首先通过只学习重要连接来修剪网络;接着量化权重以实施权重共享;最后应用霍夫曼编码。在前两步之后,我们重新训练网络以微调剩余的连接和量化的质心。修剪将连接数减少 9 倍到 13 倍;量化随后把表示每个连接的比特数从 32 降到 5。在 ImageNet 数据集上,我们的方法将 AlexNet 所需的存储空间减少 35 倍(从 240MB 到 6.9MB)而不损失精度,将 VGG-16 的大小减少 49 倍(从 552MB 到 11.3MB),同样没有精度损失。这使得模型可以放入片上 SRAM 缓存,而不是片外 DRAM 存储器。我们的压缩方法还有助于在应用大小和下载带宽受限的移动应用中使用复杂的神经网络。在 CPU、GPU 和移动 GPU 上进行基准测试,压缩后的网络具有 3 到 4 倍的逐层加速和 3 到 7 倍的能效提升。
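下面用 NumPy 给出摘要中“修剪 + 权重共享量化”两个阶段的最小示意:先按绝对值阈值把小权重置零,再把剩余权重就近映射到少量共享中心值。阈值比例、中心个数与示例权重均为假设;这里用等分位点近似聚类中心,并非论文中的 k-means 方案。

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((8, 8))

# 阶段一:按幅值修剪,把绝对值最小的 70% 权重置零(比例为假设)
threshold = np.quantile(np.abs(W), 0.7)
mask = np.abs(W) > threshold
W_pruned = W * mask

# 阶段二:权重共享量化,把非零权重映射到 4 个共享中心值
nonzero = W_pruned[mask]
centroids = np.quantile(nonzero, [0.125, 0.375, 0.625, 0.875])
idx = np.abs(nonzero[:, None] - centroids[None, :]).argmin(axis=1)
W_quant = W_pruned.copy()
W_quant[mask] = centroids[idx]           # 每个连接只需存 2 bit 的中心索引

print("非零比例:", mask.mean(), "使用的不同权重值个数:", len(np.unique(W_quant)) - 1)
```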
[30] Deep fragment embeddings for bidirectional image sentence mapping (2014)
Abstract: We introduce a model for bidirectional retrieval of images and sentences through a deep, multi-modal embedding of visual and natural language data. Unlike previous models that directly map images or sentences into a common embedding space, our model works on a finer level and embeds fragments of images (objects) and fragments of sentences (typed dependency tree relations) into a common space. We then introduce a structured max-margin objective that allows our model to explicitly associate these fragments across modalities. Extensive experimental evaluation shows that reasoning on both the global level of images and sentences and the finer level of their respective fragments improves performance on image-sentence retrieval tasks. Additionally, our model provides interpretable predictions for the image-sentence retrieval task since the inferred inter-modal alignment of fragments is explicit.
摘要: 我们提出了一种通过对视觉与自然语言数据进行深度多模态嵌入来双向检索图像和句子的模型。不同于以往把整幅图像或整句话直接映射到公共嵌入空间的模型,我们的模型在更细的粒度上工作,将图像片段(物体)和句子片段(类型化依存树关系)嵌入到一个公共空间中。然后,我们引入一个结构化的最大间隔(max-margin)目标,使模型能够跨模态显式地关联这些片段。大量实验评估表明,同时在图像与句子的全局层面及其各自片段的更细层面上进行推理,可以提高图像-句子检索任务的性能。此外,由于推断出的片段间跨模态对齐是显式的,我们的模型还能为图像-句子检索任务提供可解释的预测。
[31] Deep learning (2015)
Abstract: Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
摘要: 深度学习允许由多个处理层组成的计算模型学习具有多个抽象层次的数据表示。这些方法极大地提升了语音识别、视觉物体识别、物体检测以及药物发现和基因组学等许多其他领域的最新水平。深度学习使用反向传播算法来指示机器应如何改变其内部参数(这些参数用于从上一层的表示计算出每一层的表示),从而发现大型数据集中的复杂结构。深度卷积网络在处理图像、视频、语音和音频方面带来了突破,而循环网络则在文本和语音等序列数据上大放异彩。
下载地址 | 返回目录 | [10.1038/nature14539]
[32] Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates (2017)
Abstract: Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of the learning process in favor of achieving training times that are practical for real physical systems. This typically involves introducing hand-engineered policy representations and human-supplied demonstrations. Deep reinforcement learning alleviates this limitation by training general-purpose neural network policies, but applications of direct deep reinforcement learning algorithms have so far been restricted to simulated settings and relatively simple tasks, due to their apparent high sample complexity. In this paper, we demonstrate that a recent deep reinforcement learning algorithm based on off-policy training of deep Q-functions can scale to complex 3D manipulation tasks and can learn deep neural network policies efficiently enough to train on real physical robots. We demonstrate that the training times can be further reduced by parallelizing the algorithm across multiple robots which pool their policy updates asynchronously. Our experimental evaluation shows that our method can learn a variety of 3D manipulation skills in simulation and a complex door opening skill on real robots without any prior demonstrations or manually designed representations.
摘要: 强化学习有望使自主机器人在最少的人工干预下学习大量的行为技能。但是,强化学习在机器人上的应用通常会牺牲学习过程的自主性,以换取对真实物理系统而言可行的训练时间,这通常意味着引入人工设计的策略表示和人工提供的演示。深度强化学习通过训练通用神经网络策略缓解了这一限制,但由于样本复杂度明显偏高,直接的深度强化学习算法迄今仅限于模拟环境和相对简单的任务。在本文中,我们证明一种基于深度 Q 函数离策略训练的最新深度强化学习算法可以扩展到复杂的 3D 操作任务,并且能够足够高效地学习深度神经网络策略,从而在真实物理机器人上训练。我们还证明,通过在多台机器人之间并行化该算法并让它们异步汇集各自的策略更新,可以进一步缩短训练时间。实验评估表明,我们的方法无需任何先验演示或手工设计的表示,就能在仿真中学习多种 3D 操作技能,并在真实机器人上学习复杂的开门技能。
下载地址 | 返回目录 | [10.1109/ICRA.2017.7989385]
[33] Deep residual learning for image recognition (2016)
Abstract: Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers - 8× deeper than VGG nets [40] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.
摘要: 更深的神经网络更难训练。我们提出了一种残差学习框架,以简化比以往深得多的网络的训练。我们显式地把各层重新表述为参照层输入学习残差函数,而不是学习无参照的函数。我们提供了全面的经验证据,表明这些残差网络更容易优化,并且可以从大幅增加的深度中获得精度提升。在 ImageNet 数据集上,我们评估了深度多达 152 层的残差网络,其深度是 VGG 网络的 8 倍,但复杂度仍然更低。这些残差网络的集成在 ImageNet 测试集上取得了 3.57% 的错误率,该结果在 ILSVRC 2015 分类任务中获得第一名。我们还给出了在 CIFAR-10 上 100 层和 1000 层网络的分析。表示的深度对许多视觉识别任务至关重要。仅凭极深的表示,我们就在 COCO 物体检测数据集上获得了 28% 的相对提升。深度残差网络是我们参加 ILSVRC & COCO 2015 竞赛所提交方案的基础,我们还在 ImageNet 检测、ImageNet 定位、COCO 检测和 COCO 分割任务上获得了第一名。
下载地址 | 返回目录 | [10.1109/CVPR.2016.90]
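残差学习的核心是让堆叠层去拟合相对于输入的残差函数 F(x),输出为 y = F(x) + x。下面附一段基于 NumPy 的极简示意代码(全连接形式,函数名、维度与随机权重均为本文档自拟,并非论文官方实现),仅用于说明恒等捷径连接的计算方式。
```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, W1, W2):
    """残差块示意: 输出 = relu(x + F(x)), 其中 F 为两层带 ReLU 的变换。"""
    f = W2 @ relu(W1 @ x)   # 残差分支 F(x)
    return relu(x + f)      # 恒等捷径连接后再激活

rng = np.random.default_rng(0)
d = 8
x = rng.normal(size=d)
W1 = rng.normal(scale=0.1, size=(d, d))
W2 = rng.normal(scale=0.1, size=(d, d))
print(residual_block(x, W1, W2))
```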
[34] Deep speech 2: End-to-end speech recognition in English and Mandarin (2016)
Abstract: We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech-two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, enabling experiments that previously took weeks to now run in days. This allows us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
摘要: 我们证明了端到端深度学习方法可用于识别英语或汉语普通话这两种截然不同的语言。由于它用神经网络取代了由人工设计组件构成的整个流水线,端到端学习使我们能够处理各种语音,包括嘈杂环境、口音和不同语言。我们方法的关键在于对HPC技术的应用,使以前需要数周的实验如今可以在数天内完成。这让我们能够更快地迭代,以找出更优的体系结构和算法。因此,在若干情况下,当以标准数据集为基准进行评测时,我们的系统可与人工转录相媲美。最后,借助数据中心中一种称为Batch Dispatch的GPU批处理调度技术,我们证明了该系统可以低成本地在线部署,在大规模服务用户时提供低延迟。
[35] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs (2018)
Abstract: “In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or atrous convolution, as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of view of filters to incorporate larger context without increasing the number of parameters or the amount of computation. Second, we propose atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales. ASPP probes an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views, thus capturing objects as well as image context at multiple scales. Third, we improve the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. The commonly deployed combination of max-pooling and downsampling in DCNNs achieves invariance but has a toll on localization accuracy. We overcome this by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF), which is shown both qualitatively and quantitatively to improve localization performance. Our proposed DeepLab system sets the new state-of-art at the PASCAL VOC-2012 semantic image segmentation task, reaching 79.7 percent mIOU in the test set, and advances the results on three other datasets: PASCAL-Context, PASCAL-Person-Part, and Cityscapes. All of our code is made publicly available online.”
摘要: 在这项工作中,我们用深度学习解决语义图像分割任务,并做出三项经实验证明具有实质性实用价值的主要贡献。首先,我们强调带上采样滤波器的卷积,即空洞卷积(atrous convolution),是密集预测任务中的强大工具。空洞卷积使我们能够显式控制深度卷积神经网络中特征响应的计算分辨率,还能在不增加参数数量和计算量的情况下有效扩大滤波器的感受野以纳入更大的上下文。其次,我们提出空洞空间金字塔池化(ASPP),以在多个尺度上稳健地分割物体。ASPP用具有多种采样率和有效视场的滤波器探测输入的卷积特征层,从而在多个尺度上同时捕获物体和图像上下文。第三,我们通过结合DCNN与概率图模型的方法改进物体边界的定位。DCNN中常用的最大池化与下采样组合实现了不变性,但损害了定位精度。我们通过将最终DCNN层的响应与全连接条件随机场(CRF)相结合来克服这一问题,定性和定量结果均表明其改善了定位性能。我们提出的DeepLab系统在PASCAL VOC-2012语义图像分割任务上创造了新的最高水平,在测试集上达到79.7%的mIOU,并在PASCAL-Context、PASCAL-Person-Part和Cityscapes这三个数据集上推进了结果。我们的全部代码均已在线公开。
下载地址 | 返回目录 | [10.1109/TPAMI.2017.2699184]
[36] Distilling the Knowledge in a Neural Network (2015)
Abstract: A very simple way to improve the performance of almost any machine learning algorithm is to train many different models on the same data and then to average their predictions. Unfortunately, making predictions using a whole ensemble of models is cumbersome and may be too computationally expensive to allow deployment to a large number of users, especially if the individual models are large neural nets. Caruana and his collaborators have shown that it is possible to compress the knowledge in an ensemble into a single model which is much easier to deploy and we develop this approach further using a different compression technique. We achieve some surprising results on MNIST and we show that we can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model. We also introduce a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse. Unlike a mixture of experts, these specialist models can be trained rapidly and in parallel.
摘要: 提高几乎任何机器学习算法性能的一种非常简单的方法,是在同一数据上训练许多不同的模型,然后对它们的预测取平均。不幸的是,使用整个模型集成进行预测很笨重,并且计算代价可能过高,难以部署给大量用户,尤其当单个模型是大型神经网络时。Caruana及其合作者已经表明,可以将集成中的知识压缩到一个更易部署的单一模型中,我们使用一种不同的压缩技术进一步发展了这一方法。我们在MNIST上取得了一些令人惊讶的结果,并且表明,通过将模型集成中的知识蒸馏到单个模型中,可以显著改善一个被大量使用的商业系统的声学模型。我们还提出了一种新型集成,由一个或多个完整模型以及许多专门模型组成,这些专门模型学习区分完整模型容易混淆的细粒度类别。与混合专家模型(mixture of experts)不同,这些专门模型可以快速并行地训练。
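知识蒸馏的常见做法是用温度 T 软化教师模型的 softmax 输出作为"软目标",再让学生模型去拟合它。以下 NumPy 片段(温度取值与 logits 均为自拟示例,并非论文完整训练流程)给出软目标与蒸馏交叉熵的一个最小计算示意。
```python
import numpy as np

def softmax(z, T=1.0):
    z = np.asarray(z, dtype=float) / T
    z = z - z.max()                   # 数值稳定
    e = np.exp(z)
    return e / e.sum()

teacher_logits = np.array([6.0, 2.0, 1.0])
student_logits = np.array([4.0, 3.0, 0.5])
T = 4.0                               # 温度越高, 分布越"软"

soft_targets = softmax(teacher_logits, T)
student_soft = softmax(student_logits, T)

# 蒸馏损失: 软目标与学生软化输出之间的交叉熵
distill_loss = -np.sum(soft_targets * np.log(student_soft))
print(soft_targets, distill_loss)
```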
[37] Dueling Network Architectures for Deep Reinforcement Learning (2016)
Abstract: In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state value function and one for the state-dependent action advantage function. The main benefit of this factoring is to generalize learning across actions without imposing any change to the underlying reinforcement learning algorithm. Our results show that this architecture leads to better policy evaluation in the presence of many similar-valued actions. Moreover, the dueling architecture enables our RL agent to outperform the state-of-the-art on the Atari 2600 domain.
摘要: 近年来,在强化学习中使用深度表示已取得许多成功。然而,其中许多应用仍使用常规体系结构,例如卷积网络、LSTM或自编码器。在本文中,我们提出了一种用于无模型强化学习的新神经网络架构。我们的决斗(dueling)网络包含两个独立的估计器:一个用于状态价值函数,另一个用于依赖于状态的动作优势函数。这种分解的主要好处是,无需对底层强化学习算法做任何改动,即可在动作之间泛化学习。我们的结果表明,在存在许多价值相近的动作时,该架构能带来更好的策略评估。此外,决斗架构使我们的RL智能体在Atari 2600领域超越了当前最先进的水平。
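决斗架构把 Q 值分解为状态价值 V(s) 与动作优势 A(s,a),常用的聚合方式是减去优势的均值以消除分解的不唯一性。下面的 NumPy 片段(函数名与数值均为自拟示意)演示这一聚合步骤。
```python
import numpy as np

def dueling_q(value, advantages):
    """Q(s,a) = V(s) + (A(s,a) - mean_a A(s,a)), 减均值用于保证 V/A 分解可辨识。"""
    advantages = np.asarray(advantages, dtype=float)
    return value + (advantages - advantages.mean())

V = 1.5                                # 状态价值流的输出
A = np.array([0.2, -0.1, 0.4, 0.0])    # 每个动作的优势流输出
print(dueling_q(V, A))                 # 四个动作的 Q 值估计
```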
[38] Effective approaches to attention-based neural machine translation (2015)
Abstract: “An attentional mechanism has lately been used to improve neural machine translation (NMT) by selectively focusing on parts of the source sentence during translation. However, there has been little work exploring useful architectures for attention-based NMT. This paper examines two simple and effective classes of attentional mechanism: a global approach which always attends to all source words and a local one that only looks at a subset of source words at a time. We demonstrate the effectiveness of both approaches on the WMT translation tasks between English and German in both directions. With local attention, we achieve a significant gain of 5.0 BLEU points over non-attentional systems that already incorporate known techniques such as dropout. Our ensemble model using different attention architectures yields a new state-of-the-art result in the WMT15 English to German translation task with 25.9 BLEU points, an improvement of 1.0 BLEU points over the existing best system backed by NMT and an n-gram reranker.”
摘要: 近来,注意力机制已被用于在翻译过程中选择性地关注源句子的某些部分,从而改进神经机器翻译(NMT)。然而,探索适用于基于注意力的NMT的有效架构的工作还很少。本文研究了两类简单而有效的注意力机制:一种是始终关注所有源词的全局方法,另一种是每次只关注源词一个子集的局部方法。我们在WMT英德双向翻译任务上证明了这两种方法的有效性。使用局部注意力,相比已经采用dropout等已知技术的非注意力系统,我们取得了5.0个BLEU点的显著提升。我们使用不同注意力架构的集成模型在WMT15英译德任务上以25.9 BLEU创造了新的最高水平,比由NMT和n-gram重排序器支撑的现有最佳系统提高了1.0个BLEU点。
下载地址 | 返回目录 | [10.18653/v1/d15-1166]
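全局注意力在每个解码时刻对所有源端隐状态计算对齐权重,再加权求和得到上下文向量;局部注意力则只在一个位置窗口内做同样的事。下面用 NumPy 给出全局注意力(点积打分)的最小示意,张量形状与数值均为自拟,并非论文官方实现。
```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def global_attention(h_t, enc_states):
    """h_t: 当前解码隐状态 (d,); enc_states: 源端隐状态 (T, d)。"""
    scores = enc_states @ h_t          # 点积打分 score(h_t, h_s)
    align = softmax(scores)            # 对所有源位置的对齐权重
    context = align @ enc_states       # 加权求和得到上下文向量
    return context, align

rng = np.random.default_rng(1)
ctx, a = global_attention(rng.normal(size=4), rng.normal(size=(6, 4)))
print(a.sum(), ctx.shape)              # 权重和为1, 上下文维度与隐状态一致
```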
[39] End-to-end memory networks (2015)
Abstract: We introduce a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network 23 but unlike the model in that work, it is trained end-to-end, and hence requires significantly less supervision during training, making it more generally applicable in realistic settings. It can also be seen as an extension of RNNsearch 2 to the case where multiple computational steps (hops) are performed per output symbol. The flexibility of the model allows us to apply it to tasks as diverse as (synthetic) question answering 22 and to language modeling. For the former our approach is competitive with Memory Networks, but with less supervision. For the latter, on the Penn TreeBank and Text8 datasets our approach demonstrates comparable performance to RNNs and LSTMs. In both cases we show that the key concept of multiple computational hops yields improved results.
摘要: 我们在可能较大的外部存储器上引入具有递归注意模型的神经网络。该体系结构是内存网络的一种形式23,但与该模型不同,它是端对端训练的,因此在训练过程中所需的监督少得多,从而使其更普遍地应用于现实环境。它也可以看作是RNNsearch 2的扩展,扩展到对每个输出符号执行多个计算步骤(跳数)的情况。该模型的灵活性使我们能够将其应用于诸如(综合)问答22之类的各种任务以及语言建模。对于前者,我们的方法在内存网络方面具有竞争力,但监督较少。对于后者,在Penn TreeBank和Text8数据集上,我们的方法展示了与RNN和LSTM相当的性能。在这两种情况下,我们都表明,多次计算跃点的关键概念可以带来更好的结果。
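端到端记忆网络的单次"跳"(hop)可以概括为:用查询向量与各记忆向量做内积并经 softmax 得到注意力,再对输出记忆加权求和,并与查询相加得到新的内部状态。以下 NumPy 片段(维度与数值均为自拟示意,并非论文实现)演示单跳的计算,多跳即重复该过程。
```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def memory_hop(u, M_in, M_out):
    """u: 查询/内部状态 (d,); M_in/M_out: 输入与输出记忆 (N, d)。"""
    p = softmax(M_in @ u)      # 对每条记忆的注意力权重
    o = p @ M_out              # 输出记忆的加权和
    return u + o               # 新的内部状态, 可送入下一跳

rng = np.random.default_rng(2)
u = rng.normal(size=5)
M_in, M_out = rng.normal(size=(10, 5)), rng.normal(size=(10, 5))
print(memory_hop(u, M_in, M_out))
```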
[40] End-to-end training of deep visuomotor policies (2016)
Abstract: “Policy search methods can allow robots to learn control policies for a wide range of tasks, but practical applications of policy search often require hand-engineered components for perception, state estimation, and low-level control. In this paper, we aim to answer the following question: does training the perception and control systems jointly end-to-end provide better performance than training each component separately? To this end, we develop a method that can be used to learn policies that map raw image observations directly to torques at the robots motors. The policies are represented by deep convolutional neural networks (CNNs) with 92,000 parameters, and are trained using a guided policy search method, which transforms policy search into supervised learning, with supervision provided by a simple trajectory-centric reinforcement learning method. We evaluate our method on a range of real-world manipulation tasks that require close coordination between vision and control, such as screwing a cap onto a bottle, and present simulated comparisons to a range of prior policy search methods.”
摘要: 策略搜索方法可以让机器人学习各种任务的控制策略,但策略搜索的实际应用通常需要人工设计的组件来完成感知、状态估计和底层控制。在本文中,我们旨在回答以下问题:端到端地联合训练感知与控制系统,是否比分别训练各个组件获得更好的性能?为此,我们开发了一种方法,可用于学习将原始图像观测直接映射为机器人电机力矩的策略。这些策略由具有92,000个参数的深度卷积神经网络(CNN)表示,并使用引导策略搜索方法进行训练,该方法将策略搜索转化为监督学习,其监督信号由一种简单的以轨迹为中心的强化学习方法提供。我们在一系列需要视觉与控制紧密协调的真实操作任务(例如把瓶盖拧到瓶子上)上评估了我们的方法,并给出了与一系列已有策略搜索方法的仿真对比。
[41] Every picture tells a story: Generating sentences from images (2010)
Abstract: Humans can prepare concise descriptions of pictures, focusing on what they find important. We demonstrate that automatic methods can do so too. We describe a system that can compute a score linking an image to a sentence. This score can be used to attach a descriptive sentence to a given image, or to obtain images that illustrate a given sentence. The score is obtained by comparing an estimate of meaning obtained from the image to one obtained from the sentence. Each estimate of meaning comes from a discriminative procedure that is learned using data. We evaluate on a novel dataset consisting of human-annotated images. While our underlying estimate of meaning is impoverished, it is sufficient to produce very good quantitative results, evaluated with a novel score that can account for synecdoche. {\textcopyright} 2010 Springer-Verlag.
摘要: 人类可以为图片准备简明的描述,着重于他们认为重要的内容。我们证明了自动方法也能做到这一点。我们描述了一个可以计算图像与句子之间关联分数的系统。该分数可用于为给定图像附加描述性句子,或检索能够说明给定句子的图像。分数通过比较从图像获得的意义估计与从句子获得的意义估计而得到。每个意义估计都来自利用数据学习到的判别式过程。我们在一个由人工标注图像组成的新数据集上进行了评估。尽管我们底层的意义估计较为简陋,但足以产生非常好的定量结果,并用一种能够考虑提喻(synecdoche)的新颖评分进行评估。{\textcopyright} 2010 Springer-Verlag。
下载地址 | 返回目录 | [10.1007/978-3-642-15561-1_2]
[42] Evolving large-scale neural networks for vision-based reinforcement learning (2013)
Abstract: The idea of using evolutionary computation to train artificial neural networks, or neuroevolution (NE), for reinforcement learning (RL) tasks has now been around for over 20 years. However, as RL tasks become more challenging, the networks required become larger, so do their genomes. But, scaling NE to large nets (i.e. tens of thousands of weights) is infeasible using direct encodings that map genes one-to-one to network components. In this paper, we scale-up our “compressed” network encoding where network weight matrices are represented indirectly as a set of Fourier-type coefficients, to tasks that require very-large networks due to the high-dimensionality of their input space. The approach is demonstrated successfully on two reinforcement learning tasks in which the control networks receive visual input: (1) a vision-based version of the octopus control task requiring networks with over 3 thousand weights, and (2) a version of the TORCS driving game where networks with over 1 million weights are evolved to drive a car around a track using video images from the driver\s perspective. Copyright {\textcopyright} 2013 ACM.
摘要: 使用进化计算来训练人工神经网络(即神经进化,NE)以完成强化学习(RL)任务的想法已经存在20多年。然而,随着RL任务变得越来越具有挑战性,所需的网络越来越大,其基因组也随之变大。而使用将基因一对一映射到网络组件的直接编码,把NE扩展到大型网络(即数万个权重)是不可行的。在本文中,我们将网络权重矩阵间接表示为一组傅里叶型系数的"压缩"网络编码,扩展到那些因输入空间高维而需要非常大网络的任务上。该方法在两个控制网络接收视觉输入的强化学习任务上得到了成功验证:(1)基于视觉的章鱼控制任务,需要具有三千多个权重的网络;(2)TORCS赛车游戏的一个版本,其中演化出具有超过100万个权重的网络,以便利用驾驶员视角的视频图像驾驶汽车绕行赛道。版权所有 {\textcopyright} 2013 ACM。
下载地址 | 返回目录 | [10.1145/2463372.2463509]
[43] Fast R-CNN (2015)
Abstract: This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-CNN employs several innovations to improve training and testing speed while also increasing detection accuracy. Fast R-CNN trains the very deep VGG16 network 9x faster than R-CNN, is 213x faster at test-time, and achieves a higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ (using Caffe) and is available under the open-source MIT License at https://github.com/rbgirshick/fast-rcnn.
摘要: 本文提出了一种基于区域的快速卷积网络方法(Fast R-CNN)用于目标检测。Fast R-CNN在先前工作的基础上,使用深度卷积网络对候选目标区域进行高效分类。与之前的工作相比,Fast R-CNN采用了多项创新来提高训练和测试速度,同时提升检测精度。Fast R-CNN训练非常深的VGG16网络的速度比R-CNN快9倍,测试速度快213倍,并且在PASCAL VOC 2012上取得了更高的mAP。与SPPnet相比,Fast R-CNN训练VGG16快3倍、测试快10倍,而且更准确。Fast R-CNN用Python和C++(基于Caffe)实现,并以开源MIT许可发布于 https://github.com/rbgirshick/fast-rcnn。
下载地址 | 返回目录 | [10.1109/ICCV.2015.169]
[44] Fast and accurate recurrent neural network acoustic models for speech recognition (2015)
Abstract: We have recently shown that deep Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) outperform feed forward deep neural networks (DNNs) as acoustic models for speech recognition. More recently, we have shown that the performance of sequence trained context dependent (CD) hidden Markov model (HMM) acoustic models using such LSTM RNNs can be equaled by sequence trained phone models initialized with connectionist temporal classification (CTC). In this paper, we present techniques that further improve performance of LSTM RNN acoustic models for large vocabulary speech recognition. We show that frame stacking and reduced frame rate lead to more accurate models and faster decoding. CD phone modeling leads to further improvements. We also present initial results for LSTM RNN models outputting words directly.
摘要: 我们最近表明,作为语音识别的声学模型,深度长短期记忆(LSTM)循环神经网络(RNN)优于前馈深度神经网络(DNN)。更进一步,我们已经证明,使用这类LSTM RNN经序列训练的上下文相关(CD)隐马尔可夫模型(HMM)声学模型,其性能可以被用连接主义时序分类(CTC)初始化、再经序列训练的音素模型所追平。在本文中,我们提出了进一步提升用于大词汇量语音识别的LSTM RNN声学模型性能的技术。我们表明,帧堆叠和降低帧率可以带来更准确的模型和更快的解码。CD音素建模带来了进一步的改进。我们还给出了直接输出单词的LSTM RNN模型的初步结果。
[45] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (2017)
Abstract: “State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet 1 and Fast R-CNN 2 have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features - using the recently popular terminology of neural networks with attention mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model 3, our detection system has a frame rate of 5 fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.”
摘要: 最先进的目标检测网络依赖区域提议算法来假设目标位置。SPPnet[1]和Fast R-CNN[2]等进展缩短了这些检测网络的运行时间,使区域提议的计算成为瓶颈。在这项工作中,我们引入了区域提议网络(RPN),它与检测网络共享全图卷积特征,从而使区域提议几乎不增加额外开销。RPN是一个全卷积网络,可在每个位置同时预测目标边界框和目标性(objectness)得分。RPN经过端到端训练以生成高质量的区域提议,供Fast R-CNN用于检测。我们进一步通过共享卷积特征将RPN和Fast R-CNN合并为单个网络——借用最近流行的带"注意力"机制的神经网络术语,RPN组件告诉统一网络应该看哪里。对于非常深的VGG-16模型[3],我们的检测系统在GPU上的帧率为5 fps(包含所有步骤),同时在PASCAL VOC 2007、2012和MS COCO数据集上以每幅图像仅300个提议取得了最先进的目标检测精度。在ILSVRC和COCO 2015竞赛中,Faster R-CNN和RPN是多个赛道第一名作品的基础。代码已公开发布。
下载地址 | 返回目录 | [10.1109/TPAMI.2016.2577031]
[46] Fcnt (2015)
Abstract: We propose a new approach for general object tracking with fully convolutional neural network. Instead of treating convolutional neural network (CNN) as a black-box feature extractor, we conduct in-depth study on the properties of CNN features offline pre-trained on massive image data and classification task on ImageNet. The discoveries motivate the design of our tracking system. It is found that convolutional layers in different levels characterize the target from different perspectives. A top layer encodes more semantic features and serves as a category detector, while a lower layer carries more discriminative information and can better separate the target from distracters with similar appearance. Both layers are jointly used with a switch mechanism during tracking. It is also found that for a tracking target, only a subset of neurons are relevant. A feature map selection method is developed to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Extensive evaluation on the widely used tracking benchmark 36 shows that the proposed tacker outperforms the state-of-the-art significantly.
摘要: 我们提出了一种使用全卷积神经网络进行通用目标跟踪的新方法。我们没有将卷积神经网络(CNN)当作黑盒特征提取器,而是深入研究了在海量图像数据和ImageNet分类任务上离线预训练的CNN特征的性质。这些发现启发了我们跟踪系统的设计。我们发现,不同层次的卷积层从不同角度刻画了目标:顶层编码更多语义特征,可充当类别检测器;而较低层携带更多判别信息,能更好地把目标与外观相似的干扰物区分开。跟踪过程中,这两层通过一种切换机制联合使用。我们还发现,对于一个跟踪目标,只有一部分神经元是相关的,因此提出了一种特征图选择方法来去除噪声和无关的特征图,从而减少计算冗余并提高跟踪精度。在广泛使用的跟踪基准[36]上的大量评估表明,所提出的跟踪器显著优于最新技术。
下载地址 | 返回目录 | [10.1109/ICCV.2015.357]
[47] From captions to visual concepts and back (2015)
Abstract: This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity models learnt directly from a dataset of image captions. We use multiple instance learning to train visual detectors for words that commonly occur in captions, including many different parts of speech such as nouns, verbs, and adjectives. The word detector outputs serve as conditional inputs to a maximum-entropy language model. The language model learns from a set of over 400,000 image descriptions to capture the statistics of word usage. We capture global semantics by re-ranking caption candidates using sentence-level features and a deep multimodal similarity model. Our system is state-of-the-art on the official Microsoft COCO benchmark, producing a BLEU-4 score of 29.1{\%}. When human judges compare the system captions to ones written by other people on our held-out test set, the system captions have equal or better quality 34{\%} of the time.
摘要: 本文提出了一种自动生成图像描述的新方法:视觉检测器、语言模型和多模态相似度模型均直接从图像字幕数据集中学习。我们使用多示例学习为字幕中常见的单词训练视觉检测器,这些单词涵盖名词、动词和形容词等多种词性。单词检测器的输出作为最大熵语言模型的条件输入。语言模型从超过40万条图像描述中学习,以捕获词语使用的统计规律。我们利用句子级特征和一个深度多模态相似度模型对候选字幕重新排序,从而捕获全局语义。我们的系统在Microsoft COCO官方基准上达到了最先进水平,BLEU-4得分为29.1{\%}。当人类评判者在我们保留的测试集上将系统生成的字幕与他人撰写的字幕进行比较时,系统字幕在34{\%}的情况下质量相当或更好。
下载地址 | 返回目录 | [10.1109/CVPR.2015.7298754]
[48] Fully Character-Level Neural Machine Translation without Explicit Segmentation (2017)
Abstract: “Most existing machine translation systems operate at the level of words, relying on explicit segmentation to extract tokens. We introduce a neural machine translation (NMT) model that maps a source character sequence to a target character sequence without any segmentation. We employ a character-level convolutional network with max-pooling at the encoder to reduce the length of source representation, allowing the model to be trained at a speed comparable to subword-level models while capturing local regularities. Our character-to-character model outperforms a recently proposed baseline with a subword-level encoder on WMT15 DE-EN and CS-EN, and gives comparable performance on FI-EN and RU-EN. We then demonstrate that it is possible to share a single character-level encoder across multiple languages by training a model on a many-to-one translation task. In this multilingual setting, the character-level encoder significantly outperforms the subword-level encoder on all the language pairs. We observe that on CS-EN, FI-EN and RU-EN, the quality of the multilingual character-level translation even surpasses the models specifically trained on that language pair alone, both in terms of the BLEU score and human judgment.”
摘要: 大多数现有机器翻译系统在单词级别上运行,依靠显式分词来提取词元。我们提出一种神经机器翻译(NMT)模型,它无需任何分词即可将源字符序列映射到目标字符序列。我们在编码器端采用带最大池化的字符级卷积网络来缩短源表示的长度,使模型能以与子词级模型相当的速度训练,同时捕获局部规律。我们的字符到字符模型在WMT15 DE-EN和CS-EN上优于最近提出的带子词级编码器的基线,并在FI-EN和RU-EN上取得了相当的性能。随后我们证明,通过在多对一翻译任务上训练模型,可以让多种语言共享同一个字符级编码器。在这种多语言设置中,字符级编码器在所有语言对上都显著优于子词级编码器。我们观察到,在CS-EN、FI-EN和RU-EN上,无论是BLEU得分还是人工评判,多语言字符级翻译的质量甚至超过了仅针对相应语言对单独训练的模型。
下载地址 | 返回目录 | [10.1162/tacl_a_00067]
[49] Fully Convolutional Networks for Semantic Segmentation (2017)
Abstract: “Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation. Our key insight is to build fully convolutional networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a skip architecture that combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer to produce accurate and detailed segmentations. Our fully convolutional networks achieve improved segmentation of PASCAL VOC (30{\%} relative improvement to 67.2{\%} mean IU on 2012), NYUDv2, SIFT Flow, and PASCAL-Context, while inference takes one tenth of a second for a typical image.”
摘要: 卷积网络是能产生特征层级的强大视觉模型。我们证明,仅用卷积网络本身、经端到端、像素到像素的训练,就能超越语义分割领域此前的最佳结果。我们的关键见解是构建"全卷积"网络,它接受任意尺寸的输入,并通过高效的推理和学习产生相应尺寸的输出。我们定义并详述了全卷积网络的设计空间,解释了它们在空间密集预测任务中的应用,并阐明了与先前模型的联系。我们将当代的分类网络(AlexNet、VGG网络和GoogLeNet)改造为全卷积网络,并通过在分割任务上微调来迁移其学到的表示。随后我们定义了一种跳跃(skip)架构,将深层、粗糙层的语义信息与浅层、精细层的外观信息相结合,以产生准确而精细的分割。我们的全卷积网络在PASCAL VOC(相对于2012年的结果,平均IU相对提升30{\%}达到67.2{\%})、NYUDv2、SIFT Flow和PASCAL-Context上均取得了更好的分割结果,而对一幅典型图像的推理只需十分之一秒。
下载地址 | 返回目录 | [10.1109/TPAMI.2016.2572683]
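FCN 的跳跃结构把深层(粗分辨率)的得分图上采样后与浅层(细分辨率)的得分图逐像素相加,从而兼顾语义与细节。以下 NumPy 片段用最近邻上采样做示意(论文使用可学习的反卷积;此处的形状、类别数与数值均为自拟假设)。
```python
import numpy as np

def upsample_nearest(x, factor):
    """最近邻上采样: (C, H, W) -> (C, H*factor, W*factor)。仅为示意, 论文中为可学习的反卷积。"""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)

rng = np.random.default_rng(7)
n_classes = 3
coarse = rng.normal(size=(n_classes, 4, 4))    # 深层、语义强但分辨率低的得分图
fine = rng.normal(size=(n_classes, 8, 8))      # 浅层、细节丰富的得分图
fused = upsample_nearest(coarse, 2) + fine     # 跳跃连接: 上采样后逐像素相加
pred = fused.argmax(axis=0)                    # 每个像素的类别预测
print(pred.shape)
```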
[50] Fully-convolutional siamese networks for object tracking (2016)
Abstract: “The problem of arbitrary object tracking has traditionally been tackled by learning a model of the objects appearance exclusively online, using as sole training data the video itself. Despite the success of these methods, their online-only approach inherently limits the richness of the model they can learn. Recently, several attempts have been made to exploit the expressive power of deep convolutional networks. However, when the object to track is not known beforehand, it is necessary to perform Stochastic Gradient Descent online to adapt the weights of the network, severely compromising the speed of the system. In this paper we equip a basic tracking algorithm with a novel fully-convolutional Siamese network trained end-to-end on the ILSVRC15 dataset for object detection in video. Our tracker operates at frame-rates beyond real-time and, despite its extreme simplicity, achieves state-of-the-art performance in multiple benchmarks.”
摘要: 传统上,任意目标跟踪问题是通过完全在线地学习目标外观模型来解决的,仅把视频本身作为训练数据。尽管这些方法取得了成功,但纯在线的方式从根本上限制了它们所能学到的模型的丰富程度。近来已有若干工作尝试利用深度卷积网络的表达能力。然而,当待跟踪目标事先未知时,就需要在线执行随机梯度下降来调整网络权重,这严重影响了系统速度。在本文中,我们为一个基础跟踪算法配备了一个新颖的全卷积孪生(Siamese)网络,该网络在ILSVRC15视频目标检测数据集上进行端到端训练。我们的跟踪器以超实时的帧率运行,并且尽管极其简单,仍在多个基准测试中达到了最先进的性能。
下载地址 | 返回目录 | [10.1007/978-3-319-48881-3_56]
[51] Functional magnetic resonance imaging as experienced by stroke survivors (2014)
Abstract: “Functional magnetic resonance imaging (fMRI), a noninvasive technique that measures brain activation, has been increasingly used in the past decade, particularly among older adults. Use of fMRI in research with stroke survivors in recent years has substantially contributed to researchers understanding of the pathophysiology of stroke sequelae. However, despite the increasing popularity and use of fMRI, little is known about the patient experience of fMRI under research circumstances. The current research brief reports the findings of a pilot study undertaken to understand stroke survivors experiences with fMRI under research circumstances. Nine ischemic stroke patients underwent two MRI sessions, each of which lasted 1.5 hours and included several fMRI tasks. Patients were asked about their experiences and to share any advice. All participants reported that they did not feel claustrophobic; in addition, the importance of educating participants about fMRI was a universal theme that emerged. Knowledge of participant experiences may help with enrollment strategies for fMRI studies and improve research outcomes related to the fMRI experience.”
摘要: 功能磁共振成像(fMRI)是一种测量大脑激活的非侵入性技术,在过去十年中使用日益增多,尤其是在老年人群中。近年来在中风幸存者研究中使用fMRI,极大地促进了研究人员对中风后遗症病理生理学的理解。然而,尽管fMRI日益普及,人们对研究情境下患者的fMRI体验仍知之甚少。本研究简报报告了一项试点研究的结果,旨在了解中风幸存者在研究情境下接受fMRI的体验。九名缺血性中风患者接受了两次MRI检查,每次持续1.5小时,包含若干fMRI任务。研究者询问了患者的体验并请他们提出建议。所有参与者都表示没有幽闭恐惧感;此外,向参与者普及fMRI知识的重要性是一个普遍浮现的主题。了解参与者的体验可能有助于fMRI研究的招募策略,并改善与fMRI体验相关的研究结果。
下载地址 | 返回目录 | [10.3928/19404921-20140820-01]
[52] Generating Sequences With Recurrent Neural Networks (2013)
Abstract: This paper shows how Long Short-term Memory recurrent neural networks can be used to generate complex sequences with long-range structure, simply by predicting one data point at a time. The approach is demonstrated for text (where the data are discrete) and online handwriting (where the data are real-valued). It is then extended to handwriting synthesis by allowing the network to condition its predictions on a text sequence. The resulting system is able to generate highly realistic cursive handwriting in a wide variety of styles.
摘要: 本文展示了如何利用长短期记忆(LSTM)循环神经网络,仅通过每次预测一个数据点来生成具有长程结构的复杂序列。该方法在文本(数据为离散值)和在线手写(数据为实值)上得到了演示,随后通过让网络以文本序列为条件进行预测而扩展到手写合成。最终的系统能够以多种风格生成高度逼真的连笔手写。
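"每次只预测一个数据点"的自回归采样循环大致如下:把上一步采到的符号反馈给网络,取其输出分布再采样。下面的 NumPy 片段用一个随机初始化的简化循环单元代替训练好的 LSTM,仅示意采样流程本身(模型、词表大小与维度均为自拟假设,并非论文实现)。
```python
import numpy as np

rng = np.random.default_rng(3)
V, H = 5, 16                                # 词表大小与隐层维度(自拟)
Wxh = rng.normal(scale=0.1, size=(H, V))
Whh = rng.normal(scale=0.1, size=(H, H))
Why = rng.normal(scale=0.1, size=(V, H))

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def sample_sequence(length, start=0):
    h = np.zeros(H)
    token, out = start, [start]
    for _ in range(length):
        x = np.eye(V)[token]                # 上一步的输出作为这一步的输入
        h = np.tanh(Wxh @ x + Whh @ h)      # 简化的循环单元(真实模型为 LSTM)
        p = softmax(Why @ h)                # 下一个符号的分布
        token = rng.choice(V, p=p)          # 采样一个数据点
        out.append(int(token))
    return out

print(sample_sequence(10))
```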
[53] Generative visual manipulation on the natural image manifold (2016)
Abstract: “Realistic image manipulation is challenging because it requires modifying the image appearance in a user-controlled way, while preserving the realism of the result. Unless the user has considerable artistic skill, it is easy to “fall off” the manifold of natural images while editing. In this paper, we propose to learn the natural image manifold directly from data using a generative adversarial neural network. We then define a class of image editing operations, and constrain their output to lie on that learned manifold at all times. The model automatically adjusts the output keeping all edits as realistic as possible. All our manipulations are expressed in terms of constrained optimization and are applied in near-real time. We evaluate our algorithm on the task of realistic photo manipulation of shape and color. The presented method can further be used for changing one image to look like the other, as well as generating novel imagery from scratch based on users scribbles.”
摘要: 逼真的图像编辑颇具挑战性,因为它要求以用户可控的方式修改图像外观,同时保持结果的真实感。除非用户具备相当的艺术功底,否则在编辑时很容易"掉出"自然图像的流形。在本文中,我们提出使用生成对抗神经网络直接从数据中学习自然图像流形。随后我们定义了一类图像编辑操作,并约束其输出始终落在学到的流形上。模型会自动调整输出,使所有编辑尽可能逼真。我们的所有操作都表述为约束优化问题,并可接近实时地执行。我们在对形状和颜色进行逼真照片编辑的任务上评估了该算法。所提方法还可用于把一幅图像改成另一幅的样子,以及根据用户的涂鸦从零生成新图像。
下载地址 | 返回目录 | [10.1007/978-3-319-46454-1_36]
[54] Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (2016)
Abstract: Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NMT's use in practical deployments and services, where both accuracy and speed are essential. In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues. Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To accelerate the final translation speed, we employ low-precision arithmetic during inference computations. To improve handling of rare words, we divide words into a limited set of common sub-word units (“wordpieces”) for both input and output. This method provides a good balance between the flexibility of “character”-delimited models and the efficiency of “word”-delimited models, naturally handles translation of rare words, and ultimately improves the overall accuracy of the system. Our beam search technique employs a length-normalization procedure and uses a coverage penalty, which encourages generation of an output sentence that is most likely to cover all the words in the source sentence. On the WMT'14 English-to-French and English-to-German benchmarks, GNMT achieves competitive results to state-of-the-art. Using a human side-by-side evaluation on a set of isolated simple sentences, it reduces translation errors by an average of 60{\%} compared to Google's phrase-based production system.
摘要: 神经机器翻译(NMT)是一种端到端的自动翻译学习方法,有望克服传统基于短语的翻译系统的许多弱点。不幸的是,NMT系统在训练和翻译推理上的计算代价都很高,而且大多数NMT系统难以处理罕见词。这些问题阻碍了NMT在对准确率和速度都有要求的实际部署和服务中的应用。在这项工作中,我们介绍Google的神经机器翻译系统GNMT,它试图解决其中的许多问题。我们的模型是一个带有8层编码器和8层解码器的深度LSTM网络,使用了注意力和残差连接。为了提高并行度从而缩短训练时间,我们的注意力机制将解码器的底层连接到编码器的顶层。为了加快最终翻译速度,我们在推理计算中采用低精度运算。为了改进对罕见词的处理,我们把输入和输出的单词都切分为一组有限的常见子词单元("wordpieces")。这种方法在字符级模型的灵活性与词级模型的效率之间取得了良好平衡,自然地处理了罕见词的翻译,并最终提高了系统的整体准确率。我们的束搜索技术采用长度归一化过程并使用覆盖惩罚,以鼓励生成最有可能覆盖源句子中所有单词的输出句子。在WMT'14英译法和英译德基准上,GNMT取得了与最先进水平相当的结果。在一组孤立的简单句子上进行人工并排评估,与Google基于短语的生产系统相比,它将翻译错误平均减少了60{\%}。
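摘要中提到的"长度归一化 + 覆盖惩罚"束搜索打分,大意是把候选译文的对数概率除以一个随长度增长的归一化项,并依据注意力分布对覆盖源词的程度进行奖惩。下面给出按笔者对论文公式的理解写的简化打分函数(函数名、参数取值与注意力矩阵均为自拟示意,并非生产实现)。
```python
import numpy as np

def gnmt_score(log_prob, attn, alpha=0.6, beta=0.2):
    """束搜索候选打分: 长度归一化 + 覆盖惩罚(按论文公式的一种复述, 参数为示意)。
    log_prob: 候选译文的对数概率(标量)
    attn: 注意力矩阵, 形状 (目标长度, 源长度)
    """
    y_len = attn.shape[0]
    lp = ((5.0 + y_len) / 6.0) ** alpha                     # 长度归一化项
    coverage = attn.sum(axis=0)                             # 每个源词被关注的总量
    cp = beta * np.sum(np.log(np.minimum(coverage, 1.0)))   # 覆盖惩罚
    return log_prob / lp + cp

attn = np.array([[0.7, 0.2, 0.1],
                 [0.2, 0.6, 0.2],
                 [0.1, 0.2, 0.7]])
print(gnmt_score(-4.2, attn))
```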
[55] Handbook of approximation algorithms and metaheuristics (2007)
Abstract: Delineating the tremendous growth in this area, the Handbook of Approximation Algorithms and Metaheuristics covers fundamental, theoretical topics as well as advanced, practical applications. It is the first book to comprehensively study both approximation algorithms and metaheuristics. Starting with basic approaches, the handbook presents the methodologies to design and analyze efficient approximation algorithms for a large class of problems, and to establish inapproximability results for another class of problems. It also discusses local search, neural networks, and metaheuristics, as well as multiobjective problems, sensitivity analysis, and stability. After laying this foundation, the book applies the methodologies to classical problems in combinatorial optimization, computational geometry, and graph problems. In addition, it explores large-scale and emerging applications in networks, bioinformatics, VLSI, game theory, and data analysis. Undoubtedly sparking further developments in the field, this handbook provides the essential techniques to apply approximation algorithms and metaheuristics to a wide range of problems in computer science, operations research, computer engineering, and economics. Armed with this information, researchers can design and analyze efficient algorithms to generate near-optimal solutions for a wide range of computational intractable problems.
摘要: 描述了该领域的巨大发展,《逼近算法和元启发式技术手册》涵盖了基本的理论主题以及先进的实际应用。这是第一本全面研究逼近算法和元启发式方法的书。从基本方法开始,该手册介绍了用于设计和分析针对一大类问题的有效逼近算法,以及为另一类问题建立不可逼近结果的方法。它还讨论了局部搜索,神经网络和元启发式方法,以及多目标问题,敏感性分析和稳定性。奠定了基础之后,该书将方法论应用于组合优化,计算几何和图形问题中的经典问题。此外,它还探索了网络,生物信息学,VLSI,博弈论和数据分析中的大规模新兴应用。毫无疑问,这激发了该领域的进一步发展,该手册提供了将逼近算法和元启发式方法应用于计算机科学,运筹学,计算机工程和经济学等众多问题的基本技术。有了这些信息,研究人员可以设计和分析有效的算法,以针对各种计算上的棘手问题生成接近最优的解决方案。
下载地址 | 返回目录 | [10.1201/9781420010749]
[56] Human-level control through deep reinforcement learning (2015)
Abstract: The theory of reinforcement learning provides a normative account, deeply rooted in psychological and neuroscientific perspectives on animal behaviour, of how agents may optimize their control of an environment. To use reinforcement learning successfully in situations approaching real-world complexity, however, agents are confronted with a difficult task: they must derive efficient representations of the environment from high-dimensional sensory inputs, and use these to generalize past experience to new situations. Remarkably, humans and other animals seem to solve this problem through a harmonious combination of reinforcement learning and hierarchical sensory processing systems, the former evidenced by a wealth of neural data revealing notable parallels between the phasic signals emitted by dopaminergic neurons and temporal difference reinforcement learning algorithms. While reinforcement learning agents have achieved some successes in a variety of domains, their applicability has previously been limited to domains in which useful features can be handcrafted, or to domains with fully observed, low-dimensional state spaces. Here we use recent advances in training deep neural networks to develop a novel artificial agent, termed a deep Q-network, that can learn successful policies directly from high-dimensional sensory inputs using end-to-end reinforcement learning. We tested this agent on the challenging domain of classic Atari 2600 games. We demonstrate that the deep Q-network agent, receiving only the pixels and the game score as inputs, was able to surpass the performance of all previous algorithms and achieve a level comparable to that of a professional human games tester across a set of 49 games, using the same algorithm, network architecture and hyperparameters. This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
摘要: 强化学习的理论提供了一种规范性的解释,它深深植根于关于动物行为的心理学和神经科学观点中,有关代理如何优化环境控制的。但是,要在逼近现实世界的情况下成功地使用强化学习,特工面临着一项艰巨的任务:他们必须从高维感官输入中获得对环境的有效表示,并利用它们将过去的经验推广到新的情况中。值得注意的是,人类和其他动物似乎通过强化学习和分层的感觉处理系统的和谐组合解决了这一问题,前者由大量的神经数据证明,显示出多巴胺能神经元发出的相位信号与时差强化学习算法之间存在明显的相似之处。尽管强化学习代理已在各种领域中取得了一些成功,但它们的适用性以前仅限于可以手工制作有用功能的领域,或者限于具有充分观察到的低维状态空间的领域。在这里,我们利用训练深度神经网络的最新进展来开发一种新型的人工代理,称为深度Q网络,它可以使用端到端强化学习直接从高维感官输入中学习成功的策略。我们在经典的Atari 2600游戏具有挑战性的领域上对该代理进行了测试。我们证明,仅接收像素和游戏得分作为输入的深度Q网络代理能够超越之前所有算法的性能,并在49款游戏中达到与专业人类游戏测试人员相当的水平,使用相同的算法,网络架构和超参数。这项工作弥合了高维感官输入和动作之间的鸿沟,从而产生了第一个能够学习在各种挑战性任务中表现出色的人工代理。”
下载地址 | 返回目录 | [10.1038/nature14236]
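深度Q网络训练时的回归目标是 y = r + γ·max_a' Q_target(s', a')(终止状态时只取 r),其中 Q_target 为参数定期同步的目标网络。以下 NumPy 片段用随机数代替网络输出(批大小、折扣因子等均为自拟假设),仅示意一批转移样本的目标值计算。
```python
import numpy as np

def td_targets(rewards, q_next, dones, gamma=0.99):
    """y_i = r_i + gamma * max_a Q_target(s'_i, a); 终止状态(done=1)不再引导。"""
    return rewards + gamma * (1.0 - dones) * q_next.max(axis=1)

rng = np.random.default_rng(4)
batch, n_actions = 5, 3
rewards = rng.normal(size=batch)
q_next = rng.normal(size=(batch, n_actions))   # 目标网络对 s' 的 Q 值(此处用随机数代替)
dones = np.array([0, 0, 1, 0, 0], dtype=float)
print(td_targets(rewards, q_next, dones))
```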
[57] Hybrid computing using a neural network with dynamic external memory (2016)
Abstract: Artificial neural networks are remarkably adept at sensory processing, sequence learning and reinforcement learning, but are limited in their ability to represent variables and data structures and to store data over long timescales, owing to the lack of an external memory. Here we introduce a machine learning model called a differentiable neural computer (DNC), which consists of a neural network that can read from and write to an external memory matrix, analogous to the random-access memory in a conventional computer. Like a conventional computer, it can use its memory to represent and manipulate complex data structures, but, like a neural network, it can learn to do so from data. When trained with supervised learning, we demonstrate that a DNC can successfully answer synthetic questions designed to emulate reasoning and inference problems in natural language. We show that it can learn tasks such as finding the shortest path between specified points and inferring the missing links in randomly generated graphs, and then generalize these tasks to specific graphs such as transport networks and family trees. When trained with reinforcement learning, a DNC can complete a moving blocks puzzle in which changing goals are specified by sequences of symbols. Taken together, our results demonstrate that DNCs have the capacity to solve complex, structured tasks that are inaccessible to neural networks without external read-write memory.
摘要: 人工神经网络非常擅长感知处理、序列学习和强化学习,但由于缺少外部存储器,它们在表示变量和数据结构以及在长时间尺度上存储数据方面能力有限。在这里,我们介绍一种称为可微分神经计算机(DNC)的机器学习模型,它由一个可以对外部记忆矩阵进行读写的神经网络组成,类似于传统计算机中的随机存取存储器。像传统计算机一样,它可以用记忆来表示和操作复杂的数据结构;而像神经网络一样,它又能从数据中学会这样做。在监督学习的训练下,我们证明DNC能够成功回答旨在模拟自然语言推理问题的合成问题。我们还表明,它可以学习诸如寻找指定点之间最短路径、推断随机生成图中缺失链接等任务,然后把这些任务泛化到运输网络、家谱等具体的图上。在强化学习的训练下,DNC可以完成一个由符号序列指定目标变化的积木移动益智任务。综合来看,我们的结果表明,DNC有能力解决那些没有外部读写存储器的神经网络所无法完成的复杂结构化任务。
下载地址 | 返回目录 | [10.1038/nature20101]
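DNC 从外部记忆矩阵读取信息的一种基本机制是基于内容的寻址:用读取键与每个记忆槽做余弦相似度,经 softmax 得到读取权重,再对记忆加权求和。以下 NumPy 片段(函数名、维度、键强度等均为自拟假设)仅示意这一读取步骤,省略了写入、时序链接等其余组件。
```python
import numpy as np

def content_read(memory, key, beta=5.0):
    """memory: (N, W) 记忆矩阵; key: (W,) 读取键; beta: 键强度。"""
    norm = np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8
    cos = memory @ key / norm                  # 每个记忆槽与读取键的余弦相似度
    w = np.exp(beta * cos)
    w /= w.sum()                               # 基于内容的读取权重
    return w @ memory                          # 读取向量

rng = np.random.default_rng(5)
M = rng.normal(size=(8, 4))
print(content_read(M, M[2] + 0.01 * rng.normal(size=4)))   # 应读出接近第2个槽的内容
```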
[58] Improving neural networks by preventing co-adaptation of feature detectors (2012)
Abstract: When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This “overfitting” is greatly reduced by randomly omitting half of the feature detectors on each training case. This prevents complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors. Instead, each neuron learns to detect a feature that is generally helpful for producing the correct answer given the combinatorially large variety of internal contexts in which it must operate. Random “dropout” gives big improvements on many benchmark tasks and sets new records for speech and object recognition.
摘要: 当在较小的训练集上训练大型前馈神经网络时,它通常在保留的测试数据上表现较差。通过在每个训练样本上随机略去一半的特征检测器,可以大大减少这种"过拟合"。这可以防止复杂的共适应,即某个特征检测器只有在若干其他特定特征检测器存在的情况下才有用。取而代之的是,每个神经元学会检测一种在其必须工作的组合数量巨大的各种内部环境中普遍有助于给出正确答案的特征。随机"丢弃"(dropout)在许多基准任务上带来了很大改进,并为语音和物体识别创造了新纪录。
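"在每个训练样本上随机略去一半特征检测器"可以用一个伯努利掩码实现;推理时不再丢弃(常用的 inverted dropout 在训练时直接除以保留概率)。以下 NumPy 片段为自拟示意实现,丢弃率与随机种子均为假设。
```python
import numpy as np

def dropout(h, p_drop=0.5, train=True, rng=np.random.default_rng(6)):
    """训练时随机置零部分激活并按保留概率缩放(inverted dropout); 推理时原样返回。"""
    if not train:
        return h
    mask = rng.random(h.shape) >= p_drop     # 以 1-p_drop 的概率保留每个单元
    return h * mask / (1.0 - p_drop)

h = np.ones(10)
print(dropout(h))               # 约一半元素被置零, 其余被放大为 2.0
print(dropout(h, train=False))  # 推理时不变
```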
[59] Inceptionism: Going deeper into neural networks, google research blog (2015)
Abstract: “Artificial Neural Networks have spurred remarkable recent progress in image classification and speech recognition. But even though these are very useful tools based on well-known mathematical methods, we actually understand surprisingly little of why certain models work and others don't. So let's take a look at some simple techniques for peeking inside these networks. We train an artificial neural network by showing it millions of training examples and gradually adjusting the network parameters until it gives the classifications we want. The network typically consists of 10-30 stacked layers of artificial neurons. Each image is fed into the input layer, which then talks to the next layer, until eventually the “output” layer is reached. The network's “answer” comes from this final output layer. One of the challenges of neural networks is understanding what exactly goes on at each layer. We know that after training, each layer progressively extracts higher and higher-level features of the image, until the final layer essentially makes a decision on what the image shows.”
摘要: 人工神经网络近来在图像分类和语音识别方面推动了显著进展。但是,尽管它们是基于众所周知的数学方法的非常有用的工具,我们实际上对某些模型为何有效而另一些却无效知之甚少。所以让我们来看看几种窥视这些网络内部的简单技术。我们通过向人工神经网络展示数百万个训练样例并逐步调整网络参数,直到它给出我们想要的分类,来训练它。这种网络通常由10-30层堆叠的人工神经元组成。每幅图像被送入输入层,输入层再与下一层"对话",直到最终到达"输出"层;网络的"答案"来自最后这个输出层。神经网络的挑战之一是理解每一层究竟发生了什么。我们知道,训练之后,每一层会逐步提取图像中越来越高层次的特征,直到最后一层基本上就图像内容做出判断。
[60] Influência do glyphosate e 2,4-D sobre o desenvolvimento inicial de espécies florestais (2009)
Abstract: The chemical management of weeds in reforestation areas has been carried out increasingly due to its speed, economy and control. Glyphosate has been widely used in forest ecosystems, due to the absence of a residual effect in the soil. In order to increase its efficiency, it is often mixed with 2,4-D. However, both herbicides are not selective for most forest species, with particular attention to the possibility of the risk of an accidental drift in the areas in the planted forest areas. The purpose of this study was to evaluate the effects of glyphosate and 2,4-D alone and in combination on plants of Schizolobium amazonicum and Ceiba pentandra. Seven treatments were applied: control, glyphosate 180 and 360 g e.a. ha-1, 2,4-D amina 335 and 670 g e.a. ha-1, glyphosate + 2,4-D amina 90 + 167 and 180 + 335 g e.a. ha-1. Phytotoxicity, plant height and relative number of leaves were evaluated at 7, 14, 21 and 28 days after application. In the last evaluation, dry mass and length of roots were also evaluated. Both species showed tolerance to glyphosate at 180 g e.a. ha-1. C. pentandra was more sensitive to herbicides than Schizolobium amazonicum. During the evaluations, the phytotoxicity was more evident, with a reduction in height and the number of leaves. For both species, the amount of dry weight and root length were reduced by the action of herbicides, both alone and mixed. Despite the tolerance of both species to the lower dosis of glyphosate, both herbicides caused considerable damage; suggesting that the application be done with a jet directed at the target in order to avoid damage to the plant development.
摘要: 由于速度快、成本低且防治效果好,造林地区越来越多地采用化学方法治理杂草。草甘膦由于在土壤中没有残留效应,已被广泛用于森林生态系统;为提高效率,常将其与2,4-D混用。然而,这两种除草剂对大多数森林树种都不具选择性,尤其需要注意在人工林区发生意外药液漂移的风险。本研究旨在评估草甘膦和2,4-D单独及混合施用对Schizolobium amazonicum和吉贝(Ceiba pentandra)植株的影响。共设七个处理:对照,草甘膦180和360 g e.a. ha-1,2,4-D胺盐335和670 g e.a. ha-1,草甘膦+2,4-D胺盐90+167和180+335 g e.a. ha-1。在施药后第7、14、21和28天评估药害、株高和相对叶片数;在最后一次评估中还测定了干物质量和根长。两个树种在180 g e.a. ha-1剂量下均表现出对草甘膦的耐受性。Ceiba pentandra对除草剂比Schizolobium amazonicum更敏感。随着评估推进,药害愈发明显,株高和叶片数下降。对这两个树种,除草剂无论单用还是混用都会降低干重和根长。尽管两个树种都能耐受较低剂量的草甘膦,但两种除草剂均造成相当大的损害;建议施药时将喷流直接对准目标,以避免损害植株的生长发育。
[61] Instance-Aware Semantic Segmentation via Multi-task Network Cascades (2016)
Abstract: Semantic segmentation research has recently witnessed rapid progress, but many leading methods are unable to identify object instances. In this paper, we present Multitask Network Cascades for instance-aware semantic segmentation. Our model consists of three networks, respectively differentiating instances, estimating masks, and categorizing objects. These networks form a cascaded structure, and are designed to share their convolutional features. We develop an algorithm for the nontrivial end-to-end training of this causal, cascaded structure. Our solution is a clean, single-step training framework and can be generalized to cascades that have more stages. We demonstrate state-of-the-art instance-aware semantic segmentation accuracy on PASCAL VOC. Meanwhile, our method takes only 360ms testing an image using VGG-16, which is two orders of magnitude faster than previous systems for this challenging problem. As a by product, our method also achieves compelling object detection results which surpass the competitive Fast/Faster R-CNN systems. The method described in this paper is the foundation of our submissions to the MS COCO 2015 segmentation competition, where we won the 1st place.
摘要: 语义分割研究最近见证了快速的进步,但是许多领先的方法无法识别对象实例。在本文中,我们提出了用于实例感知语义分割的多任务网络级联。我们的模型由三个网络组成,分别区分实例,估计蒙版和对对象进行分类。这些网络形成了一个层叠的结构,旨在共享它们的卷积特征。我们开发了一种用于因果级联结构的非平凡的端到端训练的算法。我们的解决方案是一个干净的单步培训框架,可以推广到具有更多阶段的级联。我们在PASCAL VOC上展示了最新的实例感知语义分割精度。同时,我们的方法仅需360毫秒即可使用VGG-16测试图像,因此在解决这一难题方面比以前的系统快两个数量级。作为副产品,我们的方法还获得了令人信服的目标检测结果,超过了竞争性的快速/快速R-CNN系统。本文介绍的方法是我们向2015年MS COCO细分比赛提交的作品的基础,我们赢得了第一名。
下载地址 | 返回目录 | [10.1109/CVPR.2016.343]
[62] Instance-sensitive fully convolutional networks (2016)
Abstract: Fully convolutional networks (FCNs) have been proven very successful for semantic segmentation, but the FCN outputs are unaware of object instances. In this paper, we develop FCNs that are capable of proposing instance-level segment candidates. In contrast to the previous FCN that generates one score map, our FCN is designed to compute a small set of instance-sensitive score maps, each of which is the outcome of a pixel-wise classifier of a relative position to instances. On top of these instance-sensitive score maps, a simple assembling module is able to output instance candidate at each position. In contrast to the recent DeepMask method for segmenting instances, our method does not have any high-dimensional layer related to the mask resolution, but instead exploits image local coherence for estimating instances. We present competitive results of instance segment proposal on both PASCAL VOC and MS COCO.
摘要: 全卷积网络(FCN)已被证明在语义分割方面非常成功,但是FCN的输出并不了解对象实例。在本文中,我们开发了能够提出实例级细分候选者的FCN。与之前的生成一个得分图的FCN相比,我们的FCN旨在计算少量实例敏感得分图,每个实例图都是实例相对位置的按像素分类器的结果。在这些实例敏感得分图的顶部,一个简单的组装模块能够在每个位置输出实例候选。与最近的用于分割实例的DeepMask方法相反,我们的方法没有与遮罩分辨率相关的任何高维图层,而是利用图像局部相干性来估计实例。我们在PASCAL VOC和MS COCO上展示了实例细分提案的竞争结果。
下载地址 | 返回目录 | [10.1007/978-3-319-46466-4_32]
[63] Joint learning of words and meaning representations for open-text semantic parsing (2012)
Abstract: Open-text semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR-a formal representation of its sense). Unfortunately, large scale systems cannot be easily machine-learned due to a lack of directly supervised data. We propose a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70, 000 words mapped to more than 40, 000 entities) thanks to a training scheme that combines learning from knowledge bases (e.g. WordNet) with learning from raw text. The model jointly learns representations of words, entities and MRs via a multi-task training process operating on these diverse sources of data. Hence, the system ends up providing methods for knowledge acquisition and wordsense disambiguation within the context of semantic parsing in a single elegant framework. Experiments on these various tasks indicate the promise of the approach.
摘要: 开放文本语义解析器旨在通过推断相应的意义表示(MR,即语义的形式化表示)来解释自然语言中的任意语句。不幸的是,由于缺乏直接监督的数据,大规模系统难以通过机器学习获得。我们提出了一种方法,得益于一种把从知识库(如WordNet)学习与从原始文本学习相结合的训练方案,它能学会为范围很广的文本分配MR(使用一个包含70,000多个单词、映射到40,000多个实体的词典)。该模型通过在这些不同数据源上运行的多任务训练过程,联合学习单词、实体和MR的表示。因此,该系统最终在一个简洁优雅的框架内,同时提供了语义解析情境下的知识获取与词义消歧方法。在这些不同任务上的实验表明了该方法的前景。
[64] Layer Normalization (2016)
Abstract: Training state-of-the-art, deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the distribution of the summed input to a neuron over a mini-batch of training cases to compute a mean and variance which are then used to normalize the summed input to that neuron on each training case. This significantly reduces the training time in feed-forward neural networks. However, the effect of batch normalization is dependent on the mini-batch size and it is not obvious how to apply it to recurrent neural networks. In this paper, we transpose batch normalization into layer normalization by computing the mean and variance used for normalization from all of the summed inputs to the neurons in a layer on a single training case. Like batch normalization, we also give each neuron its own adaptive bias and gain which are applied after the normalization but before the non-linearity. Unlike batch normalization, layer normalization performs exactly the same computation at training and test times. It is also straightforward to apply to recurrent neural networks by computing the normalization statistics separately at each time step. Layer normalization is very effective at stabilizing the hidden state dynamics in recurrent networks. Empirically, we show that layer normalization can substantially reduce the training time compared with previously published techniques.
摘要: 培训最先进的深度神经网络在计算上很昂贵。减少训练时间的一种方法是使神经元的活动正常化。最近引入的一种称为批归一化的技术使用训练案例的小批量上神经元的总输入分布来计算均值和方差,然后使用均值和方差对每个训练案例中对该神经元的总输入进行归一化。这大大减少了前馈神经网络的训练时间。但是,批处理规范化的效果取决于微型批处理的大小,如何将其应用于递归神经网络尚不清楚。在本文中,我们通过在单个训练案例上计算从层的所有总输入到神经元的归一化的均值和方差,将批量归一化转换为层归一化。像批次归一化一样,我们还为每个神经元提供了自己的自适应偏差和增益,这些偏差和增益将在归一化之后但在非线性之前应用。与批处理归一化不同,层归一化在训练和测试时间执行完全相同的计算。通过在每个时间步分别计算归一化统计量,将其应用于递归神经网络也很简单。层归一化在稳定循环网络中的隐藏状态动态方面非常有效。从经验上讲,我们表明与以前发布的技术相比,层归一化可以大大减少训练时间。
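层归一化对单个样本中同一层所有单元的汇总输入计算均值与方差,再做归一化并施加逐单元的增益和偏置;它不依赖 mini-batch,因此训练和测试时的计算完全相同。以下 NumPy 片段给出该公式的直接示意实现(增益/偏置的初始化为常见做法,属自拟假设)。
```python
import numpy as np

def layer_norm(a, gain, bias, eps=1e-5):
    """a: 某一层的汇总输入 (H,); 在该层所有单元上求均值与标准差后归一化。"""
    mu = a.mean()
    sigma = a.std()
    return gain * (a - mu) / (sigma + eps) + bias

H = 6
a = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
print(layer_norm(a, gain=np.ones(H), bias=np.zeros(H)))   # 输出均值≈0, 方差≈1
```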
[65] Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection (2018)
Abstract: We describe a learning-based approach to hand-eye coordination for robotic grasping from monocular images. To learn hand-eye coordination for grasping, we trained a large convolutional neural network to predict the probability that task-space motion of the gripper will result in successful grasps, using only monocular camera images independent of camera calibration or the current robot pose. This requires the network to observe the spatial relationship between the gripper and objects in the scene, thus learning hand-eye coordination. We then use this network to servo the gripper in real time to achieve successful grasps. We describe two large-scale experiments that we conducted on two separate robotic platforms. In the first experiment, about 800,000 grasp attempts were collected over the course of two months, using between 6 and 14 robotic manipulators at any given time, with differences in camera placement and gripper wear and tear. In the second experiment, we used a different robotic platform and 8 robots to collect a dataset consisting of over 900,000 grasp attempts. The second robotic platform was used to test transfer between robots, and the degree to which data from a different set of robots can be used to aid learning. Our experimental results demonstrate that our approach achieves effective real-time control, can successfully grasp novel objects, and corrects mistakes by continuous servoing. Our transfer experiment also illustrates that data from different robots can be combined to learn more reliable and effective grasping.
摘要: 我们描述了一种基于学习的手眼协调方法,用于从单眼图像中抓取机器人。为了学习手眼协调的能力,我们训练了一个大型的卷积神经网络来预测抓爪的任务空间运动将成功抓握的可能性,仅使用独立于摄像机校准或当前机器人姿势的单目摄像机图像。这要求网络观察场景中抓取器和物体之间的空间关系,从而学习手眼协调。然后,我们使用此网络对抓具进行实时伺服,以实现成功的抓取。我们描述了在两个单独的机器人平台上进行的两个大规模实验。在第一个实验中,在两个月的过程中,在任何给定时间使用6到14个机器人操纵器,收集了大约80万次抓取尝试,相机放置和抓爪的磨损有所不同。在第二个实验中,我们使用了不同的机器人平台和8个机器人来收集由900,000次抓取尝试组成的数据集。第二个机器人平台用于测试机器人之间的传输,以及来自不同机器人集的数据可用于帮助学习的程度。我们的实验结果表明,我们的方法实现了有效的实时控制,可以成功地抓住新颖的物体,并通过连续伺服来纠正错误。我们的传输实验还表明,可以将来自不同机器人的数据进行组合,以学习更可靠,更有效的掌握方法。
下载地址 | 返回目录 | [10.1177/0278364917710318]
[66] Learning phrase representations using RNN encoder-decoder for statistical machine translation (2014)
Abstract: In this paper, we propose a novel neural network model called RNN Encoder- Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixedlength vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.
摘要: 在本文中,我们提出了一种新的神经网络模型,称为RNN编码器-解码器,它由两个循环神经网络(RNN)组成。一个RNN将符号序列编码为固定长度的矢量表示形式,另一个RNN将表示形式解码为另一符号序列。共同训练提出模型的编码器和解码器,以在给定源序列的情况下最大化目标序列的条件概率。通过经验发现,通过使用RNN编码器-解码器计算的短语对的条件概率作为现有对数线性模型的附加功能,可以提高统计机器翻译系统的性能。定性地,我们表明所提出的模型学习了语言短语的语义和句法上有意义的表示。
下载地址 | 返回目录 | [10.3115/v1/d14-1179]
[67] Learning to learn by gradient descent by gradient descent (2016)
Abstract: The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimization algorithms are still designed by hand. In this paper we show how the design of an optimization algorithm can be cast as a learning problem, allowing the algorithm to learn to exploit structure in the problems of interest in an automatic way. Our learned algorithms, implemented by LSTMs, outperform generic, hand-designed competitors on the tasks for which they are trained, and also generalize well to new tasks with similar structure. We demonstrate this on a number of tasks, including simple convex problems, training neural networks, and styling images with neural art.
摘要: 在机器学习中,从手工设计的特征转向学习得到的特征已经取得了巨大成功。尽管如此,优化算法仍然是手工设计的。在本文中,我们展示了如何把优化算法的设计本身表述为一个学习问题,让算法学会自动利用所关注问题中的结构。我们用LSTM实现的学习型优化器,在其训练所针对的任务上优于通用的手工设计方法,并且能很好地泛化到结构相似的新任务。我们在多个任务上进行了演示,包括简单的凸优化问题、神经网络训练以及用神经艺术对图像进行风格化。
[68] Learning to navigate in complex environments (2019)
Abstract: Learning to navigate in complex environments with dynamic elements is an important milestone in developing AI agents. In this work we formulate the navigation question as a reinforcement learning problem and show that data efficiency and task performance can be dramatically improved by relying on additional auxiliary tasks leveraging multimodal sensory inputs. In particular we consider jointly learning the goal-driven reinforcement learning problem with auxiliary depth prediction and loop closure classification tasks. This approach can learn to navigate from raw sensory input in complicated 3D mazes, approaching human-level performance even under conditions where the goal location changes frequently. We provide detailed analysis of the agent behaviour1, its ability to localise, and its network activity dynamics, showing that the agent implicitly learns key navigation abilities.
摘要: 学习使用动态元素在复杂环境中导航是开发AI代理的重要里程碑。在这项工作中,我们将导航问题表述为强化学习问题,并表明通过依靠利用多模式感官输入的其他辅助任务,可以显着提高数据效率和任务性能。特别是,我们考虑与辅助深度预测和循环闭合分类任务一起学习目标驱动的强化学习问题。这种方法可以学习在复杂的3D迷宫中从原始的感觉输入中导航,甚至在目标位置频繁变化的情况下也可以达到人类水平的性能。我们提供了对代理行为1,其定位能力及其网络活动动态的详细分析,表明该代理隐式地学习了关键的导航能力。
[69] Learning to segment object candidates (2015)
Abstract: Recent object detection systems rely on two critical steps: (1) a set of object proposals is predicted as efficiently as possible, and (2) this set of candidate proposals is then passed to an object classifier. Such approaches have been shown they can be fast, while achieving the state of the art in detection performance.In this paper, we propose a new way to generate object proposals, introducing an approach based on a discriminative convolutional network. Our model is trained jointly with two objectives: given an image patch, the first part of the system outputs a class-agnostic segmentation mask, while the second part of the system outputs the likelihood of the patch being centered on a full object. At test time, the model is efficiently applied on the whole test image and generates a set of segmentation masks, each of them being assigned with a corresponding object likelihood score. We show that our model yields significant improvements over state-of-theart object proposal algorithms.In particular, compared to previous approaches, our model obtains substantially higher object recall using fewer proposals. We also show that our model is able to generalize to unseen categories it has not seen during training. Unlike all previous approaches for generating object masks, we do not rely on edges, superpixels, or any other form of low-level segmentation.
摘要: 最近的目标检测系统依赖两个关键步骤:(1)尽可能高效地预测一组候选目标区域;(2)再将这组候选区域传递给目标分类器。此类方法已被证明既能保持较快速度,又能达到最先进的检测性能。本文提出了一种生成候选目标区域的新方法,引入了一种基于判别式卷积网络的途径。我们的模型以两个目标联合训练:给定一个图像块,系统的第一部分输出与类别无关的分割掩码,第二部分输出该图像块以完整物体为中心的可能性。测试时,模型被高效地应用于整幅测试图像,生成一组分割掩码,每个掩码都附带相应的物体似然得分。我们表明,与最新的候选区域生成算法相比,我们的模型带来了显著改进;特别是,与以往方法相比,我们的模型用更少的候选区域获得了明显更高的目标召回率。我们还表明,模型能够推广到训练中未见过的类别。与以往所有生成物体掩码的方法不同,我们不依赖边缘、超像素或任何其他形式的低层分割。
[70] Learning to track at 100 FPS with deep regression networks (2016)
Abstract: Machine learning techniques are often used in computer vision due to their ability to leverage large amounts of training data to improve performance. Unfortunately, most generic object trackers are still trained from scratch online and do not benefit from the large number of videos that are readily available for offline training. We propose a method for offline training of neural networks that can track novel objects at test-time at 100 fps. Our tracker is significantly faster than previous methods that use neural networks for tracking, which are typically very slow to run and not practical for real-time applications. Our tracker uses a simple feed-forward network with no online training required. The tracker learns a generic relationship between object motion and appearance and can be used to track novel objects that do not appear in the training set. We test our network on a standard tracking benchmark to demonstrate our tracker's state-of-the-art performance. Further, our performance improves as we add more videos to our offline training set. To the best of our knowledge, our tracker (available at http://davheld.github.io/GOTURN/GOTURN.html) is the first neural network tracker that learns to track generic objects at 100 fps.
摘要: 机器学习技术因能够利用大量训练数据提升性能而被广泛用于计算机视觉。然而,大多数通用目标跟踪器仍然在线从头训练,无法受益于大量可用于离线训练的视频。我们提出一种神经网络离线训练方法,能够在测试时以100 fps的速度跟踪新目标。我们的跟踪器显著快于以往基于神经网络的跟踪方法,后者通常运行很慢,难以用于实时应用。我们的跟踪器使用简单的前馈网络,不需要在线训练。它学习到物体运动与外观之间的一般关系,可用于跟踪训练集中未出现过的新目标。我们在标准跟踪基准上测试了该网络,展示了最先进的性能;并且随着离线训练集中视频数量的增加,性能还会进一步提升。据我们所知,我们的跟踪器(可在 http://davheld.github.io/GOTURN/GOTURN.html 获取)是第一个能以100 fps跟踪通用目标的神经网络跟踪器。
下载地址 | 返回目录 | [10.1007/978-3-319-46448-0_45]
[71] Lifelong machine learning systems: Beyond learning algorithms (2013)
Abstract: Lifelong Machine Learning, or LML, considers systems that can learn many tasks from one or more domains over its lifetime. The goal is to sequentially retain learned knowledge and to selectively transfer that knowledge when learning a new task so as to develop more accurate hypotheses or policies. Following a review of prior work on LML, we propose that it is now appropriate for the AI community to move beyond learning algorithms to more seriously consider the nature of systems that are capable of learning over a lifetime. Reasons for our position are presented and potential counter-arguments are discussed. The remainder of the paper contributes by defining LML, presenting a reference framework that considers all forms of machine learning, and listing several key challenges for and benefits from LML research. We conclude with ideas for next steps to advance the field. © 2013, Association for the Advancement of Artificial Intelligence.
摘要: 终身机器学习(LML)是指可以在其生命周期内从一个或多个领域学习许多任务的系统。目标是依次保留所学知识,并在学习新任务时有选择地迁移这些知识,以便形成更准确的假设或策略。在回顾了有关LML的先前工作之后,我们建议AI社区现在应当超越学习算法,更认真地考虑能够终身学习的系统的本质。我们阐述了持此立场的原因,并讨论了可能的反对意见。本文的其余部分通过定义LML、提出一个涵盖所有形式机器学习的参考框架,以及列出LML研究面临的若干关键挑战与收益做出了贡献。最后,我们提出了推进该领域的下一步行动的想法。© 2013,人工智能促进协会。
[72] Lifted rule injection for relation embeddings (2016)
Abstract: Methods based on representation learning currently hold the state-of-the-art in many natural language processing and knowledge base inference tasks. Yet, a major challenge is how to efficiently incorporate commonsense knowledge into such models. A recent approach regularizes relation and entity representations by propositionalization of first-order logic rules. However, propositionalization does not scale beyond domains with only few entities and rules. In this paper we present a highly efficient method for incorporating implication rules into distributed representations for automated knowledge base construction. We map entity-tuple embeddings into an approximately Boolean space and encourage a partial ordering over relation embeddings based on implication rules mined from WordNet. Surprisingly, we find that the strong restriction of the entity-tuple embedding space does not hurt the expressiveness of the model and even acts as a regularizer that improves generalization. By incorporating few commonsense rules, we achieve an increase of 2 percentage points mean average precision over a matrix factorization baseline, while observing a negligible increase in runtime.
摘要: 基于表示学习的方法目前在许多自然语言处理和知识库推理任务中保持着最先进的水平。然而,一个主要的挑战是如何有效地将常识知识纳入此类模型。最近的一种方法通过一阶逻辑规则的命题化来正则化关系和实体表示。但是,命题化无法扩展到实体和规则数量较多的领域。在本文中,我们提出了一种高效的方法,将蕴涵规则融入分布式表示中,用于自动知识库构建。我们将实体元组嵌入映射到一个近似布尔空间中,并基于从WordNet中挖掘的蕴涵规则,鼓励关系嵌入之间形成偏序关系。出乎意料的是,我们发现对实体元组嵌入空间的强约束并不会损害模型的表达能力,反而可以充当提升泛化能力的正则化器。通过引入少量常识规则,我们在矩阵分解基线之上将平均精度均值(mean average precision)提高了2个百分点,而运行时间的增加可以忽略不计。
下载地址 | 返回目录 | [10.18653/v1/d16-1146]
[73] Literature after 9/11 (2017)
Abstract: “In a celebrated essay published forty years before the attacks of September 11, Philip Roth bemoaned the inadequacy of literary language against the density of lived experience. The writer, Roth insisted, could no longer compete with the surrealism of late modernity, which mocks our meager imaginations: “The actuality is continually outdoing our talents, and the culture tosses up figures almost daily that are the envy of any novelist.” Such apprehensions were vindicated and amplified by the 2001 attacks in New York and Washington, DC. The spectacular explosions, the web of data around the terrorist perpetrators, and the military fallout of the attacks didnt yield easily to the synthesis of literary transcription. The reality of America at the turn of the millennium seemed to overwhelm any fiction that sought to capture it. Circuits of new information and the sheer volume of signal traffic defied literary dramatization. To surmount the problem of a reality that both overwhelms and eludes its witnesses, in the years following the attacks writers aimed to convey a widely felt sense of limit or failure. Coming up short in its ability to respond to trauma, yet trying to react nonetheless, was fictions most compelling response to the cataclysm of the collapsing towers. The literature of September 11 gauged infinitesimal shifts in the national psyche from 2001 through 2010, across what may be called the 9/11 decade. As a genre, literature in this vein often questioned its own necessity, as well as its ties to a place and time, to an aftermath. In this sense, the terrorist attacks intensified ongoing negotiations of what literature is permitted or expected to address, and with what degree of fidelity or earnestness. Inflated expectations about the great American 9/11 novel enhanced public awareness of how writers reflected on the events, and exactly at what remove from their occurrence. Ruptures were diagnosed in the literary careers of several established novelists, among them Don DeLillo, Philip Roth, John Updike, Jay McInerney, and David Foster Wallace, to name some of those interested in addressing the attacks explicitly; but also Jonathan Franzen, Paul Auster, and Dave Eggers, who favored a more diffuse, ambient approach to terrors aftermath. This lineup strikingly marshals the white male canon of late modern American writing, which begs the question of diversity and representation in 9/11 authorship.”
摘要: “在9月11日袭击前四十年发表的一篇著名的论文中,菲利普·罗斯(Philip Roth)哀叹文学语言对生活经验密度的不足。罗斯坚称,作家再也无法与后期现代主义的超现实主义竞争,后者嘲讽了我们微不足道的想象力:“现实不断超越我们的才华,这种文化几乎每天都在吸引任何小说家羡慕的人物。” 2001年在纽约和华盛顿特区发生的袭击,证明了这种忧虑,并加剧了这种担忧,壮观的爆炸,恐怖分子施暴者周围的数据网络以及袭击的军事影响对文学转录的合成并不容易。千年之交,美国的现实似乎压倒了任何试图抓住它的小说,新信息的循环和信号量的庞大反驳了文学戏剧化,以解决一个现实问题,即既压倒又躲避其证人的问题在袭击发生后的数年中,作家们试图传达一种广泛的极限感或失败感,其对创伤的反应能力虽然不足,但仍试图做出反应,这是小说对坍塌的塔楼的灾难最有说服力的反应。 9月11日的文献评估了从2001年到2010年(可能称为9/11十年)国民心态的微小变化。这种类型的文学常常质疑其自身的必要性,以及它与地点和时间,后果的联系。从这个意义上讲,恐怖袭击加剧了关于允许或预期处理哪些文献以及保真度或诚恳程度的正在进行的谈判。人们对美国9/11伟大小说的过高期望增强了公众对作家对事件的思考方式以及事件发生的根源的意识。几位知名小说家的文学生涯被诊断出破裂,其中包括唐·德利洛,菲利普·罗斯,约翰·厄普代克,杰伊·麦金尼和大卫·福斯特·华莱士,并列举了一些有兴趣明确应对袭击的人。还有乔纳森·弗兰森(Jonathan Franzen),保罗·奥斯特(Paul Auster)和戴夫·埃格斯(Dave Eggers),他们赞成采用更分散,更环境的方式来处理恐怖后果。该阵容惊人地封送了近代美国现代写作中的白人男性佳能,这引出了9/11作者身份的多样性和代表性问题。”
下载地址 | 返回目录 | [10.1017/9781316569290.011]
[74] Long-Term Recurrent Convolutional Networks for Visual Recognition and Description (2017)
Abstract: “Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent are effective for tasks involving sequences, visual and otherwise. We describe a class of recurrent convolutional architectures which is end-to-end trainable and suitable for large-scale visual understanding tasks, and demonstrate the value of these models for activity recognition, image captioning, and video description. In contrast to previous models which assume a fixed visual representation or perform simple temporal averaging for sequential processing, recurrent convolutional models are doubly deep in that they learn compositional representations in space and time. Learning long-term dependencies is possible when nonlinearities are incorporated into the network state updates. Differentiable recurrent models are appealing in that they can directly map variable-length inputs (e.g., videos) to variable-length outputs (e.g., natural language text) and can model complex temporal dynamics; yet they can be optimized with backpropagation. Our recurrent sequence models are directly connected to modern visual convolutional network models and can be jointly trained to learn temporal dynamics and convolutional perceptual representations. Our results show that such models have distinct advantages over state-of-the-art models for recognition or generation which are separately defined or optimized.”
摘要: 基于深度卷积网络的模型主导了近期的图像理解任务;我们研究同时具有循环结构的模型对于涉及序列(视觉或其他形式)的任务是否有效。我们描述了一类可端到端训练、适用于大规模视觉理解任务的循环卷积体系结构,并展示了这些模型在活动识别、图像描述生成和视频描述中的价值。以往的模型要么假设固定的视觉表示,要么对序列处理仅做简单的时间平均;与之不同,循环卷积模型是"双重深度"的,因为它们在空间和时间上学习组合式的表示。当非线性被纳入网络状态更新时,就可以学习长期依赖关系。可微分的循环模型的吸引力在于,它们可以直接将可变长度的输入(例如视频)映射到可变长度的输出(例如自然语言文本),并能对复杂的时间动态进行建模,同时仍可通过反向传播进行优化。我们的循环序列模型直接连接到现代视觉卷积网络模型,可以联合训练以学习时间动态和卷积感知表示。我们的结果表明,与分别定义或单独优化识别与生成的最新模型相比,此类模型具有明显的优势。
下载地址 | 返回目录 | [10.1109/TPAMI.2016.2599174]
[75] Low-Shot Visual Recognition by Shrinking and Hallucinating Features (2017)
Abstract: Low-shot visual learning-the ability to recognize novel object categories from very few examples-is a hallmark of human visual intelligence. Existing machine learning approaches fail to generalize in the same way. To make progress on this foundational problem, we present a low-shot learning benchmark on complex images that mimics challenges faced by recognition systems in the wild. We then propose (1) representation regularization techniques, and (2) techniques to hallucinate additional training examples for data-starved classes. Together, our methods improve the effectiveness of convolutional networks in low-shot learning, improving the one-shot accuracy on novel classes by 2.3× on the challenging ImageNet dataset.
摘要: 小样本(low-shot)视觉学习,即仅凭极少数示例识别新物体类别的能力,是人类视觉智能的标志。现有的机器学习方法无法以同样的方式泛化。为了在这一基础性问题上取得进展,我们提出了一个基于复杂图像的小样本学习基准,它模拟了识别系统在真实环境中面临的挑战。然后,我们提出(1)表示正则化技术,以及(2)为数据匮乏的类别"幻化"额外训练样本的技术。两者结合,我们的方法提高了卷积网络在小样本学习中的有效性,在具有挑战性的ImageNet数据集上,将新类别的单样本(one-shot)准确率提高了2.3倍。
下载地址 | 返回目录 | [10.1109/ICCV.2017.328]
[76] Mask R-CNN (2020)
Abstract: We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. The method, called Mask R-CNN, extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. Mask R-CNN is simple to train and adds only a small overhead to Faster R-CNN, running at 5 fps. Moreover, Mask R-CNN is easy to generalize to other tasks, e.g., allowing us to estimate human poses in the same framework. We show top results in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection. Without bells and whistles, Mask R-CNN outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners. We hope our simple and effective approach will serve as a solid baseline and help ease future research in instance-level recognition. Code has been made available at: https://github.com/facebookresearch/Detectron.
摘要: 我们为对象实例分割提供了一个概念上简单,灵活且通用的框架。我们的方法有效地检测图像中的对象,同时为每个实例生成高质量的分割蒙版。该方法称为“遮罩R-CNN”,它通过添加一个分支与现有的用于边界框识别的分支并行来预测对象蒙版,从而扩展了Faster R-CNN。 Mask R-CNN易于训练,并且为Faster R-CNN仅增加了很小的开销,以5 fps的速度运行。此外,Mask R-CNN易于推广到其他任务,例如,允许我们在同一框架中估算人体姿势。我们在COCO挑战套件的所有三个轨迹中均显示了最佳结果,包括实例细分,边界框对象检测和人员关键点检测。 Mask R-CNN不需花哨,在所有任务上都胜过所有现有的单一模型条目,包括2016年COCO挑战赛获奖者。我们希望我们简单有效的方法可以作为坚实的基准,并有助于简化实例级识别的未来研究。代码已在以下位置提供:https://github.com/facebookresearch/Detectron。
下载地址 | 返回目录 | [10.1109/TPAMI.2018.2844175]
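作为对 [76] 中"检测的同时输出实例掩码"这一接口的粗略示意,下面的草图调用了 torchvision 自带的 Mask R-CNN 实现(模型名、输出字段和输入格式按较新版本 torchvision 的惯例书写,具体以所安装版本的文档为准,并非论文官方代码):

```python
import torch
import torchvision

# Pretrained Mask R-CNN with a ResNet-50 FPN backbone (COCO classes).
# Newer torchvision uses weights="DEFAULT"; older releases use pretrained=True.
model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

# Input: a list of 3xHxW float tensors with values in [0, 1].
image = torch.rand(3, 480, 640)
with torch.no_grad():
    outputs = model([image])

# One dict per image: "boxes", "labels", "scores" and per-instance soft "masks".
pred = outputs[0]
keep = pred["scores"] > 0.5
boxes = pred["boxes"][keep]          # (N, 4) boxes in xyxy pixel coordinates
masks = pred["masks"][keep] > 0.5    # (N, 1, H, W) binarised instance masks
print(boxes.shape, masks.shape)
```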
[77] Mastering the game of Go with deep neural networks and tree search (2016)
Abstract: “The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses value networks to evaluate board positions and policy networks to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8{\%} winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.”
摘要: 围棋因其巨大的搜索空间以及评估棋盘局面和落子的困难,长期以来被视为人工智能经典游戏中最具挑战性的一种。在这里,我们介绍一种新的计算机围棋方法,它使用价值网络评估棋盘局面,并使用策略网络选择落子。这些深度神经网络通过一种新颖的组合方式进行训练:对人类专家棋局的监督学习,以及自我对弈的强化学习。在不进行任何前瞻搜索的情况下,这些神经网络的棋力就达到了最先进的蒙特卡洛树搜索程序的水平,而后者需要模拟数千局随机自我对弈。我们还引入了一种将蒙特卡洛模拟与价值网络和策略网络相结合的新搜索算法。借助该搜索算法,我们的程序AlphaGo对其他围棋程序取得了99.8%的胜率,并以5比0击败了欧洲围棋冠军。这是计算机程序首次在全尺寸围棋对局中战胜人类职业棋手,而这一成就此前被认为至少还需要十年。
下载地址 | 返回目录 | [10.1038/nature16961]
[78] Matching networks for one shot learning (2016)
Abstract: Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for learning new concepts rapidly from little data. In this work, we employ ideas from metric learning based on deep neural features and from recent advances that augment neural networks with external memories. Our framework learns a network that maps a small labelled support set and an unlabelled example to its label, obviating the need for fine-tuning to adapt to new class types. We then define one-shot learning problems on vision (using Omniglot, ImageNet) and language tasks. Our algorithm improves one-shot accuracy on ImageNet from 87.6{\%} to 93.2{\%} and from 88.0{\%} to 93.8{\%} on Omniglot compared to competing approaches. We also demonstrate the usefulness of the same model on language modeling by introducing a one-shot task on the Penn Treebank.
摘要: 从少量示例中学习仍然是机器学习中的关键挑战。尽管在视觉和语言等重要领域最近取得了进展,标准的有监督深度学习范式仍无法为从少量数据中快速学习新概念提供令人满意的解决方案。在这项工作中,我们借鉴了基于深度神经特征的度量学习思想,以及利用外部记忆增强神经网络的最新进展。我们的框架学习一个网络,将一个带标签的小型支持集和一个无标签样本映射到其标签,从而无需通过微调来适应新的类别类型。然后,我们在视觉(使用Omniglot、ImageNet)和语言任务上定义单样本学习问题。与其他方法相比,我们的算法将ImageNet上的单样本准确率从87.6%提高到93.2%,将Omniglot上的单样本准确率从88.0%提高到93.8%。我们还通过在Penn Treebank上引入一个单样本任务,证明了同一模型在语言建模中的有用性。
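[78] 的核心是一个基于注意力的非参数分类器:查询样本的嵌入与带标签的支持集逐一比较,再用相似度的 softmax 对支持标签加权。下面用 numpy 给出一个极简示意(这里的嵌入函数退化为恒等映射,仅作占位;论文中嵌入函数与全上下文嵌入是端到端学习的):

```python
import numpy as np

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def one_shot_predict(query, support_x, support_y, n_classes):
    """P(y|query) = sum_i a(query, x_i) * onehot(y_i), with a = softmax of cosine sims."""
    sims = np.array([cosine(query, x) for x in support_x])
    attn = softmax(sims)
    probs = np.zeros(n_classes)
    for w, y in zip(attn, support_y):
        probs[y] += w
    return probs

# Tiny example: 3 classes, one (identity-embedded) support example per class.
support_x = [np.array([1.0, 0.0]), np.array([0.0, 1.0]), np.array([1.0, 1.0])]
support_y = [0, 1, 2]
print(one_shot_predict(np.array([0.9, 0.1]), support_x, support_y, 3))
```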
[79] Memory networks (2015)
Abstract: We describe a new class of learning models called memory networks. Memory networks reason with inference components combined with a long-term memory component; they learn how to use these jointly. The long-term memory can be read and written to, with the goal of using it for prediction. We investigate these models in the context of question answering (QA) where the long-term memory effectively acts as a (dynamic) knowledge base, and the output is a textual response. We evaluate them on a large-scale QA task, and a smaller, but more complex, toy task generated from a simulated world. In the latter, we show the reasoning power of such models by chaining multiple supporting sentences to answer questions that require understanding the intension of verbs.
摘要: 我们描述了一种称为记忆网络的新型学习模型。内存网络的推理将推理成分与长期内存成分结合在一起;他们学习如何共同使用这些。可以读取和写入长期内存,目的是将其用于预测。我们在问答(QA)的背景下研究这些模型,其中长期记忆有效地充当(动态)知识库,而输出是文本响应。我们根据大型质量检查任务以及由模拟世界生成的较小但更复杂的玩具任务对它们进行评估。在后者中,我们通过链接多个支持语句来回答需要理解动词内涵的问题,从而展示了这种模型的推理能力。
[80] Meta-Learning with Memory-Augmented Neural Networks (2016)
Abstract: Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of “one-shot learning.” Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently releam their parameters to adequately incorporate the new information without catastrophic interference. Architectures with augmented memory capacities, such as Neural Turing Machines (NTMs), offer the ability to quickly encode and retrieve new information, and hence can potentially obviate the downsides of conventional models. Here, we demonstrate the ability of a memory-augmented neural network to rapidly assimilate new data, and leverage this data to make accurate predictions after only a few samples. We also introduce a new method for accessing an external memory that focuses on memory content, unlike previous methods that additionally use memory location-based focusing mechanisms.
摘要: 尽管在深度神经网络的应用方面取得了近期突破,但提出“持续性挑战”的一个条件是“一次性学习”。传统的基于梯度的网络通常需要通过大量的迭代训练来学习大量数据。当遇到新数据时,模型必须无效地释放其参数以充分合并新信息而不会造成灾难性干扰。具有增强存储容量的体系结构,例如神经图灵机(NTM),提供了快速编码和检索新信息的能力,因此可以避免传统模型的弊端。在这里,我们展示了增强记忆的神经网络快速吸收新数据的能力,并利用这些数据仅需几个样本即可做出准确的预测。我们还介绍了一种访问外部存储器的新方法,该方法专注于存储器内容,这与以前的方法另外使用基于存储器位置的聚焦机制不同。
[81] Mind's eye: A recurrent visual representation for image caption generation (2015)
Abstract: In this paper we explore the bi-directional mapping between images and their sentence-based descriptions. Critical to our approach is a recurrent neural network that attempts to dynamically build a visual representation of the scene as a caption is being generated or read. The representation automatically learns to remember long-term visual concepts. Our model is capable of both generating novel captions given an image, and reconstructing visual features given an image description. We evaluate our approach on several tasks. These include sentence generation, sentence retrieval and image retrieval. State-of-the-art results are shown for the task of generating novel image descriptions. When compared to human generated captions, our automatically generated captions are equal to or preferred by humans 21.0{\%} of the time. Results are better than or comparable to state-of-the-art results on the image and sentence retrieval tasks for methods using similar visual features.
摘要: 在本文中,我们探讨了图像及其基于句子的描述之间的双向映射。我们方法的核心是一个循环神经网络,它试图在生成或阅读描述的过程中动态构建场景的视觉表示。该表示会自动学习记住长期的视觉概念。我们的模型既能为给定图像生成新颖的描述,又能根据给定的图像描述重建视觉特征。我们在若干任务上评估了我们的方法,包括句子生成、句子检索和图像检索。在生成新颖图像描述的任务上,我们取得了最先进的结果。与人工撰写的描述相比,我们自动生成的描述在21.0%的情况下被人类评为相当或更好。对于使用类似视觉特征的方法而言,我们在图像和句子检索任务上的结果优于或可媲美最先进的结果。
下载地址 | 返回目录 | [10.1109/CVPR.2015.7298856]
[82] Mining on Manifolds: Metric Learning Without Labels (2018)
Abstract: In this work we present a novel unsupervised framework for hard training example mining. The only input to the method is a collection of images relevant to the target application and a meaningful initial representation, provided e.g. by pre-trained CNN. Positive examples are distant points on a single manifold, while negative examples are nearby points on different manifolds. Both types of examples are revealed by disagreements between Euclidean and manifold similarities. The discovered examples can be used in training with any discriminative loss. The method is applied to unsupervised fine-tuning of pre-trained networks for fine-grained classification and particular object retrieval. Our models are on par or are outperforming prior models that are fully or partially supervised.
摘要: 在这项工作中,我们为刻苦训练示例挖掘提供了一种新颖的无监督框架。该方法的唯一输入是与目标应用程序相关的图像集合以及有意义的初始表示,例如,通过预先训练的CNN。正例是单个流形上的远点,而负例是不同流形上的近点。两种类型的例子都可以通过欧几里得和流形相似性之间的分歧来揭示。发现的示例可用于具有任何区别损失的训练。该方法适用于预训练网络的无监督微调,以进行细分类和特定对象检索。我们的模型与完全或部分监督的模型处于同等水平或优于先前的模型。
下载地址 | 返回目录 | [10.1109/CVPR.2018.00797]
[83] Modeling and Propagating CNNs in a Tree Structure for Visual Tracking (2016)
Abstract: We present an online visual tracking algorithm by managing multiple target appearance models in a tree structure. The proposed algorithm employs Convolutional Neural Networks (CNNs) to represent target appearances, where multiple CNNs collaborate to estimate target states and determine the desirable paths for online model updates in the tree. By maintaining multiple CNNs in diverse branches of tree structure, it is convenient to deal with multi-modality in target appearances and preserve model reliability through smooth updates along tree paths. Since multiple CNNs share all parameters in convolutional layers, it takes advantage of multiple models with little extra cost by saving memory space and avoiding redundant network evaluations. The final target state is estimated by sampling target candidates around the state in the previous frame and identifying the best sample in terms of a weighted average score from a set of active CNNs. Our algorithm illustrates outstanding performance compared to the state-of-the-art techniques in challenging datasets such as online tracking benchmark and visual object tracking challenge.
摘要: 我们通过管理树状结构中的多个目标外观模型,提出了一种在线视觉跟踪算法。所提出的算法采用卷积神经网络(CNN)表示目标外观,其中多个CNN协作以估计目标状态并确定树中在线模型更新的理想路径。通过在树结构的不同分支中维护多个CNN,可以方便地处理目标外观中的多模式,并通过沿树路径的平滑更新来保持模型的可靠性。由于多个CNN共享卷积层中的所有参数,因此它通过节省内存空间并避免了冗余网络评估,以很少的额外成本利用了多个模型。通过在前一帧中的状态周围对目标候选进行采样并根据来自一组活动CNN的加权平均得分来确定最佳样本,来估算最终目标状态。与最新技术相比,我们的算法在具有挑战性的数据集(例如在线跟踪基准和视觉对象跟踪挑战)中表现出了卓越的性能。
[84] Multitudes : Aux origines d'une revue radicale (2017)
Abstract: This article explains the foundation in 2000 of the journal Multitudes. Carried by a group around Yann Moulier-Boutang, Multitudes shows its closeness with the Italian intellectual Antonio Negri who was in jail at this moment. Contrary to other journals as Vacarmes or mouvements which are more directly linked to the rise of social movements from 1995 to 1997, Multitudes claims its belonging to another temporality. This article intends to question this reality by proceeding to the analysis of the theoretical and political origins of Multitudes. Thanks to the method of social history of political ideas, the article reintegrates Multitudes within the context of “years 68” when it was born. Indeed, it is the moment when crossed the trajectories of Antonio Negri, Yann Moulier-Boutang and Felix Guattari. Multitudes arises from this history of unorthodox marxisms as trotskyism and operaism which starts during the 1970\s and hybridize within the 1980\s, when political struggles ebbed and marxisms collapsed.
摘要: 本文解释了Multitudes杂志在2000年的创立。 Multitudes由Yann Moulier-Boutang周围的一个团队所携带,显示了与目前在监狱中的意大利知识分子Antonio Negri的亲密关系。与1995年至1997年与社会运动的兴起更直接相关的Vacarmes或运动等其他期刊相反,Multitudes声称它属于另一种暂时性。本文旨在通过对众多理论和政治渊源的分析来质疑这一现实。由于采用了政治思想的社会史方法,本文在“ 68岁”出生的背景下重新融合了许多人。确实,这是跨越安东尼奥·内格里(Antonio Negri),扬·穆里尔·布唐(Yann Moulier-Boutang)和费利克斯·瓜塔里(Felix Guattari)的那一刻的时刻。大量的非传统马克思主义历史源于托洛斯基主义和操作主义,始于1970年代,并在1980年代混合发展,当时政治斗争消退,马克思主义崩溃。
下载地址 | 返回目录 | [10.3917/rai.067.0031]
[85] Net2Net: Accelerating learning via knowledge transfer (2016)
Abstract: We introduce techniques for rapidly transferring the information stored in one neural net into another neural net. The main purpose is to accelerate the training of a significantly larger neural net. During real-world workflows, one often trains very many different neural networks during the experimentation and design process. This is a wasteful process in which each new model is trained from scratch. Our Net2Net technique accelerates the experimentation process by instantaneously transferring the knowledge from a previous network to each new deeper or wider network. Our techniques are based on the concept of function-preserving transformations between neural network specifications. This differs from previous approaches to pre-training that altered the function represented by a neural net when adding layers to it. Using our knowledge transfer mechanism to add depth to Inception modules, we demonstrate a new state of the art accuracy rating on the ImageNet dataset.
摘要: 我们引入了将存储在一个神经网络中的信息快速转移到另一个神经网络的技术。主要目的是加速训练更大的神经网络。在现实世界的工作流程中,人们经常在实验和设计过程中训练很多不同的神经网络。这是一个浪费的过程,其中从头开始训练每个新模型。我们的Net2Net技术通过将知识从先前的网络瞬时转移到每个新的更深或更广的网络来加速实验过程。我们的技术基于神经网络规范之间的功能保留转换概念。这与以前的预训练方法不同,前一种方法是在向神经网络添加层时更改神经网络表示的功能。使用我们的知识转移机制为Inception模块增加深度,我们在ImageNet数据集上展示了最新的精度等级。
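[85] 中"保持函数不变的扩宽"(Net2WiderNet)思想可以用两层全连接网络直观说明:新增隐藏单元复制已有单元的输入权重,而被复制单元的输出权重按复制次数平分,从而网络输出完全不变。下面是按此假设写的 numpy 小示例(论文还给出了加深算子和卷积层的情形,此处从略):

```python
import numpy as np

def net2wider(W1, b1, W2, new_width):
    """Widen the hidden layer of h = relu(x@W1 + b1); y = h@W2, preserving the function."""
    old_width = W1.shape[1]
    # Mapping from new units to old ones: originals first, extras are random copies.
    mapping = np.concatenate([np.arange(old_width),
                              np.random.randint(0, old_width, new_width - old_width)])
    counts = np.bincount(mapping, minlength=old_width)
    W1w = W1[:, mapping]                              # copy incoming weights
    b1w = b1[mapping]
    W2w = W2[mapping, :] / counts[mapping][:, None]   # split outgoing weights
    return W1w, b1w, W2w

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))
W1, b1, W2 = rng.normal(size=(3, 5)), rng.normal(size=5), rng.normal(size=(5, 2))
y_old = np.maximum(x @ W1 + b1, 0) @ W2

W1w, b1w, W2w = net2wider(W1, b1, W2, new_width=8)
y_new = np.maximum(x @ W1w + b1w, 0) @ W2w
print(np.allclose(y_old, y_new))  # True: same function, wider layer
```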
[86] Network morphism (2016)
Abstract: We present a systematic study on how to morph a well-trained neural network to a new one so that its network function can be completely preserved. We define this as network morphism in this research. After morphing a parent network, the child network is expected to inherit the knowledge from its parent network and also has the potential to continue growing into a more powerful one with much shortened training time. The first requirement for this network morphism is its ability to handle diverse morphing types of networks, including changes of depth, width, kernel size, and even subnet. To meet this requirement, we first introduce the network morphism equations, and then develop novel morphing algorithms for all these morphing types for both classic and convolutional neural networks. The second requirement is its ability to deal with non-linearity in a network. We propose a family of parametric-activation functions to facilitate the morphing of any continuous nonlinear activation neurons. Experimental results on benchmark datasets and typical neural networks demonstrate the effectiveness of the proposed network morphism scheme.
摘要: 我们对如何将训练有素的神经网络变形为新的神经网络进行了系统的研究,以便可以完全保留其网络功能。在这项研究中,我们将其定义为网络形态。在对父级网络进行了变形之后,期望子级网络从其父级网络中继承知识,并且有潜力继续发展为功能更强大的网络,而培训时间将大大缩短。这种网络形态的第一个要求是其处理各种网络形态类型的能力,包括深度,宽度,内核大小甚至子网的变化。为了满足此要求,我们首先介绍网络态射方程,然后针对经典和卷积神经网络针对所有这些变形类型开发新颖的变形算法。第二个要求是其处理网络非线性的能力。我们提出了一组参数激活函数,以促进任何连续的非线性激活神经元的变形。在基准数据集和典型神经网络上的实验结果证明了所提出的网络态射方案的有效性。
[87] Neural Turing Machines (2014)
Abstract: We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.
摘要: 我们通过将神经网络耦合到外部存储资源来扩展神经网络的功能,它们可以通过注意力过程与之交互。组合系统类似于图灵机或冯·诺依曼体系结构,但是端到端是可区分的,因此可以通过梯度下降有效地进行训练。初步结果表明,神经图灵机可以从输入和输出示例中推断出简单的算法,例如复制,排序和关联召回。
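[87] 所说的"通过注意力过程与外部存储交互",其读操作的核心是基于内容的寻址:控制器给出一个键向量,与每个存储槽做余弦相似度,经键强度 beta 锐化后做 softmax,得到可微的软读取。下面是只含该步骤的 numpy 草图(完整的 NTM 还包括插值、移位寻址和写头,这里省略):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def content_addressing(memory, key, beta):
    """w_i = softmax(beta * cos(key, M_i)); returns the weighting and the read vector."""
    sims = memory @ key / (np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8)
    w = softmax(beta * sims)
    read = w @ memory            # differentiable weighted read
    return w, read

memory = np.random.randn(8, 4)                 # 8 slots, 4-dimensional contents
key = memory[3] + 0.05 * np.random.randn(4)    # noisy query resembling slot 3
w, read = content_addressing(memory, key, beta=10.0)
print(np.argmax(w), np.round(w, 3))
```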
[88] Neural machine translation by jointly learning to align and translate (2015)
Abstract: Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder–decoders and encode a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder–decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.
摘要: 神经机器翻译是最近提出的机器翻译方法。与传统的统计机器翻译不同,神经机器翻译旨在构建一个可以联合调优以最大化翻译性能的单一神经网络。最近为神经机器翻译提出的模型大多属于编码器-解码器家族,它们将源句子编码为一个固定长度的向量,解码器再从该向量生成译文。在本文中,我们推测固定长度向量的使用是提升这一基本编码器-解码器架构性能的瓶颈,并提出对其进行扩展:允许模型自动地(软)搜索源句子中与预测某个目标词相关的部分,而无需显式地将这些部分切分为硬性片段。借助这种新方法,我们在英法翻译任务上取得了与现有最先进的基于短语的系统相当的翻译性能。此外,定性分析表明,模型找到的(软)对齐与我们的直觉非常吻合。
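[88] 中的(软)搜索即加性注意力:用解码器上一时刻状态与每个编码器注释向量打分,softmax 归一化后加权求和得到上下文向量。下面用 numpy 给出该步骤的示意(权重矩阵用随机数占位;模型中它们与网络其余部分联合训练):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def additive_attention(s_prev, H, Wa, Ua, va):
    """e_j = va^T tanh(Wa s_{i-1} + Ua h_j); alpha = softmax(e); c_i = sum_j alpha_j h_j."""
    scores = np.array([va @ np.tanh(Wa @ s_prev + Ua @ h) for h in H])
    alpha = softmax(scores)
    context = alpha @ H
    return alpha, context

rng = np.random.default_rng(0)
H = rng.normal(size=(6, 8))        # 6 source annotations of size 8
s_prev = rng.normal(size=4)        # previous decoder state
Wa, Ua, va = rng.normal(size=(16, 4)), rng.normal(size=(16, 8)), rng.normal(size=16)
alpha, context = additive_attention(s_prev, H, Wa, Ua, va)
print(np.round(alpha, 3), context.shape)
```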
[89] Neural machine translation of rare words with subword units (2016)
Abstract: Neural machine translation (NMT) models typically operate with a fixed vocabulary, but translation is an open-vocabulary problem. Previous work addresses the translation of out-of-vocabulary words by backing off to a dictionary. In this paper, we introduce a simpler and more effective approach, making the NMT model capable of open-vocabulary translation by encoding rare and unknown words as sequences of subword units. This is based on the intuition that various word classes are translatable via smaller units than words, for instance names (via character copying or transliteration), compounds (via compositional translation), and cognates and loanwords (via phonological and morphological transformations). We discuss the suitability of different word segmentation techniques, including simple character ngram models and a segmentation based on the byte pair encoding compression algorithm, and empirically show that subword models improve over a back-off dictionary baseline for the WMT 15 translation tasks English→German and English→Russian by up to 1.1 and 1.3 Bleu, respectively.
摘要: 神经机器翻译(NMT)模型通常使用固定的词表,但翻译本质上是一个开放词表问题。先前的工作通过回退到词典来处理词表外单词的翻译。在本文中,我们介绍了一种更简单、更有效的方法:将罕见词和未知词编码为子词单元序列,使NMT模型能够进行开放词表翻译。这基于这样的直觉:许多词类可以通过比单词更小的单元来翻译,例如人名(通过字符复制或音译)、复合词(通过组合翻译)以及同源词和借词(通过语音和形态变换)。我们讨论了不同分词技术的适用性,包括简单的字符n-gram模型和基于字节对编码(BPE)压缩算法的切分方法,并通过实验证明,在WMT 15的英语→德语和英语→俄语翻译任务上,子词模型相对于回退词典基线分别最多提高了1.1和1.3个BLEU。
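[89] 的子词切分通过字节对编码(BPE)学习:反复合并训练词表中出现频率最高的相邻符号对。下面是该学习循环的精简 Python 草图(作用在一个玩具词频词典上;论文发布的工具还包括词尾标记处理以及把学到的合并规则应用到新文本等步骤):

```python
import re
from collections import Counter

def get_pair_counts(vocab):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, vocab):
    """Replace every occurrence of the pair with the merged symbol."""
    pattern = re.compile(r"(?<!\S)" + re.escape(" ".join(pair)) + r"(?!\S)")
    return {pattern.sub("".join(pair), word): freq for word, freq in vocab.items()}

# Words as space-separated characters with an end-of-word marker.
vocab = {"l o w </w>": 5, "l o w e r </w>": 2, "n e w e s t </w>": 6, "w i d e s t </w>": 3}
for i in range(8):
    pairs = get_pair_counts(vocab)
    best = max(pairs, key=pairs.get)   # most frequent adjacent pair
    vocab = merge_pair(best, vocab)
    print(f"merge {i}: {best}")
```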
[90] On the importance of initialization and momentum in deep learning (2013)
Abstract: Deep and recurrent neural networks (DNNs and RNNs respectively) are powerful models that were considered to be almost impossible to train using stochastic gradient descent with momentum. In this paper, we show that when stochastic gradient descent with momentum uses a well-designed random initialization and a particular type of slowly increasing schedule for the momentum parameter, it can train both DNNs and RNNs (on datasets with long-term dependencies) to levels of performance that were previously achievable only with Hessian-Free optimization. We find that both the initialization and the momentum are crucial since poorly initialized networks cannot be trained with momentum and well-initialized networks perform markedly worse when the momentum is absent or poorly tuned. Our success training these models suggests that previous attempts to train deep and recurrent neural networks from random initializations have likely failed due to poor initialization schemes. Furthermore, carefully tuned momentum methods suffice for dealing with the curvature issues in deep and recurrent network training objectives without the need for sophisticated second-order methods. Copyright 2013 by the author(s).
摘要: 深度神经网络和循环神经网络(分别为DNN和RNN)是功能强大的模型,过去被认为几乎不可能用带动量的随机梯度下降来训练。在本文中,我们表明,当带动量的随机梯度下降使用精心设计的随机初始化以及一种特定的、缓慢增加的动量参数调度时,它可以将DNN和RNN(在具有长期依赖的数据集上)训练到以前只有Hessian-Free优化才能达到的性能水平。我们发现初始化和动量都至关重要:初始化不佳的网络无法用动量训练,而初始化良好的网络在动量缺失或调节不当时性能会明显变差。我们成功训练这些模型的经验表明,以往从随机初始化训练深度和循环神经网络的尝试之所以失败,很可能是由于初始化方案不佳。此外,精心调节的动量方法足以应对深度网络和循环网络训练目标中的曲率问题,而无需复杂的二阶方法。版权所有 2013,归作者所有。
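[90] 讨论的动量法维护一个累积历史梯度的速度项,Nesterov 变体则在"前瞻点"处计算梯度。下面在一个二次目标上用 numpy 给出两种更新的小示意(摘要中提到的缓慢递增的动量调度此处为简洁起见省略):

```python
import numpy as np

def grad(theta):
    return 2.0 * (theta - 1.0)   # gradient of ||theta - 1||^2

def sgd_momentum(theta, steps=50, lr=0.05, mu=0.9, nesterov=False):
    v = np.zeros_like(theta)
    for _ in range(steps):
        g = grad(theta + mu * v) if nesterov else grad(theta)   # look-ahead for Nesterov
        v = mu * v - lr * g          # v_{t+1} = mu * v_t - lr * grad
        theta = theta + v            # theta_{t+1} = theta_t + v_{t+1}
    return theta

theta0 = np.zeros(3)
print(sgd_momentum(theta0.copy()))                  # classical momentum
print(sgd_momentum(theta0.copy(), nesterov=True))   # Nesterov momentum
```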
[91] Perceptual losses for real-time style transfer and super-resolution (2016)
Abstract: We consider image transformation problems, where an input image is transformed into an output image. Recent methods for such problems typically train feed-forward convolutional neural networks using a per-pixel loss between the output and ground-truth images. Parallel work has shown that high-quality images can be generated by defining and optimizing perceptual loss functions based on high-level features extracted from pretrained networks. We combine the benefits of both approaches, and propose the use of perceptual loss functions for training feed-forward networks for image transformation tasks. We show results on image style transfer, where a feed-forward network is trained to solve the optimization problem proposed by Gatys et al. in real-time. Compared to the optimization-based method, our network gives similar qualitative results but is three orders of magnitude faster. We also experiment with single-image super-resolution, where replacing a per-pixel loss with a perceptual loss gives visually pleasing results.
摘要: 我们考虑图像变换问题,即把输入图像转换为输出图像。针对此类问题的最新方法通常使用输出图像与真实图像之间的逐像素损失来训练前馈卷积神经网络。与此同时,另一些工作表明,基于从预训练网络中提取的高层特征来定义并优化感知损失函数,可以生成高质量的图像。我们结合两种方法的优点,提出使用感知损失函数来训练用于图像变换任务的前馈网络。我们展示了图像风格迁移的结果:训练一个前馈网络来实时求解Gatys等人提出的优化问题。与基于优化的方法相比,我们的网络给出了相近的定性结果,但速度快了三个数量级。我们还在单图像超分辨率上进行了实验,用感知损失代替逐像素损失可以得到视觉上更令人满意的结果。
下载地址 | 返回目录 | [10.1007/978-3-319-46475-6_43]
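[91] 所训练的特征重建("感知")损失可以按如下方式粗略示意:生成图像与目标图像都送入一个固定的预训练 VGG-16 的低层,两者特征图的均方误差即为损失。以下草图基于若干假设(层索引 16 只是大致对应 relu3_3 的示意选择;预训练权重参数与层划分随 torchvision 版本而异,且省略了 ImageNet 归一化):

```python
import torch
import torch.nn as nn
import torchvision

class PerceptualLoss(nn.Module):
    """Feature reconstruction loss against a fixed, pretrained VGG-16."""
    def __init__(self, layer_index=16):   # assumed cut, roughly around relu3_3
        super().__init__()
        # Newer torchvision uses weights="DEFAULT"; older releases use pretrained=True.
        vgg = torchvision.models.vgg16(weights="DEFAULT").features[:layer_index]
        for p in vgg.parameters():
            p.requires_grad_(False)        # the loss network stays frozen
        self.vgg = vgg.eval()
        self.mse = nn.MSELoss()

    def forward(self, generated, target):
        # NOTE: ImageNet mean/std normalisation is omitted for brevity.
        return self.mse(self.vgg(generated), self.vgg(target))

loss_fn = PerceptualLoss()
generated = torch.rand(1, 3, 256, 256, requires_grad=True)  # e.g. transform net output
target = torch.rand(1, 3, 256, 256)                          # content target
loss = loss_fn(generated, target)
loss.backward()                                              # gradients flow to `generated`
print(float(loss))
```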
[92] Pixel recurrent neural networks (2016)
Abstract: Modeling the distribution of natural images is a landmark problem in unsupervised learning. This task requires an image model that is at once expressive, tractable and scalable. We present a deep neural network that sequentially predicts the pixels in an image along the two spatial dimensions. Our method models the discrete probability of the raw pixel values and encodes the complete set of dependencies in the image. Architectural novelties include fast two-dimensional recurrent layers and an effective use of residual connections in deep recurrent networks. We achieve log-likelihood scores on natural images that are considerably better than the previous state of the art. Our main results also provide benchmarks on the diverse ImageNet dataset. Samples generated from the model appear crisp, varied and globally coherent.
摘要: 对自然图像的分布进行建模是无监督学习中的一个里程碑式问题。此任务需要一个既有表达力、又易于处理且可扩展的图像模型。我们提出了一个深度神经网络,它沿两个空间维度依次预测图像中的像素。我们的方法对原始像素值的离散概率进行建模,并编码图像中完整的依赖关系。架构上的创新包括快速的二维循环层以及在深度循环网络中对残差连接的有效利用。我们在自然图像上获得的对数似然分数显著优于此前的最好水平。我们的主要结果还为多样化的ImageNet数据集提供了基准。模型生成的样本清晰、多样且全局一致。
[93] Playing Atari with Deep Reinforcement Learning (2013)
Abstract: We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
摘要: 我们提出了第一个深度学习模型,可以使用强化学习直接从高维感官输入中成功学习控制策略。该模型是一个卷积神经网络,它经过Q学习的变体训练,其输入是原始像素,其输出是估计未来奖励的值函数。我们将我们的方法应用于Arcade学习环境中的七个Atari 2600游戏,无需调整体系结构或学习算法。我们发现它在六个游戏中的表现优于以前的所有方法,在三个游戏中都超过了人类专家。
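[93] 方法的核心是从回放的转移样本计算 Q 学习目标:终止步取 y = r,否则 y = r + γ·max_a' Q(s', a'),再以 (Q(s,a) − y)² 作为损失做梯度下降。下面用 numpy 给出目标值计算的示意(为了清晰假设 Q 值已以数组形式给出;论文中 Q 由作用在原始画面上的卷积网络产生,并配合经验回放):

```python
import numpy as np

def dqn_targets(q_next, rewards, dones, gamma=0.99):
    """y_i = r_i                                  if the episode ended at step i
       y_i = r_i + gamma * max_a q_next[i, a]     otherwise"""
    return rewards + gamma * (1.0 - dones) * q_next.max(axis=1)

# A batch of 4 replayed transitions over an action space of size 3.
q_next  = np.array([[0.2, 1.0, 0.5],
                    [0.0, 0.1, 0.3],
                    [2.0, 1.5, 0.9],
                    [0.4, 0.4, 0.4]])
rewards = np.array([1.0, 0.0, -1.0, 0.5])
dones   = np.array([0.0, 1.0, 0.0, 0.0])   # 1 marks a terminal transition
print(dqn_targets(q_next, rewards, dones))
# The training loss is then mean((Q(s_i, a_i) - y_i)^2), minimised by gradient descent.
```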
[94] Pointer networks (2015)
Abstract: We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence 1 and Neural Turing Machines 2, because the number of target classes in each step of the output depends on the length of the input, which is variable. Problems such as sorting variable sized sequences, and various combinatorial optimization problems belong to this class. Our model solves the problem of variable size output dictionaries using a recently proposed mechanism of neural attention. It differs from the previous attention attempts in that, instead of using attention to blend hidden units of an encoder to a context vector at each decoder step, it uses attention as a pointer to select a member of the input sequence as the output. We call this architecture a Pointer Net (Ptr-Net). We show Ptr-Nets can be used to learn approximate solutions to three challenging geometric problems - finding planar convex hulls, computing Delaunay triangulations, and the planar Travelling Salesman Problem - using training examples alone. Ptr-Nets not only improve over sequence-to-sequence with input attention, but also allow us to generalize to variable size output dictionaries. We show that the learnt models generalize beyond the maximum lengths they were trained on. We hope our results on these tasks will encourage a broader exploration of neural learning for discrete problems.
摘要: 我们引入了一种新的神经体系结构,用于学习输出序列的条件概率,其中输出序列的元素是与输入序列中位置相对应的离散标记。此类问题无法由现有方法(如序列到序列模型和神经图灵机)直接解决,因为输出每一步的目标类别数量取决于输入的长度,而输入长度是可变的。对可变长度序列进行排序等问题,以及各种组合优化问题,都属于这一类。我们的模型利用最近提出的神经注意力机制解决了可变大小输出词表的问题。它与以往注意力方法的不同之处在于:它不是在每个解码步骤用注意力将编码器的隐藏单元混合成上下文向量,而是将注意力用作指针,选择输入序列中的某个成员作为输出。我们将这种架构称为指针网络(Ptr-Net)。我们展示了Ptr-Net可以仅凭训练样例学习三个具有挑战性的几何问题的近似解:求平面凸包、计算Delaunay三角剖分以及平面旅行商问题。Ptr-Net不仅改进了带输入注意力的序列到序列模型,还使我们能够泛化到可变大小的输出词表。我们表明,学到的模型能够泛化到超出其训练时所见的最大长度。我们希望这些结果能鼓励对离散问题的神经学习进行更广泛的探索。
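[94] 的指向机制本身可以这样示意:注意力分数不再用于构造上下文向量,而是直接作为对输入位置的概率分布,因此"输出词表"的大小自动等于(可变的)输入长度。下面的 numpy 草图中,权重均为随机占位,代表训练得到的参数:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def pointer_distribution(dec_state, enc_states, W1, W2, v):
    """u_j = v^T tanh(W1 e_j + W2 d);  p(position j) = softmax(u)_j."""
    scores = np.array([v @ np.tanh(W1 @ e + W2 @ dec_state) for e in enc_states])
    return softmax(scores)

rng = np.random.default_rng(0)
enc_states = rng.normal(size=(7, 16))   # 7 input positions -> 7 possible "output tokens"
dec_state = rng.normal(size=16)
W1, W2, v = rng.normal(size=(32, 16)), rng.normal(size=(32, 16)), rng.normal(size=32)
p = pointer_distribution(dec_state, enc_states, W1, W2, v)
print(np.round(p, 3), "points to position", int(np.argmax(p)))
```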
[95] Policy distillation (2016)
Abstract: Policies for complex visual tasks have been successfully learned with deep reinforcement learning, using an approach called deep Q-networks (DQN), but relatively large (task-specific) networks and extensive training are needed to achieve good performance. In this work, we present a novel method called policy distillation that can be used to extract the policy of a reinforcement learning agent and train a new network that performs at the expert level while being dramatically smaller and more efficient. Furthermore, the same method can be used to consolidate multiple task-specific policies into a single policy. We demonstrate these claims using the Atari domain and show that the multi-task distilled agent outperforms the single-task teachers as well as a jointly-trained DQN agent.
摘要: 借助称为深度Q网络(DQN)的方法,复杂视觉任务的策略已经可以通过深度强化学习成功学得,但要获得良好的性能,需要相对较大的(特定于任务的)网络和大量训练。在这项工作中,我们提出了一种称为策略蒸馏的新方法,它可以提取强化学习智能体的策略,并训练一个新的网络,使其在达到专家水平的同时显著更小、更高效。此外,同样的方法还可以将多个特定于任务的策略合并为单个策略。我们在Atari领域验证了这些主张,并表明多任务蒸馏智能体的表现优于单任务教师以及联合训练的DQN智能体。
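[95] 用来压缩教师策略的蒸馏目标,可以按论文描述的思路粗略示意:把教师 Q 值经低温 softmax 得到的动作分布作为回归目标,用 KL 散度约束学生的动作分布(下面的温度取值与目标形式是按该思路写的假设性示例,网络细节省略):

```python
import numpy as np

def softmax(x, tau=1.0):
    z = x / tau
    e = np.exp(z - z.max())
    return e / e.sum()

def distillation_kl(teacher_q, student_q, tau=0.01):
    """KL( softmax(teacher_q / tau) || softmax(student_q) ), averaged over a batch."""
    losses = []
    for tq, sq in zip(teacher_q, student_q):
        p = softmax(tq, tau)        # sharpened teacher policy (the regression target)
        q = softmax(sq)             # student policy
        losses.append(np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12))))
    return float(np.mean(losses))

teacher_q = np.array([[1.0, 2.0, 0.5], [0.1, 0.0, 3.0]])
student_q = np.array([[0.8, 1.5, 0.2], [0.0, 0.2, 2.0]])
print(distillation_kl(teacher_q, student_q))
```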
[96] Preparation of novel high copper ions removal membranes by embedding organosilane-functionalized multi-walled carbon nanotube (2016)
Abstract: BACKGROUND: Multi-walled carbon nanotubes (MWCNTs) have attracted considerable interest in the membrane field, but they are prone to aggregate in the polymer matrix and cause membrane defects. Different from blending MWCNTs in the casting solution, different ratios of APTS-functionalized MWCNTs (A-MWCNTs) were embedded on the surface of polyvinylidene fluoride (PVDF) membranes. RESULTS: FTIR and XPS demonstrated the reaction between MWCNTs and APTS. SEM and AFM images showed that new morphology and pore structure appeared in novel A-MWCNTs/PVDF membranes compared with pure PVDF membrane. The embedding was firm because of APTS chains penetrating into the PVDF matrix, and the optimum content of A-MWCNTs was 0.05 wt{\%}. Besides, A-MWCNTs/PVDF membranes exhibited superior contact angle and BSA rejection than PVDF membrane. In the tests of copper ions rejection and adsorption, 0.05 wt{\%} A-MWCNTs/PVDF membrane gave 89.1{\%} removal and 2.067 mg g−1 adsorption capacity compared with PVDF membrane, at just 19.66{\%} and 0.516 mg g−1, respectively. CONCLUSIONS: The existence of functional groups in A-MWCNTs such as –NH2 and –COOH, can not only increase the hydrophilicity of membranes, but also provide more adsorption sites to have a complexation reaction with Cu2+ ions. This work provides information to solve practical problems in the field of wastewater treatment. {\textcopyright} 2015 Society of Chemical Industry.
摘要: 背景:多壁碳纳米管(MWCNT)在膜领域引起了相当大的兴趣,但它们易于在聚合物基质中聚集并引起膜缺陷。与在浇铸溶液中混合MWCNT不同,在聚偏二氟乙烯(PVDF)膜的表面嵌入了不同比例的APTS功能化MWCNT(A-MWCNT)。结果:FTIR和XPS证实了MWCNTs与APTS之间的反应。 SEM和AFM图像表明,与纯PVDF膜相比,新型A-MWCNTs / PVDF膜出现了新的形态和孔结构。由于APTS链渗透到PVDF基质中,所以嵌入是牢固的,并且A-MWCNT的最佳含量为0.05 wt {\%}。此外,A-MWCNTs / PVDF膜比PVDF膜具有更好的接触角和BSA排斥性。在铜离子排斥和吸附测试中,与PVDF膜相比,0.05 wt {\%}的A-MWCNTs / PVDF膜与PVDF膜相比去除了89.1 {\%},吸附能力为2.067 mg g-1。 %}和0.516 mg g-1。结论:A-MWCNTs中存在诸如–NH2和–COOH的官能团,不仅可以增加膜的亲水性,而且可以提供更多的吸附位点以与Cu2 +离子发生络合反应。这项工作为解决废水处理中的实际问题提供了信息。 {\ textcopyright} 2015年化学工业协会。
下载地址 | 返回目录 | [10.1002/jctb.4820]
[97] Program Induction (2015)
Abstract: “People learning new concepts can often generalize successfully from just a single example, yet machine learning algorithms typically require tens or hundreds of examples to perform with similar accuracy. People can also use learned concepts in richer ways than conventional algorithms—for action, imagination, and explanation.We present a computational model that captures these human learning abilities for a large class of simple visual concepts: handwritten characters from the worlds alphabets. The model represents concepts as simple programs that best explain observed examples under a Bayesian criterion. On a challenging one-shot classification task, the model achieves human-level performance while outperforming recent deep learning approaches.We also present several “visual Turing tests” probing the models creative generalization abilities, which in many cases are indistinguishable from human behavior.”
摘要: “人们学习新概念通常可以仅通过一个示例就成功地将其概括,但是机器学习算法通常需要数十或数百个示例才能以相似的精度执行。人们还可以以比传统算法更丰富的方式使用学习过的概念,从而获得行动,想象力我们提出了一个计算模型,该模型捕获了一大类简单的视觉概念(来自世界字母表的手写字符)的人类学习能力,该模型将概念表示为简单的程序,可以最好地解释根据贝叶斯准则进行观察的示例。该模型具有挑战性,一次性完成了分类任务,在达到人类水平的性能的同时,还胜过了最近的深度学习方法。我们还提出了几种“视觉图灵测试”,以探究模型的创造性概括能力,这在许多情况下与人类行为没有区别。”
下载地址 | 返回目录 | [10.1126/science.aab3050]
[98] Progressive Neural Networks (2016)
Abstract: Learning to solve complex sequences of tasks–while both leveraging transfer and avoiding catastrophic forgetting–remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivity measure, we demonstrate that transfer occurs at both low-level sensory and high-level control layers of the learned policy.
摘要: 学习解决复杂的任务序列,同时既利用迁移又避免灾难性遗忘,仍然是实现人类水平智能的主要障碍。渐进式网络方法朝这个方向迈出了一步:它们不受遗忘的影响,并且可以通过到先前已学特征的横向连接来利用先验知识。我们在各种强化学习任务(Atari和3D迷宫游戏)上对该体系结构进行了广泛评估,结果表明它优于基于预训练和微调的常见基线。借助一种新颖的敏感性度量,我们证明了迁移同时发生在所学策略的低层感知层和高层控制层。
[99] Protective effect of rSm28GST-specific T cells in schistosomiasis: Role of gamma interferon (1994)
Abstract: Immunization with a single dose of 50 g of recombinant Schistosoma mansoni 28-kDa glutathione-S-transferase (rSm28GST) was able to induce a reduction in the worm burden, the number of eggs, and the degree of hepatic fibrosis as quantified by the measurement of collagen content in the liver of S. mansoni-infected mice. No relationship was found between anti-Sm28GST immunoglobulin G and immunoglobulin A titers and the levels of protection obtained. Adoptive transfers of Sm28GST-specific total, CD4+, or CD8+ T cells reproduced the protective effect obtained with the recombinant molecule. Moreover, experiments studying in vivo T-cell depletion demonstrated that anti-CD4- or anti-CD8-treated mice showed a significant decrease in the protective effect conferred, suggesting a role of the two T- cell subpopulations in the expression of Sm28GST-mediated protection against hepatic damage. Sm28GST-specific cells produced little interleukin-4 and high levels of gamma interferon. Treatment of immunized mice with anti-gamma interferon antibody totally suppressed the Sm28GST-induced protective effect and led to the rapid death of infected animals, suggesting a role for this cytokine in the expression of the protective immunity obtained after immunization with rSm28GST.
摘要: 使用单剂量50μgg的重组曼氏血吸虫28 kDa谷胱甘肽S-转移酶(rSm28GST)进行免疫可以减少蠕虫的负担,减少卵子的数量和降低程度。肝纤维化,通过对曼氏沙门氏菌感染小鼠肝脏中胶原蛋白含量的测量来量化。在抗Sm28GST免疫球蛋白G和免疫球蛋白A滴度与获得的保护水平之间未发现相关性。 Sm28GST特异性总CD4 +或CD8 + T细胞的过继转移可重现重组分子的保护作用。此外,研究体内T细胞耗竭的实验表明,抗CD4或抗CD8处理的小鼠显示出所赋予的保护作用显着降低,表明这两个T细胞亚群在Sm28GST介导的表达中的作用防止肝损伤。 Sm28GST特异性细胞产生很少的白介素4和高水平的γ干扰素。用抗γ干扰素抗体治疗免疫小鼠完全抑制了Sm28GST诱导的保护作用,并导致受感染动物迅速死亡,这表明该细胞因子在rSm28GST免疫后获得的保护性免疫表达中具有作用。
下载地址 | 返回目录 | [10.1128/iai.62.9.3723-3730.1994]
[100] R-FCN: Object detection via region-based fully convolutional networks (2016)
Abstract: We present region-based, fully convolutional networks for accurate and efficient object detection. In contrast to previous region-based detectors such as Fast/Faster R-CNN 7, 19 that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image. To achieve this goal, we propose position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection. Our method can thus naturally adopt fully convolutional image classifier backbones, such as the latest Residual Networks (ResNets) 10, for object detection. We show competitive results on the PASCAL VOC datasets (e.g., 83.6{\%} mAP on the 2007 set) with the 101-layer ResNet. Meanwhile, our result is achieved at a test-time speed of 170ms per image, 2.5-20× faster than the Faster R-CNN counterpart. Code is made publicly available at: https://github.com/daijifeng001/r-fcn.
摘要: 我们提出了基于区域的全卷积网络,以进行准确,高效的目标检测。与之前的基于区域的检测器(例如快速/快速R-CNN 7,19)多次应用昂贵的每个区域子网相比,我们的基于区域的检测器是完全卷积的,几乎所有计算都在整个图像上共享。为了实现此目标,我们提出了位置敏感得分图,以解决图像分类中的平移不变性与对象检测中的平移差异性之间的难题。因此,我们的方法自然可以采用完全卷积的图像分类器主干(例如最新的残差网络(ResNets)10)进行对象检测。我们在具有101层ResNet的PASCAL VOC数据集(例如2007年的83.6 {\%} mAP)上显示出竞争性结果。同时,我们的结果是在每幅图像170ms的测试时间速度下实现的,比Faster R-CNN的速度快2.5-20倍。可以在以下网址公开获取代码:https://github.com/daijifeng001/r-fcn。
[101] Reinforcement Learning Neural Turing Machines - Revised (2015)
Abstract: The Neural Turing Machine (NTM) is more expressive than all previously considered models because of its external memory. It can be viewed as a broader effort to use abstract external Interfaces and to learn a parametric model that interacts with them. The capabilities of a model can be extended by providing it with proper Interfaces that interact with the world. These external Interfaces include memory, a database, a search engine, or a piece of software such as a theorem verifier. Some of these Interfaces are provided by the developers of the model. However, many important existing Interfaces, such as databases and search engines, are discrete. We examine feasibility of learning models to interact with discrete Interfaces. We investigate the following discrete Interfaces: a memory Tape, an input Tape, and an output Tape. We use a Reinforcement Learning algorithm to train a neural network that interacts with such Interfaces to solve simple algorithmic tasks. Our Interfaces are expressive enough to make our model Turing complete.
摘要: 神经图灵机(NTM)由于具有外部存储器,因此比以前考虑的所有型号更具表现力。可以将其视为使用抽象外部接口并学习与其进行交互的参数模型的更广泛的尝试。可以通过为模型提供与世界互动的适当接口来扩展其功能。这些外部接口包括内存,数据库,搜索引擎或诸如定理验证器之类的软件。其中一些接口由模型的开发人员提供。但是,许多重要的现有接口(例如数据库和搜索引擎)是离散的。我们研究了学习模型与离散接口交互的可行性。我们研究以下离散接口:内存磁带,输入磁带和输出磁带。我们使用强化学习算法来训练与此类接口交互以解决简单算法任务的神经网络。我们的界面具有足够的表现力,可以使我们的图灵模型变得完整。
[102] Rich feature hierarchies for accurate object detection and semantic segmentation (2013)
Abstract: Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30{\%} relative to the previous best result on VOC 2012—achieving a mAP of 53.3{\%}. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. We also compare R-CNN to OverFeat, a recently proposed sliding-window detector based on a similar CNN architecture. We find that R-CNN outperforms OverFeat by a large margin on the 200-class ILSVRC2013 detection dataset. Source code for the complete system is available at http://www.cs.berkeley.edu/{\~{}}rbg/rcnn.
摘要: 根据标准的PASCAL VOC数据集测得的目标检测性能在最近几年一直处于稳定状态。表现最佳的方法是复杂的集成系统,通常将多个低级图像特征与高级上下文结合在一起。在本文中,我们提出了一种简单且可扩展的检测算法,相对于VOC 2012上的先前最佳结果,该算法将平均平均精度(mAP)提高了30 {\%}以上,达到了53.3 {\% }。我们的方法结合了两个关键的见解:(1)一个人可以将高容量卷积神经网络(CNN)应用于自下而上的区域建议,以对对象进行定位和分割;(2)当标记的训练数据稀少,有监督的预训练时对于辅助任务,然后进行特定于域的微调,可以显着提高性能。由于我们将区域提案与CNN相结合,因此我们将我们的方法称为R-CNN:具有CNN功能的区域。我们还将R-CNN与OverFeat(一种基于相似CNN架构的最近提出的滑动窗口检测器)进行了比较。我们发现,在200类ILSVRC2013检测数据集上,R-CNN优于OverFeat。有关完整系统的源代码,请访问http://www.cs.berkeley.edu/{\~{}}rbg/rcnn。
[下载地址](http://arxiv.org/abs/1311.2524) | 返回目录 | [10.1109/CVPR.2014.81]
[103] SSD: Single shot multibox detector (2016)
Abstract: We present a method for detecting objects in images using a single deep neural network. Our approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location. At prediction time, the network generates scores for the presence of each object category in each default box and produces adjustments to the box to better match the object shape. Additionally, the network combines predictions from multiple feature maps with different resolutions to naturally handle objects of various sizes. SSD is simple relative to methods that require object proposals because it completely eliminates proposal generation and subsequent pixel or feature resampling stages and encapsulates all computation in a single network. This makes SSD easy to train and straightforward to integrate into systems that require a detection component. Experimental results on the PASCAL VOC, COCO, and ILSVRC datasets confirm that SSD has competitive accuracy to methods that utilize an additional object proposal step and is much faster, while providing a unified framework for both training and inference. For 300×300 input, SSD achieves 74.3{\%} mAP on VOC2007 test at 59 FPS on a Nvidia Titan X and for 512 × 512 input, SSD achieves 76.9{\%} mAP, outperforming a comparable state of the art Faster R-CNN model. Compared to other single stage methods, SSD has much better accuracy even with a smaller input image size. Code is available at https://github.com/weiliu89/caffe/ tree/ssd.
摘要: 我们提出了一种使用单个深度神经网络检测图像中对象的方法。我们的名为SSD的方法将边界框的输出空间离散化为一组默认框,这些默认框具有不同的长宽比和每个要素图位置的比例。在预测时,网络会为每个默认框中的每个对象类别的存在生成分数,并对该框进行调整以更好地匹配对象形状。此外,该网络将来自多个具有不同分辨率的特征图的预测结合起来,以自然地处理各种大小的对象。相对于需要对象提议的方法,SSD简单,因为它完全消除了提议生成和后续的像素或特征重采样阶段,并将所有计算封装在一个网络中。这使得SSD易于培训,并且可以直接集成到需要检测组件的系统中。在PASCAL VOC,COCO和ILSVRC数据集上的实验结果证实,SSD与采用附加对象建议步骤的方法相比具有更高的准确性,并且速度更快,同时为训练和推理提供了统一的框架。对于300×300输入,SSD在Nvidia Titan X上以59 FPS的速度在VOC2007测试上达到74.3 {\%} mAP,对于512×512输入,SSD达到76.9 {\%} mAP,优于同类产品更快的R-CNN模型。与其他单阶段方法相比,即使输入图像尺寸较小,SSD的精度也要好得多。可以从https://github.com/weiliu89/caffe/tree/ssd获得代码。
下载地址 | 返回目录 | [10.1007/978-3-319-46448-0_2]
[104] Science (2010)
Abstract: Research during the past decade has greatly enriched our knowledge about the ways that curriculum design can impact learning in science. An important contribution comes from design-based research - an emerging methodology. Researchers have sought to synthesize refinement studies of curricular innovations. Frameworks to organize design knowledge are emerging. We demonstrate the potential use of such frameworks to guide the process of curricular design by describing two approaches for synthesizing design knowledge: the design principles and the design patterns approaches. Both approaches view science learning as a process of knowledge integration. These approaches can contribute to courses in curriculum design and help designers build on widely used materials to improve student learning. {\textcopyright} 2010 Elsevier Ltd. All rights reserved.
摘要: 过去十年的研究极大地丰富了我们对课程设计可以影响科学学习的方式的知识。一个重要的贡献来自基于设计的研究-一种新兴的方法。研究人员已寻求对课程创新的综合研究进行综合。组织设计知识的框架正在兴起。通过描述两种综合设计知识的方法,我们证明了这种框架在指导课程设计过程中的潜在用途:设计原理和设计模式方法。两种方法都将科学学习视为知识整合的过程。这些方法可以为课程设计课程做出贡献,并帮助设计师在广泛使用的材料基础上改进学生的学习。 {\ textcopyright} 2010 Elsevier Ltd.保留所有权利。
下载地址 | 返回目录 | [10.1016/B978-0-08-044894-7.00081-6]
[105] Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks (2016)
Abstract: Convolutional neural networks (CNNs) have proven highly effective at image synthesis and style transfer. For most users, however, using them as tools can be a challenging task due to their unpredictable behavior that goes against common intuitions. This paper introduces a novel concept to augment such generative architectures with semantic annotations, either by manually authoring pixel labels or using existing solutions for semantic segmentation. The result is a content-aware generative algorithm that offers meaningful control over the outcome. Thus, we increase the quality of images generated by avoiding common glitches, make the results look significantly more plausible, and extend the functional range of these algorithms—whether for portraits or landscapes, etc. Applications include semantic style transfer and turning doodles with few colors into masterful paintings!
摘要: 卷积神经网络(CNN)已被证明在图像合成和风格迁移方面非常有效。然而,对大多数用户而言,把它们当作工具使用颇具挑战,因为其难以预测的行为常常违背直觉。本文提出了一个新颖的概念:通过人工标注像素标签,或利用现有的语义分割方案,为这类生成式架构加入语义注释。其结果是一种内容感知的生成算法,能够对输出进行有意义的控制。这样,我们通过避免常见的瑕疵提升了生成图像的质量,使结果看起来明显更合理,并扩展了这些算法的适用范围(无论是人像还是风景等)。应用包括语义风格迁移,以及把只有寥寥几种颜色的涂鸦变成精湛的画作!
[106] Sequence to sequence learning with neural networks (2014)
Abstract: “Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector. Our main result is that on an English to French translation task from the WMT14 dataset, the translations produced by the LSTM achieve a BLEU score of 34.8 on the entire test set, where the LSTMs BLEU score was penalized on out-of-vocabulary words. Additionally, the LSTM did not have difficulty on long sentences. For comparison, a phrase-based SMT system achieves a BLEU score of 33.3 on the same dataset. When we used the LSTM to rerank the 1000 hypotheses produced by the aforementioned SMT system, its BLEU score increases to 36.5, which is close to the previous best result on this task. The LSTM also learned sensible phrase and sentence representations that are sensitive to word order and are relatively invariant to the active and the passive voice. Finally, we found that reversing the order of the words in all source sentences (but not target sentences) improved the LSTMs performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.”
摘要: 深度神经网络(DNN)是功能强大的模型,在困难的学习任务上取得了出色的性能。尽管只要有大型标注训练集DNN就能很好地工作,但它们无法直接用于将序列映射到序列。在本文中,我们提出了一种通用的端到端序列学习方法,它对序列结构做出最少的假设。我们的方法使用多层长短期记忆网络(LSTM)将输入序列映射到一个固定维数的向量,然后用另一个深层LSTM从该向量解码出目标序列。我们的主要结果是:在WMT14数据集的英译法任务上,LSTM生成的译文在整个测试集上取得了34.8的BLEU分数,其中词表外单词会使LSTM的BLEU分数受到惩罚。此外,LSTM在长句上也没有困难。作为对比,一个基于短语的SMT系统在同一数据集上的BLEU分数为33.3。当我们用LSTM对上述SMT系统产生的1000个候选假设重新排序时,其BLEU分数提高到36.5,接近此前在该任务上的最好结果。LSTM还学到了对词序敏感、而对主动语态和被动语态相对不变的合理短语与句子表示。最后,我们发现将所有源句子(而非目标句子)中的单词顺序颠倒可以显著提高LSTM的性能,因为这样做在源句子和目标句子之间引入了许多短期依赖,使优化问题变得更容易。
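[106] 描述的编码器-解码器模式可以用 PyTorch 风格的草图概括:一个 LSTM 把(颠倒后的)源序列压缩为固定大小的状态,另一个 LSTM 从该状态出发展开生成目标序列。以下示意中的维度、词表大小等均为说明用的占位,并非论文中 4 层、1000 单元的配置:

```python
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, emb=32, hidden=64):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, emb)
        self.tgt_emb = nn.Embedding(tgt_vocab, emb)
        self.encoder = nn.LSTM(emb, hidden, batch_first=True)
        self.decoder = nn.LSTM(emb, hidden, batch_first=True)
        self.out = nn.Linear(hidden, tgt_vocab)

    def forward(self, src, tgt_in):
        # Reversing the source, as in the paper, shortens early-term dependencies.
        _, state = self.encoder(self.src_emb(src.flip(dims=[1])))
        dec_out, _ = self.decoder(self.tgt_emb(tgt_in), state)   # teacher forcing
        return self.out(dec_out)                                  # logits per target step

model = Seq2Seq(src_vocab=100, tgt_vocab=120)
src = torch.randint(0, 100, (2, 7))      # batch of 2 source sentences, length 7
tgt_in = torch.randint(0, 120, (2, 5))   # shifted target inputs, length 5
logits = model(src, tgt_in)
print(logits.shape)                       # torch.Size([2, 5, 120])
```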
[108] Show and tell: A neural image caption generator (2015)
Abstract: Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing. In this paper, we present a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be used to generate natural sentences describing an image. The model is trained to maximize the likelihood of the target description sentence given the training image. Experiments on several datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. Our model is often quite accurate, which we verify both qualitatively and quantitatively. For instance, while the current state-of-the-art BLEU-1 score (the higher the better) on the Pascal dataset is 25, our approach yields 59, to be compared to human performance around 69. We also show BLEU-1 score improvements on Flickr30k, from 56 to 66, and on SBU, from 19 to 28. Lastly, on the newly released COCO dataset, we achieve a BLEU-4 of 27.7, which is the current state-of-the-art.
摘要: 自动描述图像内容是人工智能中连接计算机视觉与自然语言处理的一个基本问题。在本文中,我们提出了一个基于深度循环结构的生成模型,它结合了计算机视觉和机器翻译的最新进展,可用于生成描述图像的自然语句。模型的训练目标是在给定训练图像的条件下最大化目标描述语句的似然。在多个数据集上的实验表明了该模型的准确性,以及它仅从图像描述中学到的语言的流畅性。我们的模型通常相当准确,我们从定性和定量两方面进行了验证。例如,在Pascal数据集上,当前最先进的BLEU-1分数(越高越好)为25,而我们的方法达到59,可与约69的人类水平相比较;我们还展示了BLEU-1分数在Flickr30k上从56提高到66,在SBU上从19提高到28。最后,在新发布的COCO数据集上,我们取得了27.7的BLEU-4,这是当前的最高水平。
下载地址 | 返回目录 | [10.1109/CVPR.2015.7298935]
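下面给出摘要中“CNN编码图像、LSTM解码句子并最大化目标描述似然”这一思路的极简示意。假设使用PyTorch;编码器用一个小型CNN代替论文中的大型图像分类网络,仅供参考。

```python
import torch
import torch.nn as nn

class ShowAndTell(nn.Module):
    def __init__(self, vocab, emb=256, hidden=512):
        super().__init__()
        self.cnn = nn.Sequential(                      # 图像编码器(示例结构)
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, emb))
        self.embed = nn.Embedding(vocab, emb)
        self.lstm = nn.LSTM(emb, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab)

    def forward(self, image, caption_in):
        img_feat = self.cnn(image).unsqueeze(1)        # 图像特征作为第0步输入
        seq = torch.cat([img_feat, self.embed(caption_in)], dim=1)
        h, _ = self.lstm(seq)
        return self.out(h)                             # 配合交叉熵损失即最大化似然

model = ShowAndTell(vocab=5000)
logits = model(torch.randn(2, 3, 224, 224), torch.randint(0, 5000, (2, 15)))
```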
[109] Show, attend and tell: Neural image caption generation with visual attention (2015)
Abstract: Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using standard backpropagation techniques and stochastically by maximizing a variational lower bound. We also show through visualization how the model is able to automatically learn to fix its gaze on salient objects while generating the corresponding words in the output sequence. We validate the use of attention with state-of-the-art performance on three benchmark datasets: Flickr8k, Flickr30k and MS COCO.
摘要: 受机器翻译和目标检测领域最新工作的启发,我们引入了一种基于注意力的模型,它可以自动学习描述图像的内容。我们描述了如何用标准反向传播技术以确定性方式训练该模型,以及如何通过最大化变分下界进行随机训练。我们还通过可视化展示了模型在生成输出序列中相应单词的同时,能自动学会把注视固定在显著的物体上。我们在三个基准数据集(Flickr8k、Flickr30k和MS COCO)上以最先进的性能验证了注意力机制的有效性。
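下面用一段示意代码说明摘要中的软注意力机制:解码每个词时,对CNN特征图的各空间位置计算权重并加权求和得到上下文向量。假设使用PyTorch,写法属于常见的加性注意力,并非论文的原始实现。

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SoftAttention(nn.Module):
    def __init__(self, feat_dim, hidden_dim, attn_dim=256):
        super().__init__()
        self.feat_proj = nn.Linear(feat_dim, attn_dim)
        self.hidden_proj = nn.Linear(hidden_dim, attn_dim)
        self.score = nn.Linear(attn_dim, 1)

    def forward(self, feats, hidden):
        # feats: (batch, L, feat_dim),把CNN特征图展平成L个空间位置
        # hidden: (batch, hidden_dim),解码器上一步的隐状态
        e = self.score(torch.tanh(self.feat_proj(feats) +
                                  self.hidden_proj(hidden).unsqueeze(1)))
        alpha = F.softmax(e, dim=1)              # 每个位置的“注视”权重
        context = (alpha * feats).sum(dim=1)     # 加权求和得到上下文向量
        return context, alpha.squeeze(-1)

attn = SoftAttention(feat_dim=512, hidden_dim=512)
ctx, alpha = attn(torch.randn(2, 196, 512), torch.randn(2, 512))
```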
[110] Siamese Thesis (2015)
Abstract: The process of learning good features for machine learning applications can be very computationally expensive and may prove difficult in cases where little data is available. A prototypical example of this is the one-shot learning setting, in which we must correctly make predictions given only a single example of each new class. In this paper, we explore a method for learning siamese neural networks which employ a unique structure to naturally rank similarity between inputs. Once a network has been tuned, we can then capitalize on powerful discriminative features to generalize the predictive power of the network not just to new data, but to entirely new classes from unknown distributions. Using a convolutional architecture, we are able to achieve strong results which exceed those of other deep learning models with near state-of-the-art performance on one-shot classification tasks.
摘要: 为机器学习应用学习良好特征的过程可能计算代价很高,而且在可用数据极少时会变得困难。一个典型的例子是单样本(one-shot)学习:我们必须在每个新类别只给出一个样本的情况下做出正确预测。在本文中,我们探索了一种学习孪生(siamese)神经网络的方法,这种网络采用独特的结构对输入之间的相似度进行自然排序。网络调优完成后,我们便可以利用其强大的判别特征,把网络的预测能力不仅推广到新数据,还推广到来自未知分布的全新类别。使用卷积架构,我们在单样本分类任务上取得了接近最先进水平的强劲结果,超过了其他深度学习模型。
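下面是孪生网络相似度打分的一种常见实现示意:两路共享权重的卷积编码器,对两个编码的逐元素L1距离做逻辑回归,输出“同类”的概率,可用于单样本分类。假设使用PyTorch,网络结构与输入尺寸均为示例。

```python
import torch
import torch.nn as nn

class SiameseNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(            # 两路输入共用同一个编码器
            nn.Conv2d(1, 64, 10), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 7), nn.ReLU(), nn.MaxPool2d(2),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, 256))
        self.head = nn.Linear(256, 1)            # 对L1距离做相似度打分

    def forward(self, x1, x2):
        f1, f2 = self.encoder(x1), self.encoder(x2)
        return torch.sigmoid(self.head(torch.abs(f1 - f2)))

net = SiameseNet()
p_same = net(torch.randn(4, 1, 105, 105), torch.randn(4, 1, 105, 105))
```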
[111] Sim-to-Real Robot Learning from Pixels with Progressive Nets (2016)
Abstract: Applying end-to-end learning to solve complex, interactive, pixel-driven control tasks on a robot is an unsolved problem. Deep Reinforcement Learning algorithms are too slow to achieve performance on a real robot, but their potential has been demonstrated in simulated environments. We propose using progressive networks to bridge the reality gap and transfer learned policies from simulation to the real world. The progressive net approach is a general framework that enables reuse of everything from low-level visual features to high-level policies for transfer to new tasks, enabling a compositional, yet simple, approach to building complex skills. We present an early demonstration of this approach with a number of experiments in the domain of robot manipulation that focus on bridging the reality gap. Unlike other proposed approaches, our real-world experiments demonstrate successful task learning from raw visual input on a fully actuated robot manipulator. Moreover, rather than relying on model-based trajectory optimisation, the task learning is accomplished using only deep reinforcement learning and sparse rewards.
摘要: 用端到端学习来解决机器人上复杂、交互式、像素驱动的控制任务仍是一个未解决的问题。深度强化学习算法太慢,难以直接在真实机器人上达到所需性能,但其潜力已在仿真环境中得到证明。我们建议使用渐进式网络(progressive networks)来弥合现实差距,把学到的策略从仿真迁移到真实世界。渐进式网络是一个通用框架,可以复用从低级视觉特征到高级策略的一切内容并迁移到新任务,从而以组合式而又简单的方式构建复杂技能。我们通过机器人操纵领域中一系列着眼于弥合现实差距的实验,对该方法进行了早期演示。与其他已提出的方法不同,我们的真实世界实验展示了在全驱动机械臂上从原始视觉输入成功学习任务的能力。而且,任务学习不依赖基于模型的轨迹优化,而是仅通过深度强化学习和稀疏奖励完成。
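下面用一段概念性代码示意渐进式网络的核心机制:先训练并冻结第1列(例如在仿真中),再为新任务新建第2列,并通过横向连接复用第1列的中间特征。假设使用PyTorch,这里用两层全连接代替论文中的卷积策略网络,仅为说明思路。

```python
import torch
import torch.nn as nn

class ProgressiveColumn2(nn.Module):
    def __init__(self, in_dim=64, hidden=128, n_actions=4):
        super().__init__()
        # 第1列:假定已在仿真中训练完毕,参数冻结
        self.col1_l1 = nn.Linear(in_dim, hidden)
        self.col1_l2 = nn.Linear(hidden, hidden)
        for p in [*self.col1_l1.parameters(), *self.col1_l2.parameters()]:
            p.requires_grad = False
        # 第2列:在新任务(真实机器人)上训练,接收第1列的横向连接
        self.col2_l1 = nn.Linear(in_dim, hidden)
        self.col2_l2 = nn.Linear(hidden, hidden)
        self.lateral = nn.Linear(hidden, hidden)     # 第1列第1层 -> 第2列第2层
        self.policy = nn.Linear(hidden, n_actions)

    def forward(self, x):
        h1 = torch.relu(self.col1_l1(x))             # 冻结列的中间特征
        h2 = torch.relu(self.col2_l1(x))
        h2 = torch.relu(self.col2_l2(h2) + self.lateral(h1))
        return self.policy(h2)

net = ProgressiveColumn2()
logits = net(torch.randn(8, 64))
```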
[112] Sparse generative adversarial network (2019)
Abstract: We propose a new approach to Generative Adversarial Networks (GANs) to achieve an improved performance with additional robustness to its so-called and well-recognized mode collapse. We first proceed by mapping the desired data onto a frame-based space for a sparse representation to lift any limitation of small support features prior to learning the structure. To that end, we start by dividing an image into multiple patches and modifying the role of the generative network from producing an entire image, at once, to creating a sparse representation vector for each image patch. We synthesize an entire image by multiplying generated sparse representations to a pre-trained dictionary and assembling the resulting patches. This approach restricts the output of the generator to a particular structure, obtained by imposing a Union of Subspaces (UoS) model to the original training data, leading to more realistic images, while maintaining a desired diversity. To further regularize GANs in generating high-quality images and to avoid the notorious mode-collapse problem, we introduce a third player in GANs, called reconstructor. This player utilizes an auto-encoding scheme to ensure that first, the input-output relation in the generator is injective and second each real image corresponds to some input noise. We present a number of experiments, where the proposed algorithm shows a remarkably higher inception score compared to the equivalent conventional GANs.
摘要: 我们为生成对抗网络(GAN)提出了一种新方法,在提升性能的同时增强对其众所周知的模式崩溃问题的鲁棒性。我们首先把目标数据映射到一个基于框架(frame)的空间进行稀疏表示,以在学习结构之前解除对小支撑特征的限制。为此,我们先把图像划分为多个图像块,并把生成网络的职责从一次生成整幅图像改为为每个图像块生成一个稀疏表示向量。我们把生成的稀疏表示乘以一个预训练的字典并拼装得到的图像块,从而合成整幅图像。这种做法把生成器的输出约束到一种特定结构上(该结构通过对原始训练数据施加子空间并集(UoS)模型得到),从而在保持所需多样性的同时生成更逼真的图像。为了进一步正则化GAN以生成高质量图像并避免臭名昭著的模式崩溃问题,我们在GAN中引入了第三个参与者,称为重构器(reconstructor)。它利用自编码方案确保:第一,生成器中的输入-输出关系是单射的;第二,每张真实图像都对应某个输入噪声。我们给出了多组实验,所提算法的Inception分数明显高于同等规模的常规GAN。
下载地址 | 返回目录 | [10.1109/ICCVW.2019.00369]
[113] Spatial pyramid pooling in deep convolutional networks for visual recognition (2014)
Abstract: Existing deep convolutional neural networks (CNNs) require a fixed-size (e.g. 224×224) input image. This requirement is “artificial” and may hurt the recognition accuracy for the images or sub-images of an arbitrary size/scale. In this work, we equip the networks with a more principled pooling strategy, “spatial pyramid pooling”, to eliminate the above requirement. The new network structure, called SPP-net, can generate a fixed-length representation regardless of image size/scale. By removing the fixed-size limitation, we can improve all CNN-based image classification methods in general. Our SPP-net achieves state-of-the-art accuracy on the datasets of ImageNet 2012, Pascal VOC 2007, and Caltech101. The power of SPP-net is more significant in object detection. Using SPP-net, we compute the feature maps from the entire image only once, and then pool features in arbitrary regions (sub-images) to generate fixed-length representations for training the detectors. This method avoids repeatedly computing the convolutional features. In processing test images, our method computes convolutional features 30-170× faster than the recent leading method R-CNN (and 24-64× faster overall), while achieving better or comparable accuracy on Pascal VOC 2007. © 2014 Springer International Publishing.
摘要: 现有的深度卷积神经网络(CNN)需要固定尺寸(例如224×224)的输入图像。这一要求是“人为的”,并且可能损害对任意大小/比例的图像或子图像的识别精度。在这项工作中,我们为网络配备了一种更符合原则的池化策略,即“空间金字塔池化”,以消除上述限制。这种新的网络结构称为SPP-net,无论图像大小/比例如何,都能生成固定长度的表示。通过消除固定尺寸的限制,我们总体上可以改进所有基于CNN的图像分类方法。我们的SPP-net在ImageNet 2012、Pascal VOC 2007和Caltech101数据集上达到了最先进的精度。SPP-net的优势在目标检测中更为显著:我们只需对整幅图像计算一次特征图,然后在任意区域(子图像)上池化特征,生成用于训练检测器的固定长度表示,从而避免了重复计算卷积特征。在处理测试图像时,我们的方法计算卷积特征的速度比最近领先的R-CNN方法快30-170倍(整体快24-64倍),同时在Pascal VOC 2007上取得更好或相当的精度。© 2014 Springer International Publishing.
下载地址 | 返回目录 | [10.1007/978-3-319-10578-9_23]
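摘要中的空间金字塔池化可以写成一个与输入尺寸无关的池化函数:对特征图分别做1×1、2×2、4×4等多级自适应最大池化再拼接,得到固定长度表示。下面是一个假设使用PyTorch的最小示意。

```python
import torch
import torch.nn.functional as F

def spatial_pyramid_pool(feat, levels=(1, 2, 4)):
    # feat: (batch, C, H, W),H、W可以任意
    pooled = [F.adaptive_max_pool2d(feat, output_size=k).flatten(1)
              for k in levels]                       # 每一级输出 C*k*k 维
    return torch.cat(pooled, dim=1)                  # 固定长度:C*(1+4+16)

x = torch.randn(2, 256, 13, 17)                      # 任意空间尺寸的卷积特征
print(spatial_pyramid_pool(x).shape)                 # torch.Size([2, 5376])
```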
[114] Speech recognition with deep recurrent neural networks (2013)
Abstract: Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end training methods such as Connectionist Temporal Classification make it possible to train RNNs for sequence labelling problems where the input-output alignment is unknown. The combination of these methods with the Long Short-term Memory RNN architecture has proved particularly fruitful, delivering state-of-the-art results in cursive handwriting recognition. However RNN performance in speech recognition has so far been disappointing, with better results returned by deep feedforward networks. This paper investigates deep recurrent neural networks, which combine the multiple levels of representation that have proved so effective in deep networks with the flexible use of long range context that empowers RNNs. When trained end-to-end with suitable regularisation, we find that deep Long Short-term Memory RNNs achieve a test set error of 17.7% on the TIMIT phoneme recognition benchmark, which to our knowledge is the best recorded score. © 2013 IEEE.
摘要: 循环神经网络(RNN)是处理序列数据的强大模型。连接时序分类(CTC)等端到端训练方法使得在输入输出对齐未知的序列标注问题上训练RNN成为可能。这些方法与长短期记忆(LSTM)RNN架构的结合已被证明特别有效,在草书手写识别中取得了最先进的结果。然而到目前为止,RNN在语音识别中的表现一直令人失望,深度前馈网络反而取得了更好的结果。本文研究深度循环神经网络,它把在深度网络中被证明非常有效的多层次表示,与赋予RNN能力的长程上下文的灵活利用结合起来。在采用适当正则化进行端到端训练后,我们发现深层LSTM RNN在TIMIT音素识别基准上取得了17.7%的测试集错误率,据我们所知这是有记录以来的最好成绩。© 2013 IEEE。
下载地址 | 返回目录 | [10.1109/ICASSP.2013.6638947]
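下面按摘要中“深度双向LSTM加CTC目标函数”的组合给出一个最小训练片段示意。假设使用PyTorch;特征维度、层数与标签数均为示例。

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DeepBiLSTMCTC(nn.Module):
    def __init__(self, feat_dim=40, hidden=256, layers=3, n_labels=62):
        super().__init__()
        self.rnn = nn.LSTM(feat_dim, hidden, num_layers=layers,
                           bidirectional=True, batch_first=True)
        self.out = nn.Linear(hidden * 2, n_labels + 1)    # +1 为CTC的空白符

    def forward(self, x):
        h, _ = self.rnn(x)
        return F.log_softmax(self.out(h), dim=-1)

model = DeepBiLSTMCTC()
x = torch.randn(4, 100, 40)                  # (batch, 帧数, 声学特征维度)
log_probs = model(x).transpose(0, 1)         # CTCLoss要求形状为(T, batch, 类别数)
targets = torch.randint(1, 63, (4, 20))      # 标签取值避开空白符0
loss = nn.CTCLoss(blank=0)(log_probs, targets,
                           input_lengths=torch.full((4,), 100),
                           target_lengths=torch.full((4,), 20))
```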
[115] SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size (2016)
Abstract: Recent research on deep neural networks has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple DNN architectures that achieve that accuracy level. With equivalent accuracy, smaller DNN architectures offer at least three advantages: (1) Smaller DNNs require less communication across servers during distributed training. (2) Smaller DNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller DNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small DNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques we are able to compress SqueezeNet to less than 0.5MB (510x smaller than AlexNet). The SqueezeNet architecture is available for download here: https://github.com/DeepScale/SqueezeNet
摘要: 近年来关于深度神经网络的研究主要集中在提高准确率上。对于给定的准确率水平,通常可以找到多个能达到该水平的DNN架构。在准确率相当的情况下,较小的DNN架构至少有三个优点:(1)较小的DNN在分布式训练期间需要更少的跨服务器通信;(2)把新模型从云端下发到自动驾驶汽车所需的带宽更少;(3)更适合部署在内存有限的FPGA等硬件上。为兼得这些优点,我们提出了一种名为SqueezeNet的小型DNN架构。SqueezeNet以少50倍的参数在ImageNet上达到了AlexNet级别的准确率。此外,借助模型压缩技术,我们可以把SqueezeNet压缩到小于0.5MB(比AlexNet小510倍)。SqueezeNet架构可在此处下载:https://github.com/DeepScale/SqueezeNet
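SqueezeNet减少参数量的核心构件是Fire模块:先用1×1卷积“压缩”通道,再并行用1×1和3×3卷积“扩张”。下面是该模块的一个简化示意(假设PyTorch,通道数仅为示例)。

```python
import torch
import torch.nn as nn

class Fire(nn.Module):
    def __init__(self, in_ch, squeeze_ch, expand_ch):
        super().__init__()
        self.squeeze = nn.Conv2d(in_ch, squeeze_ch, kernel_size=1)      # 压缩
        self.expand1 = nn.Conv2d(squeeze_ch, expand_ch, kernel_size=1)  # 1x1扩张
        self.expand3 = nn.Conv2d(squeeze_ch, expand_ch, kernel_size=3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        s = self.relu(self.squeeze(x))
        return torch.cat([self.relu(self.expand1(s)),
                          self.relu(self.expand3(s))], dim=1)

fire = Fire(96, squeeze_ch=16, expand_ch=64)
y = fire(torch.randn(1, 96, 55, 55))         # 输出通道 = 64 + 64 = 128
```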
[116] Squeezed Very Deep Convolutional Neural Networks for Text Classification (2019)
Abstract: Embedding artificial intelligence on constrained platforms has become a trend since the growth of embedded systems and mobile devices, experimented in recent years. Although constrained platforms do not have enough processing capabilities to train a sophisticated deep learning model, like convolutional neural networks (CNN), they are already capable of performing inference locally by using a previously trained embedded model. This approach enables numerous advantages such as privacy, response latency, and no real time network dependence. Still, the use of a local CNN model on constrained platforms is restricted by its storage size. Most of the research in CNNs has focused on increasing network depth to improve accuracy. In the text classification area, deep models were proposed with excellent performance but relying on large architectures with thousands of parameters, and consequently, high storage size. We propose to modify the structure of the Very Deep Convolutional Neural Networks (VDCNN) model to reduce its storage size while keeping the model performance. In this paper, we evaluate the impact of Temporal Depthwise Separable Convolutions and Global Average Pooling in the network parameters, storage size, dedicated hardware dependence, and accuracy. The proposed squeezed model (SVDCNN) is between 10x and 20x smaller than the original version, depending on the network depth, maintaining a maximum disk size of 6MB. Regarding accuracy, the network experiences a loss between 0.4% and 1.3% in the accuracy performance while obtaining lower latency over non-dedicated hardware and higher inference time ratio compared to the baseline model.
摘要: 近年来,随着嵌入式系统和移动设备的发展,把人工智能嵌入受限平台已成为一种趋势。尽管受限平台没有足够的算力来训练复杂的深度学习模型(例如卷积神经网络,CNN),但借助预先训练好的嵌入式模型,它们已经能够在本地执行推理。这种方式带来了隐私、响应延迟、无需实时联网等诸多优势。然而,在受限平台上使用本地CNN模型仍受其存储大小的限制。CNN的大多数研究都集中在增加网络深度以提高准确率上。在文本分类领域,已提出的深度模型性能出色,但依赖于拥有数千参数的大型体系结构,存储开销因此很高。我们建议修改超深卷积神经网络(VDCNN)模型的结构,在保持模型性能的同时减少其存储大小。在本文中,我们评估了时间维深度可分离卷积和全局平均池化对网络参数量、存储大小、专用硬件依赖性和准确率的影响。所提出的压缩模型(SVDCNN)比原始版本小10到20倍(取决于网络深度),磁盘占用最大为6MB。在准确率方面,与基线模型相比,网络的准确率损失介于0.4%和1.3%之间,同时在非专用硬件上获得了更低的延迟和更高的推理时间比。
下载地址 | 返回目录 | [10.1007/978-3-030-30487-4_16]
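下面示意摘要中提到的两个压缩手段:时间维的深度可分离卷积(depthwise加pointwise两步)以及用全局平均池化代替“展平加全连接”。假设使用PyTorch,通道数与序列长度仅为示例。

```python
import torch
import torch.nn as nn

class SeparableTemporalConv(nn.Module):
    def __init__(self, channels, out_channels, kernel=3):
        super().__init__()
        # depthwise:每个通道单独卷积;pointwise:1x1卷积混合通道
        self.depthwise = nn.Conv1d(channels, channels, kernel,
                                   padding=kernel // 2, groups=channels)
        self.pointwise = nn.Conv1d(channels, out_channels, kernel_size=1)

    def forward(self, x):                        # x: (batch, 通道, 序列长度)
        return self.pointwise(self.depthwise(x))

block = SeparableTemporalConv(64, 128)
x = torch.randn(8, 64, 1014)                     # 字符级文本的嵌入序列(示例)
h = block(x)
pooled = h.mean(dim=-1)                          # 全局平均池化 -> (8, 128)
```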
[117] Studies on Ozone-oxidation of Dye in a Bubble Column Reactor at Different pH and Different Oxidation-reduction Potential. (2010)
Abstract: We show how to use “complementary priors” to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.
摘要: 我们展示了如何使用“互补先验”来消除“解释消除”(explaining-away)效应,这种效应使得在具有许多隐藏层的稠密连接信念网中难以进行推理。利用互补先验,我们推导出一种快速的贪心算法:只要最顶上的两层构成无向联想记忆,就可以一次一层地学习深层有向信念网络。这一快速贪心算法用于初始化一个较慢的学习过程,后者使用wake-sleep算法的对比版本对权重进行微调。经过微调后,一个具有三个隐藏层的网络构成了手写数字图像及其标签联合分布的优秀生成模型,其数字分类效果优于最好的判别式学习算法。数字所在的低维流形由顶层联想记忆自由能景观中的长沟壑来建模,通过有向连接把联想记忆“心中所想”显示出来,就可以很容易地探索这些沟壑。
下载地址 | 返回目录 | [10.7763/ijesd.2010.v1.67]
[118] Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours (2016)
Abstract: Current model free learning-based robot grasping approaches exploit human-labeled datasets for training the models. However, there are two problems with such a methodology: (a) since each object can be grasped in multiple ways, manually labeling grasp locations is not a trivial task; (b) human labeling is biased by semantics. While there have been attempts to train robots using trial-and-error experiments, the amount of data used in such experiments remains substantially low and hence makes the learner prone to over-fitting. In this paper, we take the leap of increasing the available training data to 40 times more than prior work, leading to a dataset size of 50K data points collected over 700 hours of robot grasping attempts. This allows us to train a Convolutional Neural Network (CNN) for the task of predicting grasp locations without severe overfitting. In our formulation, we recast the regression problem to an 18-way binary classification over image patches. We also present a multi-stage learning approach where a CNN trained in one stage is used to collect hard negatives in subsequent stages. Our experiments clearly show the benefit of using large-scale datasets (and multi-stage training) for the task of grasping. We also compare to several baselines and show state-of-the-art performance on generalization to unseen objects for grasping.
摘要: 当前无模型(model-free)的基于学习的机器人抓取方法利用人工标注的数据集来训练模型。但这种做法存在两个问题:(a)由于每个物体都可以用多种方式抓取,手动标注抓取位置并非易事;(b)人工标注会受语义影响而带有偏差。虽然已有利用试错实验训练机器人的尝试,但此类实验使用的数据量仍然很小,容易使学习器过拟合。在本文中,我们把可用训练数据提高到此前工作的40倍,得到一个在700小时机器人抓取尝试中收集的5万个数据点的数据集。这使我们可以训练一个卷积神经网络(CNN)来预测抓取位置而不会严重过拟合。在我们的表述中,我们把回归问题改写为图像块上的18路二分类。我们还提出了一种多阶段学习方法,用前一阶段训练的CNN在后续阶段收集困难负样本(hard negatives)。我们的实验清楚地表明了大规模数据集(及多阶段训练)对抓取任务的好处。我们还与若干基线进行比较,并在对未见过物体的抓取泛化上展示了最先进的性能。
下载地址 | 返回目录 | [10.1109/ICRA.2016.7487517]
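摘要中“把抓取角度回归改写为图像块上的18路二分类”可示意如下:网络对每个图像块输出18个角度区间各自的可抓取logit,配合逐元素二元交叉熵训练。假设使用PyTorch,网络结构为简化示例。

```python
import torch
import torch.nn as nn

class GraspPatchNet(nn.Module):
    def __init__(self, n_angle_bins=18):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.head = nn.Linear(64, n_angle_bins)      # 每个角度区间一个logit

    def forward(self, patch):
        return self.head(self.features(patch))

net = GraspPatchNet()
logits = net(torch.randn(16, 3, 227, 227))
loss = nn.BCEWithLogitsLoss()(logits, torch.randint(0, 2, (16, 18)).float())
```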
[119] Target-driven visual navigation in indoor scenes using deep reinforcement learning (2017)
Abstract: Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new goals, and (2) data inefficiency, i.e., the model requires several (and often costly) episodes of trial and error to converge, which makes it impractical to be applied to real-world scenarios. In this paper, we address these two issues and apply our model to target-driven visual navigation. To address the first issue, we propose an actor-critic model whose policy is a function of the goal as well as the current state, which allows better generalization. To address the second issue, we propose the AI2-THOR framework, which provides an environment with high-quality 3D scenes and a physics engine. Our framework enables agents to take actions and interact with objects. Hence, we can collect a huge number of training samples efficiently. We show that our proposed method (1) converges faster than the state-of-the-art deep reinforcement learning methods, (2) generalizes across targets and scenes, (3) generalizes to a real robot scenario with a small amount of fine-tuning (although the model is trained in simulation), (4) is end-to-end trainable and does not need feature engineering, feature matching between frames or 3D reconstruction of the environment.
摘要: 深度强化学习中两个较少被讨论的问题是:(1)缺乏对新目标的泛化能力;(2)数据效率低下,即模型需要多轮(往往代价高昂)的试错才能收敛,因而难以应用于真实场景。在本文中,我们针对这两个问题展开研究,并把模型应用于目标驱动的视觉导航。针对第一个问题,我们提出一种演员-评论家(actor-critic)模型,其策略同时是目标和当前状态的函数,从而获得更好的泛化能力。针对第二个问题,我们提出AI2-THOR框架,它提供了带有高质量3D场景和物理引擎的环境。我们的框架使智能体能够采取行动并与物体交互,因此可以高效地收集大量训练样本。我们表明,所提出的方法:(1)比最新的深度强化学习方法收敛更快;(2)能跨目标和场景泛化;(3)尽管模型是在仿真中训练的,只需少量微调即可推广到真实机器人场景;(4)可端到端训练,不需要特征工程、帧间特征匹配或环境的3D重建。
下载地址 | 返回目录 | [10.1109/ICRA.2017.7989381]
[120] Teaching machines to read and comprehend (2015)
Abstract: Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large scale training and test datasets have been missing for this type of evaluation. In this work we define a new methodology that resolves this bottleneck and provides large scale supervised reading comprehension data. This allows us to develop a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure.
摘要: 让机器阅读自然语言文档仍然是一个难以实现的挑战。可以通过让机器阅读系统回答针对其所读文档内容提出的问题来测试其能力,但此前此类评估一直缺少大规模的训练和测试数据集。在这项工作中,我们定义了一种新方法来解决这一瓶颈,并提供大规模的有监督阅读理解数据。这使我们能够开发一类基于注意力的深度神经网络,它们只需极少的语言结构先验知识即可学会阅读真实文档并回答复杂问题。
[121] Texture networks: Feed-forward synthesis of textures and stylized images (2016)
Abstract: Gatys et al. recently demonstrated that deep networks can generate beautiful textures and stylized images from a single texture example. However, their methods require a slow and memory-consuming optimization process. We propose here an alternative approach that moves the computational burden to a learning stage. Given a single example of a texture, our approach trains compact feed-forward convolutional networks to generate multiple samples of the same texture of arbitrary size and to transfer artistic style from a given image to any other image. The resulting networks are remarkably light-weight and can generate textures of quality comparable to Gatys et al., but hundreds of times faster. More generally, our approach highlights the power and flexibility of generative feed-forward models trained with complex and expressive loss functions.
摘要: Gatys等人最近证明,深度网络可以仅凭单个纹理样例生成漂亮的纹理和风格化图像。然而,他们的方法需要缓慢且消耗内存的优化过程。我们在此提出一种替代方案,把计算负担转移到学习阶段。给定单个纹理样例,我们的方法训练紧凑的前馈卷积网络,既能生成同一纹理的任意大小的多个样本,也能把给定图像的艺术风格迁移到任何其他图像。得到的网络非常轻量,能生成质量与Gatys等人相当的纹理,但速度快数百倍。更一般地,我们的方法凸显了用复杂且富有表达力的损失函数训练的前馈生成模型的能力与灵活性。
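纹理网络这类前馈生成方法的训练目标通常建立在Gatys式的特征统计匹配损失之上,其核心是卷积特征的Gram矩阵。下面只给出Gram矩阵计算的示意(假设PyTorch),特征提取网络与完整训练流程从略。

```python
import torch

def gram_matrix(feat):
    # feat: (batch, C, H, W) 的卷积特征
    b, c, h, w = feat.shape
    f = feat.reshape(b, c, h * w)
    return f @ f.transpose(1, 2) / (c * h * w)   # (batch, C, C),通道间相关统计

g = gram_matrix(torch.randn(2, 64, 32, 32))
```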
[122] Toward Human Parity in Conversational Speech Recognition (2017)
Abstract: Conversational speech recognition has served as a flagship speech recognition task since the release of the Switchboard corpus in the 1990s. In this paper, we measure a human error rate on the widely used NIST 2000 test set for commercial bulk transcription. The error rate of professional transcribers is 5.9% for the Switchboard portion of the data, in which newly acquainted pairs of people discuss an assigned topic, and 11.3% for the CallHome portion, where friends and family members have open-ended conversations. In both cases, our automated system edges past the human benchmark, achieving error rates of 5.8% and 11.0%.
摘要: 自1990年代Switchboard语料库发布以来,会话语音识别一直是一项旗舰语音识别任务。在本文中,我们在广泛用于商业批量转录的NIST 2000测试集上测量了人工错误率。专业转录员在数据的Switchboard部分(由新认识的两人讨论指定话题)错误率为5.9%,在CallHome部分(朋友与家人之间的开放式对话)错误率为11.3%。在这两种情况下,我们的自动化系统都略微超过了人工基准,错误率分别为5.8%和11.0%。
下载地址 | 返回目录 | [10.1109/TASLP.2017.2756440]
[123] Towards AI-complete question answering: A set of prerequisite toy tasks (2016)
Abstract: One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progress towards that goal, we argue for the usefulness of a set of proxy tasks that evaluate reading comprehension via question answering. Our tasks measure understanding in several ways: whether a system is able to answer questions via chaining facts, simple induction, deduction and many more. The tasks are designed to be prerequisites for any system that aims to be capable of conversing with a human. We believe many existing learning systems can currently not solve them, and hence our aim is to classify these tasks into skill sets, so that researchers can identify (and then rectify) the failings of their systems. We also extend and improve the recently introduced Memory Networks model, and show it is able to solve some, but not all, of the tasks.
摘要: 机器学习研究的一个长期目标是产生适用于推理和自然语言的方法,尤其是构建智能对话代理。为了衡量朝这一目标的进展,我们论证了一组通过问答来评估阅读理解能力的代理任务的有用性。我们的任务从多个方面衡量理解能力:系统是否能够通过事实链接、简单归纳、演绎等方式回答问题。这些任务被设计为任何旨在与人类对话的系统的先决条件。我们认为许多现有学习系统目前还无法解决它们,因此我们的目标是把这些任务按技能集合分类,以便研究人员识别(进而修正)其系统的不足。我们还扩展并改进了最近提出的记忆网络(Memory Networks)模型,并表明它能够解决其中一部分(但不是全部)任务。
[124] Towards end-to-end speech recognition with recurrent neural networks (2014)
Abstract: This paper presents a speech recognition system that directly transcribes audio data with text, without requiring an intermediate phonetic representation. The system is based on a combination of the deep bidirectional LSTM recurrent neural network architecture and the Connectionist Temporal Classification objective function. A modification to the objective function is introduced that trains the network to minimise the expectation of an arbitrary transcription loss function. This allows a direct optimisation of the word error rate, even in the absence of a lexicon or language model. The system achieves a word error rate of 27.3% on the Wall Street Journal corpus with no prior linguistic information, 21.9% with only a lexicon of allowed words, and 8.2% with a trigram language model. Combining the network with a baseline system further reduces the error rate to 6.7%.
摘要: 本文提出了一种直接把音频数据转录为文本的语音识别系统,不需要中间的音素表示。该系统基于深度双向LSTM循环神经网络架构与连接时序分类(CTC)目标函数的组合。我们对目标函数做了一处修改,训练网络去最小化任意转录损失函数的期望,从而即使在没有词典或语言模型的情况下,也能直接优化词错误率。该系统在《华尔街日报》语料库上,在没有任何先验语言信息时词错误率为27.3%,只使用允许单词的词典时为21.9%,使用三元(trigram)语言模型时为8.2%。把该网络与基线系统结合,可把错误率进一步降低到6.7%。
[125] Transferring Rich Feature Hierarchies for Robust Visual Tracking (2015)
Abstract: Convolutional neural network (CNN) models have demonstrated great success in various computer vision tasks including image classification and object detection. However, some equally important tasks such as visual tracking remain relatively unexplored. We believe that a major hurdle that hinders the application of CNN to visual tracking is the lack of properly labeled training data. While existing applications that liberate the power of CNN often need an enormous amount of training data in the order of millions, visual tracking applications typically have only one labeled example in the first frame of each video. We address this research issue here by pre-training a CNN offline and then transferring the rich feature hierarchies learned to online tracking. The CNN is also fine-tuned during online tracking to adapt to the appearance of the tracked target specified in the first video frame. To fit the characteristics of object tracking, we first pre-train the CNN to recognize what is an object, and then propose to generate a probability map instead of producing a simple class label. Using two challenging open benchmarks for performance evaluation, our proposed tracker has demonstrated substantial improvement over other state-of-the-art trackers.
摘要: 卷积神经网络(CNN)模型已在图像分类和目标检测等各种计算机视觉任务中取得了巨大成功,但视觉跟踪等同样重要的任务仍相对未被充分探索。我们认为阻碍CNN应用于视觉跟踪的主要障碍是缺乏正确标注的训练数据:现有发挥CNN能力的应用通常需要数以百万计的训练数据,而视觉跟踪应用通常只有每段视频第一帧中的一个标注样本。我们通过先离线预训练CNN、再把学到的丰富特征层次迁移到在线跟踪中来解决这一研究问题。在线跟踪过程中还会对CNN进行微调,以适应第一帧中指定的跟踪目标的外观。为了契合目标跟踪的特点,我们先预训练CNN去识别“什么是物体”,然后提出生成概率图而不是简单的类别标签。在两个具有挑战性的公开基准上进行的性能评估表明,我们提出的跟踪器比其他最先进的跟踪器有显著改进。
[126] Unsupervised representation learning with deep convolutional generative adversarial networks (2016)
Abstract: In recent years, supervised learning with convolutional networks (CNNs) has seen huge adoption in computer vision applications. Comparatively, unsupervised learning with CNNs has received less attention. In this work we hope to help bridge the gap between the success of CNNs for supervised learning and unsupervised learning. We introduce a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrate that they are a strong candidate for unsupervised learning. Training on various image datasets, we show convincing evidence that our deep convolutional adversarial pair learns a hierarchy of representations from object parts to scenes in both the generator and discriminator. Additionally, we use the learned features for novel tasks - demonstrating their applicability as general image representations.
摘要: 近年来,使用卷积网络(CNN)的监督学习在计算机视觉应用中得到了广泛采用,相比之下,CNN的无监督学习受到的关注较少。在这项工作中,我们希望帮助弥合CNN在监督学习上的成功与无监督学习之间的差距。我们提出了一类称为深度卷积生成对抗网络(DCGAN)的CNN,它们带有特定的架构约束,并证明它们是无监督学习的有力候选方法。通过在多个图像数据集上训练,我们给出了令人信服的证据,表明我们的深度卷积对抗网络对在生成器和判别器中都学到了从物体局部到场景的表示层次。此外,我们把学到的特征用于新任务,展示了它们作为通用图像表示的适用性。
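下面按DCGAN常见的架构约束(转置卷积上采样、BatchNorm、生成器末层用Tanh)给出生成器的一个简化示意。假设使用PyTorch,通道数与输出分辨率仅为示例,并非论文的原始配置。

```python
import torch
import torch.nn as nn

generator = nn.Sequential(
    # 把100维噪声逐步上采样为3通道图像
    nn.ConvTranspose2d(100, 256, 4, 1, 0, bias=False), nn.BatchNorm2d(256), nn.ReLU(True),
    nn.ConvTranspose2d(256, 128, 4, 2, 1, bias=False), nn.BatchNorm2d(128), nn.ReLU(True),
    nn.ConvTranspose2d(128, 64, 4, 2, 1, bias=False),  nn.BatchNorm2d(64),  nn.ReLU(True),
    nn.ConvTranspose2d(64, 3, 4, 2, 1, bias=False),    nn.Tanh())

z = torch.randn(16, 100, 1, 1)        # 噪声向量
fake_images = generator(z)            # (16, 3, 32, 32)
```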
[127] Very deep convolutional networks for large-scale image recognition (2015)
Abstract: In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3 × 3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16–19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
摘要: 在这项工作中,我们研究了卷积网络深度对其在大规模图像识别任务中精度的影响。我们的主要贡献是使用带有非常小的(3×3)卷积滤波器的架构,对不断加深的网络进行了全面评估;结果表明,把深度增加到16-19个权重层可以显著超越现有配置。这些发现是我们提交2014年ImageNet挑战赛的基础,我们的团队在定位和分类赛道上分别获得第一名和第二名。我们还表明,我们的表示可以很好地泛化到其他数据集,并在这些数据集上取得最先进的结果。我们已公开了两个表现最好的ConvNet模型,以促进对深度视觉表示在计算机视觉中应用的进一步研究。
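VGG的核心设计是反复堆叠3×3小卷积(保持分辨率)后接2×2最大池化,通过加深网络提升精度。下面用一个构造“卷积块”的小函数示意这种堆叠方式(假设PyTorch,通道数与块数仅为示例)。

```python
import torch
import torch.nn as nn

def vgg_block(in_ch, out_ch, n_convs):
    layers = []
    for i in range(n_convs):
        layers += [nn.Conv2d(in_ch if i == 0 else out_ch, out_ch, 3, padding=1),
                   nn.ReLU(inplace=True)]
    layers.append(nn.MaxPool2d(2))                 # 每个块结束时分辨率减半
    return nn.Sequential(*layers)

# 类似VGG前几段的堆叠方式
features = nn.Sequential(vgg_block(3, 64, 2), vgg_block(64, 128, 2),
                         vgg_block(128, 256, 3))
out = features(torch.randn(1, 3, 224, 224))        # (1, 256, 28, 28)
```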
[128] You only look once: Unified, real-time object detection (2016)
Abstract: We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background. Finally, YOLO learns very general representations of objects. It outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
摘要: 我们提出YOLO,一种新的目标检测方法。此前的目标检测工作是把分类器改造来执行检测;我们则把目标检测视为对空间上分离的边界框及其类别概率的回归问题。单个神经网络在一次评估中直接从整幅图像预测边界框和类别概率。由于整个检测流水线是单个网络,因此可以直接针对检测性能进行端到端优化。我们的统一架构非常快:基础YOLO模型以每秒45帧的速度实时处理图像;更小的网络Fast YOLO每秒可处理惊人的155帧,mAP仍是其他实时检测器的两倍。与最先进的检测系统相比,YOLO的定位错误更多,但在背景上预测出假阳性的可能性更小。最后,YOLO学到了非常通用的物体表示;在从自然图像泛化到艺术作品等其他领域时,它的表现优于DPM和R-CNN等其他检测方法。
下载地址 | 返回目录 | [10.1109/CVPR.2016.91]
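下面示意YOLO“单个网络一次前向同时回归边界框与类别概率”的输出张量布局:每个S×S网格单元预测B个框(各含x、y、w、h、置信度)外加C个类别概率。假设使用PyTorch;这里取S=7、B=2、C=20仅作示例,主干网络是极度简化的占位结构。

```python
import torch
import torch.nn as nn

S, B, C = 7, 2, 20                       # 网格数、每格框数、类别数(示例取值)

class TinyYOLOHead(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(   # 占位主干,实际应为深层卷积网络
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.LeakyReLU(0.1),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.LeakyReLU(0.1),
            nn.AdaptiveAvgPool2d(S), nn.Flatten())
        self.fc = nn.Linear(32 * S * S, S * S * (B * 5 + C))

    def forward(self, img):
        pred = self.fc(self.backbone(img))
        # 每个网格单元: [x, y, w, h, 置信度] * B 个框 + C 个类别概率
        return pred.view(-1, S, S, B * 5 + C)

net = TinyYOLOHead()
pred = net(torch.randn(2, 3, 448, 448))  # (2, 7, 7, 30)
```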
中文摘要仅供参考