机器人流程自动化-机器人流程自动化简写-机器人流程自动化简写
[]]它主要用于流程的自动化处理。
机器人流程自动化(RPA)是一种软件技术。它利用自动化机器人去模拟人类在计算机上的操作,从而执行重复性任务。并且,它能通过软件机器人自动执行基于规则的业务流程,减少人为操作,进而提高效率和准确性。
简单理解:
如果一个任务是重复性质且基于规则的,像打开 Excel 来处理数据,或者登录网站去提取信息,又或者发送邮件等这类情况,那么 RPA 机器人就能够代替人工去完成这些任务。
- 具备可审计、可追踪性,所有操作都可被记录,满足合规要求。- 控制中心():用于对多个机器人进行管理,比如。1. 任务介绍。
我们要利用 RPA 机器人来自动访问一个特定的网页,将网页中的数据(例如新闻标题)提取出来,然后把这些数据存入 Excel 中。
3.2 使用RPA 工具: +安装依赖
<p><pre> <code class="prism language-bash">pip <span class="token function">install</span> pyautogui selenium openpyxl
</code></pre></p>
3.3 代码实现
<p><pre> <code class="prism language-python"><span class="token keyword">import</span> time
<span class="token keyword">import</span> pyautogui
<span class="token keyword">from</span> selenium <span class="token keyword">import</span> webdriver
<span class="token keyword">from</span> openpyxl <span class="token keyword">import</span> Workbook
<span class="token comment">启动 WebDriver ,并且要确保已经安装了 ChromeDriver 。</span>
driver <span class="token operator">=</span> webdriver<span class="token punctuation">.</span>Chrome<span class="token punctuation">(</span><span class="token punctuation">)</span>
driver<span class="token punctuation">.</span>get<span class="token punctuation">(</span><span class="token string">“https://news.ycombinator.com/”这个链接。它指向一个特定的网络地址。该地址用于访问相关的新闻或资讯内容。许多人会通过这个链接来获取特定的信息。</span><span class="token punctuation">)</span><span class="token comment"># 访问新闻网站</span>
time<span class="token punctuation">.</span>sleep<span class="token punctuation">(</span><span class="token number">2</span><span class="token punctuation">)</span><span class="token comment"># 等待网页加载</span>
<span class="token comment"># 获取新闻标题</span>
titles <span class="token operator">=</span> driver<span class="token punctuation">.</span>find_elements<span class="token punctuation">(</span><span class="token string">"css selector"</span><span class="token punctuation">,</span> <span class="token string">".storylink"</span><span class="token punctuation">)</span>
<span class="token comment"># 创建 Excel 文件</span>
wb <span class="token operator">=</span> Workbook<span class="token punctuation">(</span><span class="token punctuation">)</span>
ws <span class="token operator">=</span> wb<span class="token punctuation">.</span>active
https://img0.baidu.com/it/u=770096206,4188466364&fm=253&fmt=JPEG&app=138&f=JPEG?w=889&h=500
ws<span class="token punctuation">.</span>append<span class="token punctuation">(</span><span class="token punctuation">[</span><span class="token string">"新闻标题"</span><span class="token punctuation">]</span><span class="token punctuation">)</span>
<span class="token comment"># 提取并写入 Excel</span>
<span class="token keyword">for</span> title <span class="token keyword">in</span> titles<span class="token punctuation">:</span>
ws<span class="token punctuation">.</span>append<span class="token punctuation">(</span><span class="token punctuation">[</span>title<span class="token punctuation">.</span>text<span class="token punctuation">]</span><span class="token punctuation">)</span>
<span class="token comment"># 保存 Excel</span>
wb<span class="token punctuation">.</span>save<span class="token punctuation">(</span><span class="token string">"news.xlsx"</span><span class="token punctuation">)</span>
<span class="token comment"># 关闭浏览器</span>
driver<span class="token punctuation">.</span>quit<span class="token punctuation">(</span><span class="token punctuation">)</span>
<span class="token keyword">print</span><span class="token punctuation">(</span><span class="token string">"数据抓取完成!"</span><span class="token punctuation">)</span>
</code></pre></p>
代码解析方面,其中有一点是 RPA 结合 AI 能够实现智能自动化,具体而言,4.1 部分是 OCR 识别发票。
RPA 有时需要处理图片或扫描件,比如要自动提取发票信息,这种情况下可以将其与 OCR(光学字符识别)技术相结合。
安装OCR
<p><pre> <code class="prism language-bash">pip <span class="token function">install</span> pytesseract pillow
</code></pre></p>
代码示例
<p><pre> <code class="prism language-python"><span class="token keyword">from</span> PIL <span class="token keyword">import</span> Image
<span class="token keyword">import</span> pytesseract
img <span class="token operator">=</span> Image<span class="token punctuation">.</span><span class="token builtin">open</span><span class="token punctuation">(</span><span class="token string">"invoice.png"</span><span class="token punctuation">)</span><span class="token comment"># 读取发票图片</span>
text <span class="token operator">=</span> pytesseract<span class="token punctuation">.</span>image_to_string<span class="token punctuation">(</span>img<span class="token punctuation">)</span>
https://img0.baidu.com/it/u=3456533217,1745153337&fm=253&fmt=JPEG&app=138&f=JPEG?w=889&h=500
<span class="token keyword">print</span><span class="token punctuation">(</span><span class="token string">"OCR 识别结果:"</span><span class="token punctuation">,</span> text<span class="token punctuation">)</span>
</code></pre></p>
4.2 NLP 处理邮件自动分类
如果 RPA 机器人需要对邮件进行分类,像“发票类邮件”以及“投诉类邮件”等。那么它可以将 NLP(自然语言处理)技术与之结合,从而实现智能分类。
安装
<p><pre> <code class="prism language-bash">pip <span class="token function">install</span> transformers
</code></pre></p>
代码示例
<p><pre> <code class="prism language-python"><span class="token keyword">from</span> transformers <span class="token keyword">import</span> pipeline
classifier <span class="token operator">=</span> pipeline<span class="token punctuation">(</span><span class="token string">"text-classification"</span><span class="token punctuation">)</span>
email_text <span class="token operator">=</span> <span class="token string"></span>
result <span class="token operator">=</span> classifier<span class="token punctuation">(</span>email_text<span class="token punctuation">)</span>
<span class="token keyword">print</span><span class="token punctuation">(</span>result<span class="token punctuation">)</span>
</code></pre></p>
6.1 云端的 RPA
许多 RPA 供应商都有提供 SaaS 这种形式,也就是云端 RPA。像 Cloud 这样的,能够实现对机器人的远程管理。
6.2 低代码 RPA
微软 Power 提供了一种方式,这种方式无需编写代码或者只需编写少量代码,使得非技术人员能够创建自动化流程。
6.3 AI + RPA
RPA 机器人结合 AI(例如)之后,能够处理更为复杂的任务,例如:
七、总结
本文介绍了 RPA 的基本概念,也介绍了其应用场景。并且通过代码实现了网页数据抓取这一实战案例,还实现了 OCR 识别发票这一实战案例,以及实现了邮件智能分类这一实战案例。
未来,RPA 会变得越来越智能。它如果结合 AI 的话,就能够处理更为复杂的任务。
建议学习平台,如 Blue Prism 等,然后进行深度实践。
页:
[1]