2. InfluxDB关键概念-低调大师

2. InfluxDB关键概念

2020-09-06 645

InfluxDB前篇介绍

Centos7 下 InfluxDB 从安装开始到入门

前一篇根据InfluxDB的官方开源文档进行了一次实践。这篇来继续看看InfluxDB的关键概念。

喜欢看英文开源文档的，可以访问InfluxDB key concepts，直接阅读关键概念。

如果不喜欢直接看英文的，就继续看我下面的翻译后描述吧。

InfluxDB的关键概念

在深入了解InfluxDB之前，熟悉数据库的一些关键概念是很好的。本文档简要介绍了这些概念和通用的InfluxDB术语。下面列出了涵盖的所有术语，但建议您从头到尾阅读本文档，以便更全面地了解我们最喜欢的时间序列数据库。

database	field key	field set
field value	measurement	point
retention policy	series	tag key
tag set	tag value	timestamp

如果想要更加详细地了解术语的定义，请查看术语表。

样本数据 Sample data

name: census（普查）

time	butterflies	honeybees	location	scientist
2015-08-18T00:00:00Z	12	23	1	langstroth
2015-08-18T00:00:00Z	1	30	1	perpetua
2015-08-18T00:06:00Z	11	28	1	langstroth
2015-08-18T00:06:00Z	3	28	1	perpetua
2015-08-18T05:54:00Z	2	11	2	langstroth
2015-08-18T06:00:00Z	1	10	2	langstroth
2015-08-18T06:06:00Z	8	23	2	perpetua
2015-08-18T06:12:00Z	7	22	2	perpetua

上面的样例数据表示两个科学家 langstroth 和 perpetua 在不同时间点以及不同地点记录蝴蝶和蜜蜂的数量。

将样本数据插入到influxDB中

root@d2918dc47850:/# influx
Connected to http://localhost:8086 version 1.7.2
InfluxDB shell version: 1.7.2
Enter an InfluxQL query
> show databases
name: databases
name
----
_internal
mydb
>
> use mydb
Using database mydb
>
>
> insert census,scientist=langstroth,location=1 butterflies=12,honeybees=23
> insert census,scientist=perpetua,location=1 butterflies=1,honeybees=30
> insert census,scientist=langstroth,location=1 butterflies=11,honeybees=28
> insert census,scientist=perpetua,location=1 butterflies=3,honeybees=28
> insert census,scientist=langstroth,location=2 butterflies=2,honeybees=11
> insert census,scientist=perpetua,location=2 butterflies=1,honeybees=10
> insert census,scientist=langstroth,location=2 butterflies=8,honeybees=23
> insert census,scientist=perpetua,location=2 butterflies=7,honeybees=22
>
> select * from census
name: census
time                butterflies honeybees location scientist
----                ----------- --------- -------- ---------
1546741552382793960 12          23        1        langstroth
1546741591954384804 1           30        1        perpetua
1546741614036950839 11          28        1        langstroth
1546741636651092337 3           28        1        perpetua
1546741656423108444 2           11        2        langstroth
1546741670749604756 1           10        2        perpetua
1546741686055646710 8           23        2        langstroth
1546741704010462064 7           22        2        perpetua
>

样本数据字段的含义说明

time : 在上面的数据中有一个名为time- InfluxDB中的所有数据都有该列。 time存储时间戳，以及timestamp以 RFC3339 UTC显示与特定数据关联的日期和时间。
butterflies和honeybees字段（fields）：这两个字段(fields)由字段键(field keys )和字段值(field values)组成。 字段键(field keys ) : butterflies和honeybees 则是表的字段名；字段值(field values)：可以是字符串，浮点数，整数或布尔值，并且由于InfluxDB是时间序列数据库，因此字段值始终与时间戳相关联。

示例数据中的字段值为：

在上面的数据中，字段键(field keys)和字段值(field values)对的集合构成了一个 字段集(field set)。以下是示例数据中的所有八个字段集：

butterflies = 12 honeybees = 23
butterflies = 1 honeybees = 30
butterflies = 11 honeybees = 28
butterflies = 3 honeybees = 28
butterflies = 2 honeybees = 11
butterflies = 1 honeybees = 10
butterflies = 8 honeybees = 23
butterflies = 7 honeybees = 22

字段(fields)是InfluxDB数据结构的必需部分。没有字段，您不能在InfluxDB中拥有数据。同样重要的是要注意：字段不能设置为索引。使用字段值作为过滤器的查询必须扫描与查询中的其他条件匹配的所有值，所以效率相对于标记（tag）查询偏低。其中标记（tag）查询可以设置索引，所以查询效率更高。

标记(tag) location和 scientist：示例数据中的最后两列（ location和 scientist）是标记。标签由标签键和标签值组成。 标签键和 标记值存储为字符串和记录元数据。示例数据中的标记键是 location和 scientist。标记键 location有两个标记值： 1和 2。标记键 scientist还有两个标记值： langstroth和 perpetua。

在上面的数据中， 标记集是所有标记键值对的不同组合。样本数据中的四个标记集是：

location = 1， scientist = langstroth
location = 2， scientist = langstroth
location = 1， scientist = perpetua
location = 2， scientist = perpetua

标签是可选的。您不需要在数据结构中包含标记，但通常最好使用它们，因为与字段不同，标记是索引的。这意味着对标签的查询更快，并且该标签非常适合存储常用查询元数据。

查询条件中，索引很重要

假设您注意到大多数查询都关注字段键的值，honeybees、butterflies查询语句如下：SELECT * FROM "census" WHERE "butterflies" = 1 SELECT * FROM "census" WHERE "honeybees" = 23

执行如下：

> SELECT * FROM "census" WHERE "butterflies" = 1
name: census
time                butterflies honeybees location scientist
----                ----------- --------- -------- ---------
1546741591954384804 1           30        1        perpetua
1546741670749604756 1           10        2        perpetua
>
> SELECT * FROM "census" WHERE "honeybees" = 23
name: census
time                butterflies honeybees location scientist
----                ----------- --------- -------- ---------
1546741552382793960 12          23        1        langstroth
1546741686055646710 8           23        2        langstroth
>

但是由于字段键(field key) 是没有索引的，在大规模数据查询的时候会扫描全表数据，此时效率就会很低，那么该如何去优化呢？

此时就应该将butterflies、honeybees 两个字段设置为tag，而location、scientist设置为field。

insert census,butterflies=1,honeybees=30  scientist="perpetua",location=1  
insert census,butterflies=11,honeybees=28 scientist="langstroth",location=1
insert census,butterflies=3,honeybees=28  scientist="perpetua",location=1  
insert census,butterflies=2,honeybees=11  scientist="langstroth",location=2
insert census,butterflies=1,honeybees=10  scientist="perpetua",location=2  
insert census,butterflies=8,honeybees=23  scientist="langstroth",location=2
insert census,butterflies=7,honeybees=22  scientist="perpetua",location=2  
insert census,butterflies=12,honeybees=23 scientist="langstroth",location=1

操作如下：

> use mydb
Using database mydb
>
## 查看有哪些表
> show measurements
name: measurements
name
----
census
cpu
temperature
## 清空表数据
> delete from census;
>
> select * from census;
>
## 插入数据
> insert census,butterflies=1,honeybees=30  scientist="perpetua",location=1
> insert census,butterflies=11,honeybees=28 scientist="langstroth",location=1
> insert census,butterflies=3,honeybees=28  scientist="perpetua",location=1  
> insert census,butterflies=2,honeybees=11  scientist="langstroth",location=2
> insert census,butterflies=1,honeybees=10  scientist="perpetua",location=2  
> insert census,butterflies=8,honeybees=23  scientist="langstroth",location=2
> insert census,butterflies=7,honeybees=22  scientist="perpetua",location=2  
> insert census,butterflies=12,honeybees=23 scientist="langstroth",location=1
>
> select * from census
name: census
time                butterflies honeybees location scientist
----                ----------- --------- -------- ---------
1546743438630926762 1           30        1        perpetua
1546743446986027738 11          28        1        langstroth
1546743446997025073 3           28        1        perpetua
1546743447019092699 2           11        2        langstroth
1546743447023970929 1           10        2        perpetua
1546743447027505445 8           23        2        langstroth
1546743447032866644 7           22        2        perpetua
1546743448855305845 12          23        1        langstroth
>

此时，butterflies honeybees已经是tag，属于索引，查询大规模数据的时候效率就会提升。

本文分享自微信公众号 - DevOps社群（DevOpsCommunity）。
如有侵权，请联系 support@oschina.cn 删除。
本文参与“OSC源创计划”，欢迎正在阅读的你也加入，一起分享。

微信关注我们

原文链接：https://my.oschina.net/u/4011572/blog/4549661

转载内容版权归作者及来源网站所有！

低调大师中文资讯倾力打造互联网数据资讯、行业资源、电子商务、移动互联网、网络营销平台。持续更新报道IT业界、互联网、市场资讯、驱动更新,是最及时权威的产业资讯及硬件资讯报道平台。

持续部署入门：基于 Kubernetes 实现滚动发布

前言软件世界比以往任何时候都更快。为了保持竞争力，需要尽快推出新的软件版本，而不会中断活跃用户访问，影响用户体验。越来越多企业已将其应用迁移到 Kubernetes。在 Kubernetes 中有几种不同的方式发布应用，所以为了让应用在升级期间依然平稳提供服务，选择一个正确的发布策略就非常重要了，本篇文章将讲解如何在 Kubernetes 使用滚动更新的方式更新镜像。原理策略定义为 RollingUpdate 的 Deployment。滚动更新通过逐个替换实例来逐步部署新版本的应用，直到所有实例都被替换完成为止，会有新版旧版同时存在的情况。 spec: replicas: 4 strategy: type: RollingUpdate rollingUpdate: maxSurge: 0 # 决定了配置中期望的副本数之外，最多允许超出的 pod 实例的数量 maxUnavailable: %25 # 决定了在滚动升级期间，相对于期望副本数能够允许有多少 pod 实例处于不可用状态上述更新策略执行结果如下图所示实践使用 Kubernetes 原生方式升级应用准备 imag...

2020-09-07

509

参考官方开源文档使用HTTP API查询数据https://docs.influxdata.com/influxdb/v1.7/guides/querying_data/ 使用HTTP的API查询数据 HTTP API是在InfluxDB中查询数据的主要方法（有关查询数据库的其他方法，请参阅命令行界面和客户端库）。注意：以下示例使用curl命令行工具，该工具使用URL传输数据。学习的基础知识curl与HTTP脚本指南。 API查询语句查询语句如下：curl -G 'http://localhost:8086/query?pretty=true' --data-urlencode "db=testdb" --data-urlencode "q=SELECT \"value\" FROM \"cpu_load_short\" WHERE \"region\"='us-west'" 在前面的篇章中，我已经创建了testdb数据库，以及插入了数据。首先查看一下当前InfluxDB中的数据，如下： > show databasesname: databasesname----_in...

2020-09-06

514

资源下载

更多资源

优质分享App

近一个月的开发和优化，本站点的第一个app全新上线。该app采用极致压缩，本体才4.36MB。系统里面做了大量数据访问、缓存优化。方便用户在手机上查看文章。后续会推出HarmonyOS的适配版本。

腾讯云软件源

为解决软件依赖安装时官方源访问速度慢的问题，腾讯云为一些软件搭建了缓存服务。您可以通过使用腾讯云软件源站来提升依赖包的安装速度。为了方便用户自由搭建服务架构，目前腾讯云软件源站支持公网访问和内网访问。

Nacos

Nacos /nɑ:kəʊs/ 是 Dynamic Naming and Configuration Service 的首字母简称，一个易于构建 AI Agent 应用的动态服务发现、配置管理和AI智能体管理平台。Nacos 致力于帮助您发现、配置和管理微服务及AI智能体应用。Nacos 提供了一组简单易用的特性集，帮助您快速实现动态服务发现、服务配置、服务元数据、流量管理。Nacos 帮助您更敏捷和容易地构建、交付和管理微服务平台。

Sublime Text

Sublime Text具有漂亮的用户界面和强大的功能，例如代码缩略图，Python的插件，代码段等。还可自定义键绑定，菜单和工具栏。Sublime Text 的主要功能包括：拼写检查，书签，完整的 Python API ， Goto 功能，即时项目切换，多选择，多窗口等等。Sublime Text 是一个跨平台的编辑器，同时支持Windows、Linux、Mac OS X等操作系统。