Nornir 网络自动化框架代码解析

Green

　　Nornir 一种类似ansible的python网络设备管理框架;
　　官方定义：

Nornir is an automation framework written in python to be used with python
基于python开发的、同时也是通过ptyhon方式调用的 (网络)自动化框架

　　官方参考文档:https://nornir.readthedocs.io/en/latest/index.html
　　本文目的：介绍nornir底层代码，以便后续进行二次开发。

1.初识nornir

　　先根据官网的tutorial 跑通一下nornir的整个流程

nornir_ssh % ls -R
config.yaml   inventory     nornir.log    nornir_01.py
./inventory:
defaults.yaml   groups.yaml   hosts.yaml

b.文件内容

# ./inventory/hosts.yaml
---
h3c_sw:
  hostname: 192.168.31.103
  port: 22
  username: admin
  password: admin
  platform: hp_comware  # 这里根据使用的connectionPlugin的不同，选择对应厂商的名字，
            # 本次使用netmiko，所以netmiko连接h3c的设备可以用hp_comware的设备类型(device_type)；
            # 如果是napalm的话 名字可能会不一样。
  group:
  - H3C_Device
  data:
  cmds:
    - display cur

# ./inventory/groups.yaml
---
H3C_Device:
  data:
  - vsr1000
Cisco_Device:
  data:
  - ios

# ./inventory/defaults.yaml
为空
# nornir.log
为空

# config.yaml
---
inventory:
  plugin: SimpleInventory
  options:
  host_file: "inventory/hosts.yaml"
  group_file: "inventory/groups.yaml"
  defaults_file: "inventory/defaults.yaml"
runner:
  plugin: threaded
  options:
  num_workers: 100

# nornir01.py 可以随便取，无所谓
from nornir import InitNornir
from nornir_utils.plugins.functions import print_result
from nornir_netmiko import netmiko_send_command

def show_cmds(task):
  outputs = []
  cmds = task.host.data['cmds']
  for cmd in cmds:
    result = task.run(netmiko_send_command, command_string=cmd)
    output = result.result
    outputs += output
  return outputs

nr = InitNornir(config_file='config.yaml', dry_run=True)
results = nr.run(task=show_cmds)
print_result(results['h3c_sw'][0])

　　运行上面的python文件:

python nornir01.py

2.理解nornir组件概念和python包的知识

# 1.最底层的task是函数，后面函数作为Task对象的参数，封装一个Task对象task
# 2.processor 本质是一个接口，所谓接口就是要实现特定方法的类
# 3.理解pkg_resource 包的作用，其实就是将一些目录下的文件所定义的对象 加载到一个字典中来；下面的连接插件注册和运行插件注册都是用来这个原理
# 4.connection plugin 是什么？ netmiko/napalm ？
# 5.runner plugin 是什么？ threaded ？
# 6.插件加载完后，会根据输入的内容判断 是否 插件字典available有对应名字的插件对象

3.结合nornir底层代码片段来分析整个流程

　　下面会重点讲诉如下几行代码：

'''
'''
def show_cmds(task):
  outputs = []
  cmds = task.host.data['cmds']
  for cmd in cmds:
    result = task.run(netmiko_send_command, command_string=cmd)
    output = result.result
    outputs += output
  return outputs
nr = InitNornir(config_file='config.yaml', dry_run=True)
results = nr.run(task=show_cmds)
'''
'''

3.1.自定义函数show_comds中必须要有1个参数，这个参数到时会被传递一个Task实例对象:

# nornir.core.task.py -> Task.start()       
r = self.task(self, **self.params)  # 这个self.task是自定义函数show_cmds

3.2.InitNornir()根据配置文件或关键字参数进行配置项初始化

def InitNornir(config_file: str = "", dry_run: bool = False, **kwargs: Any,) -> Nornir:
  """
  Arguments:
    config_file(str): Path to the configuration file (optional)
    dry_run(bool): Whether to simulate changes or not
    configure_logging: Whether to configure logging or not. This argument is being
      deprecated. Please use logging.enabled parameter in the configuration
      instead.
    **kwargs: Extra information to pass to the
      :obj:`nornir.core.configuration.Config` object
  Returns:
    :obj:`nornir.core.Nornir`: fully instantiated and configured
  """
  ConnectionPluginRegister.auto_register()  # 注册/加载 连接插件 比如netmiko、napalm等；通过pkg_resource包来注册
  # 有传递配置文件的话就根据配置文件初始化，否则根据关键字参数初始化
  if config_file:
    config = Config.from_file(config_file, **kwargs)
  else:
    config = Config.from_dict(**kwargs)
  data = GlobalState(dry_run=dry_run)
  config.logging.configure()
  # 返回一个Nornir实例对象
  return Nornir(
    inventory=load_inventory(config),  # 加载主机插件，比如SimpleInventory，将主机资产从配置文件 放到python对象中
    runner=load_runner(config),  # 加载runner插件，比如threaded、serial
    config=config,
    data=data,
  )

3.3 下面看一下Nornir.runner.run这个方法

　　重点是这行代码：result = self.runner.run(task, run_on)，其实是进入runner插件的run()方法里面

# nornir.core.__init__.py -> Nornir.run()
  def run(
    self,
    task,
    raise_on_error=None,
    on_good=True,
    on_failed=False,
    name: Optional[str] = None,
    **kwargs,
  ):
    """
    Run task over all the hosts in the inventory.
    Arguments:
      task (``callable``): function or callable that will be run against each device in
        the inventory
      raise_on_error (``bool``): Override raise_on_error behavior
      on_good(``bool``): Whether to run or not this task on hosts marked as good
      on_failed(``bool``): Whether to run or not this task on hosts marked as failed
      **kwargs: additional argument to pass to ``task`` when calling it
    Raises:
      :obj:`nornir.core.exceptions.NornirExecutionError`: if at least a task fails
        and self.config.core.raise_on_error is set to ``True``
    Returns:
      :obj:`nornir.core.task.AggregatedResult`: results of each execution
    """
    task = Task(
      task,  # task 是一个函数
      self,  # nornir 实例
      global_dry_run=self.data.dry_run,
      name=name,
      processors=self.processors,  # 将包含processor的列表传递进去；processor是实现来一些方法的类，这些方法在任务执行过程的某个时间点会被执行
      **kwargs,  # 这个参数应该是放入到task函数里面的
    )
    self.processors.task_started(task)  # 运行任务启动前的processor 的task_task_started方法；processor是针对所有设备而言的，不是针对特定的设备
    run_on = []  # 存放需要运行的task函数的所有设备
    if on_good:
      for name, host in self.inventory.hosts.items():
        if name not in self.data.failed_hosts:
          run_on.append(host)
    if on_failed:
      for name, host in self.inventory.hosts.items():
        if name in self.data.failed_hosts:
          run_on.append(host)
    # 打印日志，日志输出一共有多少台设备
    num_hosts = len(self.inventory.hosts)
    if num_hosts:
      logger.info(
        "Running task %r with args %s on %d hosts",
        task.name,
        kwargs,
        num_hosts,
      )
    else:
      logger.warning("Task %r has not been run – 0 hosts selected", task.name)
    # 将封装好的Task对象task 和所有主机 放入runner.run()中执行
    result = self.runner.run(task, run_on)
    # runner.run()异常处理
    raise_on_error = (
      raise_on_error
      if raise_on_error is not None
      else self.config.core.raise_on_error
    )  # noqa
    if raise_on_error:
      result.raise_on_error()
    else:
      self.data.failed_hosts.update(result.failed_hosts.keys())
    # runner.run()运行完成是调用processor.task_completed()函数
    self.processors.task_completed(task, result)
    return result

3.3.1 进入runner插件的run方法看一下

　　如果直接点击代码引用的话，是直接跳到runner接口定义的代码中，我们要看具体的runner插件实现.
　　看如下代码，其实是将Task.start方法放入线程池里面(因为我们的runner插件在config.yaml中配置的是threaded)

# nornir/plugins/runners/__init__.py  -> ThreadedRunner.run()
  def run(self, task: Task, hosts: List[Host]) -> AggregatedResult:
    result = AggregatedResult(task.name)
    futures = []
    with ThreadPoolExecutor(self.num_workers) as pool:
      for host in hosts:
        future = pool.submit(task.copy().start, host)
        futures.append(future)

3.3.2 再进上面代码中的重点代码看看

　　future = pool.submit(task.copy().start, host)

# copy比较简单 直接把Task实例对象的属性 用来再实例化Task
  def copy(self) -> "Task":
    return Task(
      self.task,
      self.nornir,
      self.global_dry_run,
      self.processors,
      self.name,
      self.severity_level,
      self.parent_task,
      **self.params
    )
  def __repr__(self) -> str:
    return self.name
  # 这个start方法是重点
  def start(self, host: "Host") -> "MultiResult":
    """
    Run the task for the given host.
    Arguments:
      host (:obj:`nornir.core.inventory.Host`): Host we are operating with. Populated right
        before calling the ``task``
      nornir(:obj:`nornir.core.Nornir`): Populated right before calling
        the ``task``
    Returns:
      host (:obj:`nornir.core.task.MultiResult`): Results of the task and its subtasks
    """
    self.host = host
    if self.parent_task is not None:
      self.processors.subtask_instance_started(self, host)
    else:
      self.processors.task_instance_started(self, host)
    try:
      logger.debug("Host %r: running task %r", self.host.name, self.name)
      # 重点，self.task是自定义函数来，不是Task对象，看第一个参数是self，跟代码分析时说的一样，我们自定义函数时必须要给函数设定一个参数
      r = self.task(self, **self.params)
      if not isinstance(r, Result):
        r = Result(host=host, result=r)
    except NornirSubTaskError as e:
      tb = traceback.format_exc()
      logger.error(
        "Host %r: task %r failed with traceback:\n%s",
        self.host.name,
        self.name,
        tb,
      )
      r = Result(host, exception=e, result=str(e), failed=True)
    except Exception as e:
      tb = traceback.format_exc()
      logger.error(
        "Host %r: task %r failed with traceback:\n%s",
        self.host.name,
        self.name,
        tb,
      )
      r = Result(host, exception=e, result=tb, failed=True)
    r.name = self.name
    if r.severity_level == DEFAULT_SEVERITY_LEVEL:
      if r.failed:
        r.severity_level = logging.ERROR
      else:
        r.severity_level = self.severity_level
    self.results.insert(0, r)
    if self.parent_task is not None:
      self.processors.subtask_instance_completed(self, host, self.results)
    else:
      self.processors.task_instance_completed(self, host, self.results)
    return self.results

3.4 下面在看一下Task.run()这个方法

　　注意跟Nornir.runner.run()区分，不要搞混淆

# nornir/core/task.py -> Task.run()
  # 这个Task里面的run 是自定义函数调用 其他task函数时用的(有点嵌套的感觉)
  def run(self, task: Callable[..., Any], **kwargs: Any) -> "MultiResult":
    """
    This is a utility method to call a task from within a task. For instance:
      def grouped_tasks(task):
        task.run(my_first_task)
        task.run(my_second_task)
      nornir.run(grouped_tasks)
    This method will ensure the subtask is run only for the host in the current thread.
    """
    if not self.host:
      msg = (
        "You have to call this after setting host and nornir attributes. ",
        "You probably called this from outside a nested task",
      )
      raise Exception(msg)
    if "severity_level" not in kwargs:
      kwargs["severity_level"] = self.severity_level
    # 重点1: 将内嵌的task方法封装成Task对象
    run_task = Task(
      task,
      self.nornir,
      global_dry_run=self.global_dry_run,
      processors=self.processors,
      parent_task=self,
      **kwargs
    )
    # 重点2: 调用start方法，上面“3.3.2”中说过的Task.start
    r = run_task.start(self.host)
    self.results.append(r[0] if len(r) == 1 else cast("Result", r))
    if r.failed:
      # Without this we will keep running the grouped task
      raise NornirSubTaskError(task=run_task, result=r)
    return r

4.总结

　　通过上述代码的演示，应该可以看出Nornir底层代码的大概实现了。
　　如果要看懂，要自己亲自动手一起调试结合断点才能看明白，不然会看的很晕。
　　ps:
其实还有个processor的东西还可以讲，其实就是在Task运行到某个状态执行processor类中的某个方法，其实在上面代码分析过程中也是可以大概看出一些眉目来的。因为篇幅的原因就说到这，如果有什么不对的地方，望批评指正，谢谢！

[python] Nornir 网络自动化框架代码解析

相关帖子

如何共享网络打印机到电脑

qq应用管理器在哪里打开

qq好友管理器在哪里找到

网络ssid填写什么好

网络共享设置步骤

台式电脑连接wifi步骤无网络

win7网络重置选项在哪

qq拼音输入法管理词库教程

怎么利用配置文件来管理多个 Node.js 进程

基于python的动态时钟，带十二时辰和经络养身

Green LV4

[python] Nornir 网络自动化框架 代码解析