Creating data repositories for data versioning

If you followed the local installation of Pachyderm specified in the Pachyderm documentation, you should have the following:

  • Kubernetes running in a Minikube VM on your machine
  • The pachctl command-line tool installed and connected to your Pachyderm cluster

Of course, if you have a production cluster running in a cloud, the following steps still apply. Your pachctl would just be connected to the remote cluster.

We will demonstrate the data versioning functionality below with the pachctl command-line interface (CLI) tool, which is itself a Go program. However, as mentioned above, Pachyderm has a full-fledged Go client. You can create repositories, commit data, and much more directly from your Go programs. This functionality will be demonstrated later in Chapter 9, Deploying and Distributing Analyses and Models.

To create a data repository called myrepo, run the following command:

$ pachctl create-repo myrepo

You can then confirm that the repository exists with list-repo:

$ pachctl list-repo
NAME                CREATED             SIZE
myrepo              2 seconds ago       0 B
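
If you want to perform the same check from a Go program without the Pachyderm client, one option is to inspect the text that list-repo prints. The following is a minimal sketch, not Pachyderm's own API: the repoExists helper and the sampleOutput constant are hypothetical names introduced here, and the parsing assumes the layout shown above (a header line followed by one repository per line, with the name in the first column). In practice, the text would come from running pachctl list-repo, for example via os/exec.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// sampleOutput mirrors the list-repo output shown above. It stands in for
// the text a real invocation of pachctl list-repo would print.
const sampleOutput = `NAME CREATED SIZE
myrepo 2 seconds ago 0 B
`

// repoExists reports whether name appears in the first column of
// list-repo style output. It assumes the first line is a header and that
// each following line starts with the repository name.
func repoExists(output, name string) bool {
	scanner := bufio.NewScanner(strings.NewReader(output))
	header := true
	for scanner.Scan() {
		if header {
			// Skip the NAME CREATED SIZE header line.
			header = false
			continue
		}
		fields := strings.Fields(scanner.Text())
		if len(fields) > 0 && fields[0] == name {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(repoExists(sampleOutput, "myrepo"))
	fmt.Println(repoExists(sampleOutput, "otherrepo"))
}
```

Splitting each line on whitespace keeps the helper indifferent to how the columns are padded, at the cost of only looking at the repository name and ignoring the CREATED and SIZE columns.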

This myrepo repository is a collection of data that we have defined, and it is ready to house versioned data. Right now it is empty, as the 0 B size in the listing shows, because we haven't put any data there yet.