官术网_书友最值得收藏!

Creating data repositories for data versioning

If you followed the local installation of Pachyderm specified in the Pachyderm documentation, you should have the following:

  • Kubernetes running in a Minikube VM on your machine
  • The pachctl command line tool installed and connected to your Pachyderm cluster

Of course, if you have a production cluster running in a cloud, the following steps still apply. Your pachctl would just be connected to the remote cluster.

We will be demonstrating data versioning functionality with the pachctl Command-line Interface ( CLI) tool below (which is a Go program). However, as mentioned above, Pachyderm has a full-fledged Go client. You can create repositories, commit data, and much more directly from your Go programs. This functionality will be demonstrated later in Chapter 9, Deploying and distributing Analyses and Models.

To create a repository of data called myrepo, you can run this code:

$ pachctl create-repo myrepo

You can then confirm that the repository exists with list-repo:

$ pachctl list-repo
NAME CREATED SIZE
myrepo 2 seconds ago 0 B

This myrepo repository is a collection of data that we have defined and is ready for housing-versioned data. Right now, there is no data in the repository, because we haven't put any data there yet.

主站蜘蛛池模板: 天峻县| 抚州市| 平顺县| 新郑市| 湟中县| 息烽县| 土默特左旗| 韩城市| 海安县| 垦利县| 西充县| 承德县| 垣曲县| 灵山县| 内黄县| 织金县| 克什克腾旗| 宁晋县| 许昌县| 蓝田县| 呼伦贝尔市| 灵山县| 大荔县| 云南省| 兴仁县| 班戈县| 安溪县| 嘉黎县| 蓬溪县| 奉化市| 合作市| 建平县| 夏津县| 汉源县| 阳高县| 大兴区| 肥乡县| 六安市| 石家庄市| 建平县| 保靖县|