官术网_书友最值得收藏!

Creating data repositories for data versioning

If you followed the local installation of Pachyderm specified in the Pachyderm documentation, you should have the following:

  • Kubernetes running in a Minikube VM on your machine
  • The pachctl command line tool installed and connected to your Pachyderm cluster

Of course, if you have a production cluster running in a cloud, the following steps still apply. Your pachctl would just be connected to the remote cluster.

We will be demonstrating data versioning functionality with the pachctl Command-line Interface ( CLI) tool below (which is a Go program). However, as mentioned above, Pachyderm has a full-fledged Go client. You can create repositories, commit data, and much more directly from your Go programs. This functionality will be demonstrated later in Chapter 9, Deploying and distributing Analyses and Models.

To create a repository of data called myrepo, you can run this code:

$ pachctl create-repo myrepo

You can then confirm that the repository exists with list-repo:

$ pachctl list-repo
NAME CREATED SIZE
myrepo 2 seconds ago 0 B

This myrepo repository is a collection of data that we have defined and is ready for housing-versioned data. Right now, there is no data in the repository, because we haven't put any data there yet.

主站蜘蛛池模板: 兰溪市| 广灵县| 陇川县| 吉林省| 宁河县| 蚌埠市| 平阴县| 邵阳市| 肃北| 武威市| 高青县| 易门县| 紫阳县| 南江县| 西平县| 儋州市| 兴安县| 平顶山市| 天镇县| 浪卡子县| 巫溪县| 扎囊县| 台州市| 大埔区| 项城市| 偏关县| 镇坪县| 安义县| 通州区| 垫江县| 鹤岗市| 元阳县| 福建省| 平和县| 铁岭市| 泽州县| 博爱县| 岚皋县| 乌鲁木齐市| 湖南省| 察雅县|