Installing Solr
Installation of Solr on Unix-compatible or Windows servers generally requires simply extracting (or unzipping) the download package.
Please be sure to review the Solr System Requirements before starting Solr.
Available Solr Packages
Solr is available from the Solr website. Download the latest release from https://lucene.apache.org/solr/downloads.html.
There are three separate packages:
- solr-8.4.0.tgz for Linux/Unix/OSX systems
- solr-8.4.0.zip for Microsoft Windows systems
- solr-8.4.0-src.tgz the Solr source code package. This is useful if you want to develop on Solr without using the official Git repository.
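The choice between the two binary packages can be sketched as a small shell helper. This is only an illustration: the function name `solr_package_for` and the idea of keying off a `uname -s` style string are assumptions, and the version number is simply the one listed above.

```shell
#!/bin/sh
# Hypothetical helper: map an OS name (as printed by `uname -s`) to the
# binary package listed above. The version is fixed at 8.4.0 for illustration.
SOLR_VERSION="8.4.0"

solr_package_for() {
  case "$1" in
    Linux|Darwin|*BSD|SunOS)
      echo "solr-${SOLR_VERSION}.tgz" ;;
    CYGWIN*|MINGW*|MSYS*|Windows*)
      echo "solr-${SOLR_VERSION}.zip" ;;
    *)
      # Unknown platform: no package suggestion.
      return 1 ;;
  esac
}

# Print the package for the current machine, if it is recognized.
solr_package_for "$(uname -s)" || echo "unrecognized platform: $(uname -s)"
```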
Preparing for Installation
When getting started with Solr, all you need to do is extract the Solr distribution archive to a directory of your choosing. This will suffice as an initial development environment, but take care not to overtax this "toy" installation before setting up your true development and production environments.
When you've progressed past initial evaluation of Solr, you'll want to take care to plan your implementation. You may need to reinstall Solr on another server or make a clustered SolrCloud environment.
When you're ready to set up Solr for a production environment, please refer to the instructions provided on the Taking Solr to Production page.
What Size Server Do I Need?
How to size your Solr installation is a complex question that relies on a number of factors, including the number and structure of documents, how many fields you intend to store, the number of users, etc. It's highly recommended that you spend a bit of time thinking about the factors that will impact hardware sizing for your Solr implementation. A very good blog post that discusses the issues to consider is Sizing Hardware in the Abstract: Why We Don't Have a Definitive Answer.
One thing to note when planning your installation is that a hard limit exists in Lucene for the number of documents in a single index: approximately 2.14 billion documents (2,147,483,647 to be exact). In practice, it is highly unlikely that such a large number of documents would fit and perform well in a single index, and you will likely need to distribute your index across a cluster before you ever approach this number. If you know you will exceed this number of documents in total before you've even started indexing, it's best to plan your installation with SolrCloud as part of your design from the beginning.
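As a rough illustration of the arithmetic, the following sketch computes the smallest number of indexes that keeps each one at or under the Lucene limit, assuming documents are spread evenly. The function name `min_shards` and the example total are hypothetical; real sizing depends on far more than document count, as noted above.

```shell
#!/bin/sh
# Lucene's per-index document limit quoted above (2^31 - 1).
LUCENE_MAX_DOCS=2147483647

# Hypothetical helper: smallest number of indexes needed so that an even
# split of $1 documents stays at or under the limit (ceiling division).
# Assumes a shell with 64-bit integer arithmetic.
min_shards() {
  total=$1
  echo $(( (total + LUCENE_MAX_DOCS - 1) / LUCENE_MAX_DOCS ))
}

# Example: a corpus of 5 billion documents (an illustrative figure).
min_shards 5000000000   # prints 3
```

Any result greater than 1 means a single index cannot hold the corpus, which is the case where planning for SolrCloud from the start applies.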
Package Installation
To keep things simple for now, extract the Solr distribution archive to your local home directory. For instance, on Linux, do:
cd ~/
tar zxf solr-8.4.0.tgz
Once extracted, you are ready to run Solr using the instructions provided in the Starting Solr section below.
Directory Layout
After installing Solr, you'll see the following directories and files within them:
- bin/
- This directory includes several important scripts that will make using Solr easier.
  - solr and solr.cmd
  - This is Solr's Control Script, also known as bin/solr (*nix) / bin/solr.cmd (Windows). This script is the preferred tool to start and stop Solr. You can also create collections or cores, configure authentication, and work with configuration files when running in SolrCloud mode.
  - post
  - The PostTool, which provides a simple command line interface for POSTing content to Solr.
  - solr.in.sh and solr.in.cmd
  - These are property files for *nix and Windows systems, respectively. System-level properties for Java, Jetty, and Solr are configured here. Many of these settings can be overridden when using bin/solr / bin/solr.cmd, but this allows you to set all the properties in one place.
  - install_solr_service.sh
  - This script is used on *nix systems to install Solr as a service. It is described in more detail in the section Taking Solr to Production.
- contrib/
- Solr's contrib directory includes add-on plugins for specialized features of Solr.
- dist/
- The dist directory contains the main Solr .jar files.
- docs/
- The docs directory includes a link to online Javadocs for Solr.
- example/
- The example directory includes several types of examples that demonstrate various Solr capabilities. See the section Solr Examples below for more details on what is in this directory.
- licenses/
- The licenses directory includes all of the licenses for third party libraries used by Solr.
- server/
- This directory is where the heart of the Solr application resides. A README in this directory provides a detailed overview, but here are some highlights:
  - Solr's Admin UI (server/solr-webapp)
  - Jetty libraries (server/lib)
  - Log files (server/logs) and log configurations (server/resources). See the section Configuring Logging for more details on how to customize Solr's default logging.
  - Sample configsets (server/solr/configsets)
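One way to sanity-check an extracted distribution against the layout above is a short script like the following. It is a sketch only: the function name `check_solr_layout` is hypothetical, and the list of directories is simply the top-level entries described above.

```shell
#!/bin/sh
# Hypothetical helper: report which of the top-level directories described
# above are missing from an extracted Solr distribution at path $1.
check_solr_layout() {
  solr_home=$1
  missing=0
  for d in bin contrib dist docs example licenses server; do
    if [ ! -d "$solr_home/$d" ]; then
      echo "missing: $d/"
      missing=$((missing + 1))
    fi
  done
  # Succeed only when every expected directory is present.
  [ "$missing" -eq 0 ]
}

# Usage, assuming the archive was extracted into the home directory:
# check_solr_layout ~/solr-8.4.0 && echo "layout looks complete"
```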
Solr Examples
Solr includes a number of example documents and configurations to use when getting started. If you ran through the Solr Tutorial, you have already interacted with some of these files.
Here are the examples included with Solr:
- exampledocs
- This is a small set of simple CSV, XML, and JSON files that can be used with bin/post when first getting started with Solr. For more information about using bin/post with these files, see Post Tool.
- example-DIH
- This directory includes a few example DataImport Handler (DIH) configurations to help you get started with importing structured content in a database, an email server, or even an Atom feed. Each example will index a different set of data; see the README there for more details about these examples.
- files
- The files directory provides a basic search UI for documents such as Word or PDF that you may have stored locally. See the README there for details on how to use this example.
- films
- The films directory includes a robust set of data about movies in three formats: CSV, XML, and JSON. See the README there for details on how to use this dataset.
Starting Solr
Solr includes a command line interface tool called bin/solr (Linux/MacOS) or bin\solr.cmd (Windows). This tool allows you to start and stop Solr, create cores and collections, configure authentication, and check the status of your system.
To use it to start Solr you can simply enter:
bin/solr start
If you are running Windows, you can start Solr by running bin\solr.cmd instead.
This will start Solr in the background, listening on port 8983.
When you start Solr in the background, the script will wait to make sure Solr starts correctly before returning to the command line prompt.
Start Solr with a Specific Bundled Example
Solr also provides a number of useful examples to help you learn about key features. You can launch the examples using the -e flag. For instance, to launch the "techproducts" example, you would do:
bin/solr -e techproducts
Currently, the available examples you can run are: techproducts, dih, schemaless, and cloud. See the section Running with Example Configurations for details on each example.
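The list of bundled examples lends itself to a small guard before launching. The sketch below validates a name against the four examples listed above and prints the command it would run. The function name `solr_example_cmd` is hypothetical, and printing rather than executing is a deliberate choice so the snippet is safe to try without Solr installed.

```shell
#!/bin/sh
# Hypothetical helper: print the launch command for a bundled example,
# or fail if the name is not one of the examples listed above.
solr_example_cmd() {
  case "$1" in
    techproducts|dih|schemaless|cloud)
      echo "bin/solr -e $1" ;;
    *)
      echo "unknown example: $1" >&2
      return 1 ;;
  esac
}

solr_example_cmd techproducts   # prints: bin/solr -e techproducts
```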
Check if Solr is Running
If you're not sure if Solr is running locally, you can use the status command:
bin/solr status
This will search for running Solr instances on your computer and then gather basic information about them, such as the version and memory usage.
That's it! Solr is running. If you need convincing, use a Web browser to see the Admin Console.
http://localhost:8983/solr/
If Solr is not running, your browser will complain that it cannot connect to the server. Check your port number and try again.
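For scripts that need to confirm the Admin Console is reachable, a polling loop is one option. The following is a minimal sketch and not part of Solr: `wait_until` and `solr_is_up` are hypothetical helper names, `curl` is assumed to be installed, and the URL is the default one shown above.

```shell
#!/bin/sh
# Hypothetical helper: run a probe command once per second until it
# succeeds or roughly $1 seconds have elapsed. Returns 0 on success.
wait_until() {
  timeout_s=$1
  shift
  elapsed=0
  until "$@"; do
    if [ "$elapsed" -ge "$timeout_s" ]; then
      return 1
    fi
    sleep 1
    elapsed=$((elapsed + 1))
  done
  return 0
}

# Probe: succeeds if the Admin Console URL responds without an HTTP error.
# The port defaults to 8983; curl is an assumed dependency.
solr_is_up() {
  curl -fsS -o /dev/null "http://localhost:${1:-8983}/solr/" 2>/dev/null
}

# Usage: wait up to 30 seconds for Solr on the default port.
# wait_until 30 solr_is_up 8983 && echo "Solr is reachable"
```

Keeping the probe separate from the loop means the same `wait_until` can be reused with a different check, such as a non-default port.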
Create a Core
If you did not start Solr with an example configuration, you would need to create a core in order to be able to index and search. You can do so by running:
bin/solr create -c <name>
This will create a core that uses a data-driven schema which tries to guess the correct field type when you add documents to the index.
To see all available options for creating a new core, execute:
bin/solr create -help
Source: https://solr.apache.org/guide/8_4/installing-solr.html