Entries with tag solr times .

Actualidad

Updated Docker image for Alfresco monitoring with Alfresco Search Services

avatar
Cesar Capillas Sep 04, 2018
Zylk

During last days of summer, I took some time to update the Docker and Vagrant images for Alfresco monitoring setup with Nagios / Icinga, adding support for Alfresco Search Services and also PNP4Nagios for having some visualizations. In the provided configuration, I created a new host for Alfresco Search Services, and a sample configuration for the most useful services for monitoring SOLR 6: core index errors, average measured times for SOLR handlers or caches hitradios. The sample......

Read More

Using jq for parsing JSON documents

avatar
Cesar Capillas Aug 04, 2018

I'm a big fan of command line shell utilities, and nowadays with the proliferation of REST APIs and JSON documents, one of the commands that I use more frequently in conjuction with curl is jq. jq is a lightweight and flexible command-line JSON processor and, we could say it is like awk or sed, but for JSON syntax. It may installed via apt in Ubuntu, or downloaded directly from github. In the following I'm going to show some illustrative use cases with jq: Use case 1) Extracting information......

Read More

Deleting documents in SOLR

avatar
Cesar Capillas Apr 23, 2018
Zylk

Just a simple tip today for deleting documents in SOLR via REST API and post command. Consider that you have a collection called gettingstarted and some document with id 44C...

Read More

Using schemaless mode and post command in SOLR

avatar
Cesar Capillas Apr 02, 2018
Zylk

In SOLR 6.6 the data_driven_scheme_configs configset is able to implement the features of the so-named Schemaless mode. This mode is a set of features that allow users to construct an effective schema by simply indexing sample data, without having to manually edit the schema. In the following examples, I'm using SOLR Cloud 6.6 setup and collection API. $ cd /opt/solr6/solr-6.6.0/ $ ./bin/solr create -c gettingstarted -shards 1 -replicationFactor 2 -p 8983 -d......

Read More

SOLR client APIs

avatar
Cesar Capillas Mar 26, 2018
Zylk

In SOLR, you may find different client APIs for your favourite programming language such as Java, Python, Ruby, Perl or Javascript. Basically, client apps can reach Solr by creating HTTP requests and parsing the corresponding HTTP responses, encapsulating much of the work of sending requests and parsing responses, and making easier to write client applications. If you are a java programmer you will be confortable with SorlJ, while you can use pysolr or rsolr libs, for Python and Ruby......

Read More

Using the editorial marker field in SOLR elevation component

avatar
Cesar Capillas Mar 15, 2018
Zylk

One of the nice features of SOLR is the ability for doing promoted or recommended searches. With this feature, you can basically use a kind of query (elevation query), that is able to elevate static searches configured by id. Use case example: Imagine that you have a popular Alfresco blog in your site, and you want that when searching for term "alfresco blog", your first or just a "recommended page" is the solutions page. In elevate.xml:

Read More

Playing with SOLR Cloud and REST API

avatar
Cesar Capillas Mar 06, 2018
Zylk

Last days I've been involved in an enterprise search project based on Apache Nutch and Apache SOLR projects. As you probably know, SOLR is a powerful enterprise search and index engine with a powerful REST API, which exposes its features as query, index, delete, commit and optimize, and also including a very useful admin interface. Web applications developed in any programing language (such as Java, .NET, Python, or Ruby), can easily call this REST API and perform SOLR operations, which are......

Read More

More on monitoring dashboards for Alfresco using SOLR, Banana and Apache Zeppelin

avatar
Cesar Capillas Jan 08, 2018

In the previous posts I talked about ELK stack for monitoring Alfresco. But another possibility for loading metrics or logs information, extracted by logstash, is a SOLR index server (instead of Elastic Search), which is part of Alfresco architecture by default, and in principle, it would seem more natural for indexing our Alfresco logs and metrics. Besides, there exists some ports for Kibana in SOLR, such Banana or Silk, that may be deployed in our SOLR dedicated instance. Basically, you......

Read More

Kibana dashboard for monitoring Alfresco Community via OOTB Support Tools

avatar
Cesar Capillas Dec 26, 2017
Zylk

Some weeks ago I wrote a post about how to set up a basic Kibana dashboard representing System Performance and Active Sessions in Alfresco Enterprise via JMX metrics from a logstash JMX input. As you probably know, no such Alfresco JMX objects are available in Alfresco Community, such as active sessions (only the java and catalina objects are available). Anyway, it is possible to obtain a similar dashboard using the relevant information from a webscript, for example, the JSON-based webscripts......

Read More

My first Alfresco Hackathon at BeeCon 2017

avatar
Cesar Capillas May 02, 2017

Some customers told me that last week I was a little bit eventually consistent, and this post is somehow related. I was at the Beecon 2017 in Zaragoza, the Alfresco Developer Conference, time to re-connect, re-charge, and re-commit with Alfresco Community. The event was really amazing (congrats to the organizers and those who made it happen, great event). I have been in previous Alfresco meetups, devcons and summits, but this was my first time in a Hackathon. At first I wasn't sure about......

Read More

How to avoid indexing full content in Alfresco

avatar
Cesar Capillas Feb 04, 2016

To avoid indexing full content in Alfresco, we have different aproximations:   1. From SOLR point of view (tested in SOLR 1.4 and Alfresco 4.2.5):    In solrcore.properties (for workspace and archive store) set:    alfresco.index.transformContent=false alfresco.ignore.datatype.1=d:content   This is a general setup (for all content types).   2. From Filesystem Bulk Import and/or CMIS APIs point of view:    Add aspect cm:indexControl and use cm:isIndexed=true, and cm:isContentIndexed=false in......

Read More

How to deactivate metadata extraction in Alfresco

avatar
Cesar Capillas Dec 03, 2015

This a simple tip for deactivating metadata extractors in Alfresco 4 and 5 (from Alfresco 4.1.4 and above)  When do you need this ? In a bulk filesystem import, you can lose performance in data injection due metadata extracters. When you have a huge repository with millions of documents, and you want to apply indexation policies to prevent a non-controlled indices growth. In alfresco-global.properties just type (and then restart the service): # pdfs extracter.PDFBox.enabled=false # office......

Read More

How to track SOLR indexation process in Alfresco

avatar
Cesar Capillas Nov 18, 2015

For tracking indexation in SOLR, we have the Alfresco Administration Console, which gives us the indexation status, the indices in disk, and an estimation of the remaining time of transactions to be indexed. This is available in Enterprise edition in Admin Console:   http://localhost:8080/alfresco/service/enterprise/admin/admin-searchservice   This information is directly taken from SOLR Summary XML Report (in this case the url is using 8080 instead of the default secured 8843):  ......

Read More

Script para refrescar los tags de sitios en Alfresco Share

avatar
Cesar Capillas Mar 26, 2015

En muchas implantaciones de Alfresco CE y EE de las versiones 3.x, 4.x y 5.x me he encontrado que los tags no funcionan lo suficientemente bien, viendome obligado a refrescar el tagscope de los sitios. Esto produce un reindexado de estas etiquetas. Es una solución temporal, pero permite definir una acción a demanda por ejemplo en un dashlet, un webscript, o incluso en una tarea programada. Dejo aqui un script aplicable a las librerías de documentos de los sitios. Se puede ejecutar desde la......

Read More

Controlando la indexacion de contenido en Alfresco

avatar
Cesar Capillas Dec 09, 2014

La característica de indexado automático de los contenidos y metadatos de un documento, es normalmente bienvenida en un gestor documental. Sin embargo, en ciertos casos es deseable no indexar ese contenido, por ejemplo si una aplicación "no hace uso de ese contenido indexado".   En Alfresco 4.2 EE tenemos un par de opciones para gestionar el indexado de los documentos: La primera es a través del aspecto cm:indexControl que tiene dos propiedades, cm:isIndexed y cm:isContentIndexed, que por......

Read More

Backing up Alfresco indices

avatar
Cesar Capillas Oct 30, 2014

In a previous post we talk about search subsystems in Alfresco. There exists for each subsystem some quartz jobs related to backup indices. In the case of Lucene subsystem, Lucene backups are done at 3AM and their parameters are configurable via alfresco-global.properties (or extending the corresponding search subsystem).   index.backup.cronExpression=0 0 3 * * ? dir.indexes=${dir.root}/lucene-indexes dir.indexes.backup=/backup-lucene-indexes   So you can always change these values, and then......

Read More

Notes about Search Subsystem in Alfresco

avatar
Cesar Capillas Oct 26, 2014

Search subsystem in Alfresco can be Lucene or SOLR type. In the default configuration for Alfresco 4, the search subsystem is SOLR. This can be set up in alfresco-global.properties   index.subsystem.name=solr solr.port.ssl=8443 solr.host=localhost dir.keystore=${dir.root}/keystore     where the SOLR application is deployed by default on the same Tomcat container, and it works with the 8443 SSL connector. It is important, as part of the Alfresco installation, generating the a certificate......

Read More

Performing a full reindex with Solr for Alfresco ECM

avatar
Cesar Capillas Jul 29, 2014

For Alfresco 4.x and Solr 1.4: #! /bin/bash export ALF_HOME=/opt/alfresco4 export SOLR_HOME=/opt/alfresco4/alf_data/solr $ALF_HOME/alfresco.sh stop sleep 10 rm -rf $SOLR_HOME/workspace/SpacesStore/* rm -rf $SOLR_HOME/archive/SpacesStore/* rm -f $SOLR_HOME/workspace-SpacesStore/alfrescoModels/* rm -f $SOLR_HOME/archive-SpacesStore/alfrescoModels/* $ALF_HOME/alfresco.sh start   https://docs.alfresco.com/4.1/tasks/solr-reindex.html For Alfresco 5.x and Solr 4: #! /bin/bash export......

Read More
— 20 Items per Page
Showing 18 results.