KB 2445: REFINED FILTERING OF THE LEBONCOIN.FR WEBSITE
leboncoin.fr is a classified ads site, and Olfeo filters the entire site under a single category. However, to control access to specific sections or to obtain finer statistics, it may be necessary to segment leboncoin.fr so that each theme of the site is linked to an Olfeo category. This article describes how to set up this refined filtering.
Context
The website leboncoin.fr is classified in the Classifieds category in the Olfeo database.
Olfeo has chosen not to split the leboncoin.fr site across several categories, for two reasons. First, most of our users prefer to treat leboncoin.fr as what it is overall: a generalist classified ads site. Second, spreading the different parts of the site across other categories would, by default, allow direct access to those parts without going through the leboncoin.fr home page, via a simple search engine query (e.g. leboncoin.fr/voitures/ would then be handled by the "Cars, Mechanics" category, and so on).
All filtering and statistics are based on this category.
However, it can be useful to divide the leboncoin.fr site by theme and link each theme to the Olfeo solution's URL filtering by category, or simply to define certain parts of the site as accessible or prohibited.
There are two ways to do this, depending on your needs.
Steps
You want to block or authorize some of the themes of the leboncoin.fr site while applying the opposite action to the rest.
To do this, enter a regex:
Important: for display reasons, the above regex has been split at the end of each line. For this regex to work, all terms must be entered in sequence, without line breaks.
Once this regex is in place, to authorize or block only the vehicle theme, keep only the vehicle-related terms in the regex (for example voitures|motos|caravaning|utilitaires, as they appear in the site's URLs) and apply the filtering policy to it.
The advantage of this method is that a single regex covers all themes of the leboncoin.fr site.
Please note that in the statistics, all accesses (authorizations and blocks) will appear in the Classifieds category, regardless of the part of the site viewed.
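As an illustration of the single-regex approach, the following Python sketch joins a list of theme terms into one alternation written on a single line, then checks sample URLs against it. The helper `build_regex`, the exact regex shape, and the sample URLs are assumptions made for this example and are not the regex published by Olfeo; the terms are those used for the vehicle theme in the table below. Whether the Olfeo regex engine accepts exactly the same syntax as Python should be verified before use.

```python
import re

# Assumed URL prefix, modeled on the per-category regexes in this article.
PREFIX = r"^http://www\.leboncoin\.fr/.*"

def build_regex(terms):
    """Join the terms into one alternation, all on a single line (hypothetical helper)."""
    return PREFIX + "(" + "|".join(re.escape(t) for t in terms) + ").*"

# Keep only the vehicle-related terms to target the vehicle theme.
vehicle_terms = ["voitures", "motos", "caravaning", "utilitaires"]
vehicle_regex = re.compile(build_regex(vehicle_terms))

def matches_vehicle_theme(url):
    """Return True if the URL belongs to one of the vehicle terms."""
    return vehicle_regex.match(url) is not None

print(build_regex(vehicle_terms))
print(matches_vehicle_theme("http://www.leboncoin.fr/voitures/offres/"))
print(matches_vehicle_theme("http://www.leboncoin.fr/ameublement/offres/"))
```

Applying the policy to this regex then affects only URLs containing one of the listed terms; every other leboncoin.fr URL keeps the behavior of the Classifieds category.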
Alternatively, you can link each theme of the leboncoin.fr site to the corresponding Olfeo category. This gives detailed filtering of the site, to which the policies you have already set up for each category will apply.
To do this, you need to integrate the corresponding regex for each category. Click on a category in the webadmin and enter the regex in the list.
Category | Regex |
---|---|
Alcohol and tobacco | ^http://www\.leboncoin\.fr\/.*vins_gastronomie.* |
Teaching | ^http://www\.leboncoin\.fr\/.*cours_particuliers.* |
Real estate | ^http://www\.leboncoin\.fr\/.*_immobilier_.* |
Real estate | ^http://www\.leboncoin\.fr\/.*ventes_immobilieres.* |
Real estate | ^http://www\.leboncoin\.fr\/.*locations.* |
Real estate | ^http://www\.leboncoin\.fr\/.*colocations.* |
Real estate | ^http://www\.leboncoin\.fr\/.*locations_de_vacances.* |
Games, Toys | ^http://www\.leboncoin\.fr\/.*jeux_jouets.* |
Information Technology | ^http://www\.leboncoin\.fr\/.*_multimedia_.* |
Information Technology | ^http://www\.leboncoin\.fr\/.*informatique.* |
Video Games, Computer Games | ^http://www\.leboncoin\.fr\/.*consoles_jeux_video.* |
Information Technology | ^http://www\.leboncoin\.fr\/.*image_son.* |
Cell phones, Logos, Ringtones | ^http://www\.leboncoin\.fr\/.*telephonie.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*_maison_.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*ameublement.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*electromenager.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*decoration.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*linge_de_maison.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*bricolage.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*jardinage.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*vetements.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*chaussures.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*accessoires_bagagerie.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*montres_bijoux.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*equipement_bebe.* |
Fashion, Beauty, Well-being, Decoration | ^http://www\.leboncoin\.fr\/.*vetements_bebe.* |
Leisure, Hobbies, Passions | ^http://www\.leboncoin\.fr\/.*_loisirs_.* |
Classifieds | ^http://www\.leboncoin\.fr\/.*dvd_films.* |
Leisure, Hobbies, Passions | ^http://www\.leboncoin\.fr\/.*cd_musique.* |
Arts & Culture | ^http://www\.leboncoin\.fr\/.*livres.* |
Leisure, Hobbies, Passions | ^http://www\.leboncoin\.fr\/.*animaux.* |
Leisure, Hobbies, Passions | ^http://www\.leboncoin\.fr\/.*velos.* |
Sport | ^http://www\.leboncoin\.fr\/.*sports_hobbies.* |
Leisure, Hobbies, Passions | ^http://www\.leboncoin\.fr\/.*instruments_de_musique.* |
Classifieds | ^http://www\.leboncoin\.fr\/.*collection.* |
Classifieds | ^http://www\.leboncoin\.fr\/.*_.* |
Classifieds | ^http://www\.leboncoin\.fr\/.*autres.* |
Recruitment, Interim | ^http://www\.leboncoin\.fr\/.*_emploi_services_.* |
Recruitment, Interim | ^http://www\.leboncoin\.fr\/.*emploi.* |
Business Services | ^http://www\.leboncoin\.fr\/.*bureaux_commerces.* |
Business Services | ^http://www\.leboncoin\.fr\/.*materiel_professionnel.* |
Personal Services | ^http://www\.leboncoin\.fr\/.*services.* |
Personal Services | ^http://www\.leboncoin\.fr\/.*evenements.* |
Outings, Evenings, Concerts | ^http://www\.leboncoin\.fr\/.*billetterie.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*_vehicules_.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*voitures.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*motos.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*caravaning.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*utilitaires.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*equipement_auto.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*equipement_moto.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*equipement_caravaning.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*nautisme.* |
Cars, Mechanics | ^http://www\.leboncoin\.fr\/.*equipement_nautisme.* |
N.B.: .* matches any sequence of characters, so each regex accepts any text before and after the written expression.
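To check how these regexes behave before entering them, a small Python sketch such as the one below can test sample URLs against a few rows of the table. The `RULES` list holds rows copied from the table; the function `categories_for` and the sample URLs are hypothetical, and the sketch only reports which regexes match. It does not reproduce how the Olfeo solution itself resolves a URL that matches several categories.

```python
import re

# A few (category, regex) rows copied from the table above.
RULES = [
    ("Real estate", r"^http://www\.leboncoin\.fr\/.*colocations.*"),
    ("Real estate", r"^http://www\.leboncoin\.fr\/.*locations.*"),
    ("Recruitment, Interim", r"^http://www\.leboncoin\.fr\/.*_emploi_services_.*"),
    ("Personal Services", r"^http://www\.leboncoin\.fr\/.*services.*"),
    ("Cars, Mechanics", r"^http://www\.leboncoin\.fr\/.*voitures.*"),
]

def categories_for(url):
    """Return every category whose regex matches the URL, in table order, without duplicates."""
    found = []
    for category, pattern in RULES:
        if re.match(pattern, url) and category not in found:
            found.append(category)
    return found

for sample in (
    "http://www.leboncoin.fr/voitures/offres/",
    "http://www.leboncoin.fr/colocations/offres/",
    "http://www.leboncoin.fr/_emploi_services_/offres/",
):
    print(sample, "->", categories_for(sample))
```

Because of the surrounding .*, a URL containing `_emploi_services_` matches both the "Recruitment, Interim" regex and the broader `services` regex under "Personal Services". Overlaps of this kind are worth checking when deciding which regexes to enter.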
It will thus be possible to filter each theme on the leboncoin.fr site using the policies applied to Olfeo categories.
This method may seem time-consuming, but it enables advanced filtering of the site.
These are suggestions only. Apply regex categorization according to filtering or statistical needs.