
Indexing page with external content


Hi,

I have a page that gets its content from an external source ("available jobs").

How should I index this page in Find when the content does not come from EPiServer CMS?

I have looked at Connectors, but it seems unnecessary to pay for them when it's just one page.

#171979
Edited, Nov 22, 2016 17:06

Hi Pär,

You can create an extension method for that page type, and then index it as a field by setting custom conventions in an initialization module:

SearchClient.Instance.Conventions
    .ForInstancesOf<MyPageType>()
    .IncludeField(x => x.MyExtension());
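To put the pieces together, here is a minimal sketch of the extension method and the initialization module. The names MyPageType, MyPageTypeExtensions, FindConventionsInitialization and the placeholder text returned by MyExtension() are assumptions for illustration only; replace the method body with whatever call fetches your "available jobs" content.

```csharp
using EPiServer.Core;
using EPiServer.Find.ClientConventions;
using EPiServer.Find.Framework;
using EPiServer.Framework;
using EPiServer.Framework.Initialization;

// Hypothetical page type standing in for the real "available jobs" page.
public class MyPageType : PageData
{
}

public static class MyPageTypeExtensions
{
    // Hypothetical extension method: returns the external text that should be
    // searchable together with the page.
    public static string MyExtension(this MyPageType page)
    {
        // Assumption: replace this placeholder with a call to the external source.
        return "Text fetched from the external job source";
    }
}

[InitializableModule]
[ModuleDependency(typeof(EPiServer.Web.InitializationModule))]
public class FindConventionsInitialization : IInitializableModule
{
    public void Initialize(InitializationEngine context)
    {
        // Register the convention once at startup so that the return value of
        // MyExtension() is stored as a field whenever a MyPageType is indexed.
        SearchClient.Instance.Conventions
            .ForInstancesOf<MyPageType>()
            .IncludeField(x => x.MyExtension());
    }

    public void Uninitialize(InitializationEngine context)
    {
    }
}
```

Keep in mind that the field is evaluated when the page is indexed, so the page has to be reindexed (for example by republishing it or running the indexing job) before changes in the external source show up in search results.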
#172002
Edited, Nov 23, 2016 10:41

Thanks Dejan for your response, I'll look into it.


I have one more question from the customer.

The customer's website has pages that redirect to external websites. Is it possible to get these external pages indexed too?

#172045
Nov 24, 2016 12:53

Hi Pär,

Episerver Find is not limited to Episerver content (pages, blocks, etc.).

You can index custom classes as well.

For external pages, you could create a custom crawler. HtmlAgilityPack is your best friend: https://www.nuget.org/packages/HtmlAgilityPack

And then index the content using standard API calls: http://world.episerver.com/documentation/Items/Developers-Guide/EPiServer-Find/11/DotNET-Client-API/Indexing/
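As a rough sketch of that approach, the code below loads one external page with HtmlAgilityPack and indexes it as a custom object. The ExternalPage class, the ExternalPageIndexer helper and the choice of a hashed URL as the document id are assumptions made for this example, not something Find requires.

```csharp
using System;
using System.Security.Cryptography;
using System.Text;
using EPiServer.Find;
using EPiServer.Find.Framework;
using HtmlAgilityPack;

// Hypothetical document type describing one external page.
public class ExternalPage
{
    // Explicit id, so indexing the same URL again updates the existing
    // document instead of adding a new one.
    [Id]
    public string Id { get; set; }

    public string Url { get; set; }
    public string Title { get; set; }
    public string Text { get; set; }
}

public static class ExternalPageIndexer
{
    public static void IndexUrl(string url)
    {
        // Download and parse the external page.
        HtmlDocument doc = new HtmlWeb().Load(url);

        string title = doc.DocumentNode.SelectSingleNode("//title")?.InnerText;

        // InnerText of <body> also contains script and style text;
        // filter those nodes out if that matters for your search results.
        string body = doc.DocumentNode.SelectSingleNode("//body")?.InnerText;

        var page = new ExternalPage
        {
            Id = HashUrl(url),
            Url = url,
            Title = HtmlEntity.DeEntitize(title ?? string.Empty).Trim(),
            Text = HtmlEntity.DeEntitize(body ?? string.Empty).Trim()
        };

        // Standard Find client call for indexing an arbitrary object.
        SearchClient.Instance.Index(page);
    }

    // Assumption: a hex-encoded SHA-256 hash of the URL is used as the id
    // to avoid special characters in the id value.
    private static string HashUrl(string url)
    {
        using (var sha = SHA256.Create())
        {
            byte[] hash = sha.ComputeHash(Encoding.UTF8.GetBytes(url));
            return BitConverter.ToString(hash).Replace("-", "").ToLowerInvariant();
        }
    }
}
```

You would call ExternalPageIndexer.IndexUrl(...) from something like a scheduled job, once for each external URL you want searchable.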

But the question is what they want to achieve, and whether you really have to index external websites.

#172047
Nov 24, 2016 13:07

Once again thank you so much for your help.

That's a good question, and I will put it to the customer.

The customer has previously redirected to a hacked site that showed pornographic material. It's probably good to think this through once or twice.

#172048
Nov 24, 2016 13:19