OpenTTD Source
20241108-master-g80f628063a
|
Handling of C-type strings (char*). More...
#include "stdafx.h"
#include "debug.h"
#include "core/alloc_func.hpp"
#include "core/math_func.hpp"
#include "error_func.h"
#include "string_func.h"
#include "string_base.h"
#include "table/control_codes.h"
#include <sstream>
#include <iomanip>
#include <unicode/ustring.h>
#include "language.h"
#include "gfx_func.h"
#include "safeguards.h"
#include <unicode/stsearch.h>
#include <unicode/utext.h>
#include <unicode/brkiter.h>
Go to the source code of this file.
Data Structures | |
struct | CaseInsensitiveCharTraits |
Case insensitive implementation of the standard character type traits. More... | |
class | IcuStringIterator |
String iterator using ICU as a backend. More... | |
Typedefs | |
typedef std::basic_string_view< char, CaseInsensitiveCharTraits > | CaseInsensitiveStringView |
Case insensitive string view. | |
Functions | |
void | strecpy (std::span< char > dst, std::string_view src) |
Copies characters from one buffer to another. More... | |
std::string | FormatArrayAsHex (std::span< const uint8_t > data) |
Format a byte array into a continuous hex string. More... | |
template<class T > | |
static void | StrMakeValid (T &dst, const char *str, const char *last, StringValidationSettings settings) |
Copies the valid (UTF-8) characters from str up to last to the dst . More... | |
void | StrMakeValidInPlace (char *str, const char *last, StringValidationSettings settings) |
Scans the string for invalid characters and replaces then with a question mark '?' (if not ignored). More... | |
void | StrMakeValidInPlace (char *str, StringValidationSettings settings) |
Scans the string for invalid characters and replaces then with a question mark '?' (if not ignored). More... | |
std::string | StrMakeValid (std::string_view str, StringValidationSettings settings) |
Copies the valid (UTF-8) characters from str to the returned string. More... | |
bool | StrValid (std::span< const char > str) |
Checks whether the given string is valid, i.e. More... | |
void | StrTrimInPlace (std::string &str) |
Trim the spaces from given string in place, i.e. More... | |
std::string_view | StrTrimView (std::string_view str) |
bool | StrStartsWithIgnoreCase (std::string_view str, const std::string_view prefix) |
Check whether the given string starts with the given prefix, ignoring case. More... | |
bool | StrEndsWithIgnoreCase (std::string_view str, const std::string_view suffix) |
Check whether the given string ends with the given suffix, ignoring case. More... | |
int | StrCompareIgnoreCase (const std::string_view str1, const std::string_view str2) |
Compares two string( view)s, while ignoring the case of the characters. More... | |
bool | StrEqualsIgnoreCase (const std::string_view str1, const std::string_view str2) |
Compares two string( view)s for equality, while ignoring the case of the characters. More... | |
size_t | Utf8StringLength (const char *s) |
Get the length of an UTF-8 encoded string in number of characters and thus not the number of bytes that the encoded string contains. More... | |
size_t | Utf8StringLength (const std::string &str) |
Get the length of an UTF-8 encoded string in number of characters and thus not the number of bytes that the encoded string contains. More... | |
bool | strtolower (std::string &str, std::string::size_type offs) |
bool | IsValidChar (char32_t key, CharSetFilter afilter) |
Only allow certain keys. More... | |
size_t | Utf8Decode (char32_t *c, const char *s) |
Decode and consume the next UTF-8 encoded character. More... | |
template<class T > | |
size_t | Utf8Encode (T buf, char32_t c) |
Encode a unicode character and place it in the buffer. More... | |
size_t | Utf8Encode (char *buf, char32_t c) |
size_t | Utf8Encode (std::ostreambuf_iterator< char > &buf, char32_t c) |
size_t | Utf8Encode (std::back_insert_iterator< std::string > &buf, char32_t c) |
size_t | Utf8TrimString (char *s, size_t maxlen) |
Properly terminate an UTF8 string to some maximum length. More... | |
static bool | IsGarbageCharacter (char32_t c) |
Test if a unicode character is considered garbage to be skipped. More... | |
static std::string_view | SkipGarbage (std::string_view str) |
Skip some of the 'garbage' in the string that we don't want to use to sort on. More... | |
int | StrNaturalCompare (std::string_view s1, std::string_view s2, bool ignore_garbage_at_front) |
Compares two strings using case insensitive natural sort. More... | |
static int | ICUStringContains (const std::string_view str, const std::string_view value, bool case_insensitive) |
Search if a string is contained in another string using the current locale. More... | |
bool | StrNaturalContains (const std::string_view str, const std::string_view value) |
Checks if a string is contained in another string with a locale-aware comparison that is case sensitive. More... | |
bool | StrNaturalContainsIgnoreCase (const std::string_view str, const std::string_view value) |
Checks if a string is contained in another string with a locale-aware comparison that is case insensitive. More... | |
static int | ConvertHexNibbleToByte (char c) |
Convert a single hex-nibble to a byte. More... | |
bool | ConvertHexToBytes (std::string_view hex, std::span< uint8_t > bytes) |
Convert a hex-string to a byte-array, while validating it was actually hex. More... | |
Handling of C-type strings (char*).
Definition in file string.cpp.
|
static |
Convert a single hex-nibble to a byte.
c | The hex-nibble to convert. |
Definition at line 715 of file string.cpp.
Referenced by ConvertHexToBytes().
bool ConvertHexToBytes | ( | std::string_view | hex, |
std::span< uint8_t > | bytes | ||
) |
Convert a hex-string to a byte-array, while validating it was actually hex.
hex | The hex-string to convert. |
bytes | The byte-array to write the result to. |
Definition at line 734 of file string.cpp.
References ConvertHexNibbleToByte().
Referenced by X25519AuthorizedKeyClientHandler::GetValidSecretKeyAndUpdatePublicKey(), GRFLoadConfig(), PickerLoadConfig(), and ValidateSignature().
std::string FormatArrayAsHex | ( | std::span< const uint8_t > | data | ) |
Format a byte array into a continuous hex string.
data | Array to format |
Definition at line 81 of file string.cpp.
Referenced by AddGrfInfo(), CalculateHashV1(), GenerateUid(), X25519AuthenticationHandler::GetPeerPublicKey(), X25519AuthorizedKeyClientHandler::GetValidSecretKeyAndUpdatePublicKey(), HandleSavegameLoadCrash(), IsGoodGRFConfigList(), CommandHelperBase::LogCommandExecution(), NetworkContentListWindow::OpenExternalSearch(), OutputContentState(), ClientNetworkGameSocketHandler::Receive_SERVER_CHECK_NEWGRFS(), and SurveyGrfs().
|
static |
Search if a string is contained in another string using the current locale.
str | String to search in. |
value | String to search for. |
case_insensitive | Search case-insensitive. |
Definition at line 630 of file string.cpp.
References _current_collator.
Referenced by StrNaturalContains(), and StrNaturalContainsIgnoreCase().
|
static |
Test if a unicode character is considered garbage to be skipped.
c | Character to test. |
Definition at line 549 of file string.cpp.
Referenced by SkipGarbage().
bool IsValidChar | ( | char32_t | key, |
CharSetFilter | afilter | ||
) |
Only allow certain keys.
You can define the filter to be used. This makes sure no invalid keys can get into an editbox, like BELL.
key | character to be checked |
afilter | the filter to use |
Definition at line 396 of file string.cpp.
References CS_ALPHANUMERAL.
Referenced by IConsoleCmdExec().
|
static |
Skip some of the 'garbage' in the string that we don't want to use to sort on.
This way the alphabetical sorting will work better as we would be actually using those characters instead of some other characters such as spaces and tildes at the begin of the name.
str | The string to skip the initial garbage of. |
Definition at line 568 of file string.cpp.
References IsGarbageCharacter(), and Utf8Decode().
Referenced by StrNaturalCompare().
int StrCompareIgnoreCase | ( | const std::string_view | str1, |
const std::string_view | str2 | ||
) |
Compares two string( view)s, while ignoring the case of the characters.
str1 | The first string. |
str2 | The second string. |
Definition at line 334 of file string.cpp.
Referenced by MatchesExtension(), and StrEqualsIgnoreCase().
void strecpy | ( | std::span< char > | dst, |
std::string_view | src | ||
) |
Copies characters from one buffer to another.
Copies the source string to the destination buffer with respect of the terminating null-character and the size of the destination buffer.
dst | The destination buffer |
src | The buffer containing the string to copy |
Definition at line 60 of file string.cpp.
Referenced by FileStringReader::HandlePragma().
bool StrEndsWithIgnoreCase | ( | std::string_view | str, |
const std::string_view | suffix | ||
) |
Check whether the given string ends with the given suffix, ignoring case.
str | The string to look at. |
suffix | The suffix to look for. |
Definition at line 321 of file string.cpp.
References StrEqualsIgnoreCase().
Referenced by TextfileWindow::AfterLoadText().
bool StrEqualsIgnoreCase | ( | const std::string_view | str1, |
const std::string_view | str2 | ||
) |
Compares two string( view)s for equality, while ignoring the case of the characters.
str1 | The first string. |
str2 | The second string. |
Definition at line 347 of file string.cpp.
References StrCompareIgnoreCase().
Referenced by ChangeWorkingDirectoryToExecutable(), DEF_CONSOLE_CMD(), AIScannerInfo::FindInfo(), GameScannerInfo::FindInfo(), FiosGetSavegameListCallback(), FiosGetScenarioListCallback(), FiosGetTownDataListCallback(), BlitterFactory::GetBlitterFactory(), ScriptScanner::RegisterScript(), StrEndsWithIgnoreCase(), StringToContentType(), and StrStartsWithIgnoreCase().
std::string StrMakeValid | ( | std::string_view | str, |
StringValidationSettings | settings | ||
) |
Copies the valid (UTF-8) characters from str
to the returned string.
Depending on the settings
invalid characters can be replaced with a question mark, as well as determining what characters are deemed invalid.
str | The string to validate. |
settings | The settings for the string validation. |
Definition at line 205 of file string.cpp.
References settings, and StrMakeValid().
|
static |
Copies the valid (UTF-8) characters from str
up to last
to the dst
.
Depending on the settings
invalid characters can be replaced with a question mark, as well as determining what characters are deemed invalid.
It is allowed for dst
to be the same as src
, in which case the string is make valid in place.
dst | The destination to write to. |
str | The string to validate. |
last | The last valid character of str. |
settings | The settings for the string validation. |
Definition at line 107 of file string.cpp.
Referenced by ScriptInfo::AddLabels(), ScriptInfo::AddSetting(), ExtractString(), GetFileTitle(), GetMusicCatEntryName(), IConsolePrint(), IniGroup::IniGroup(), IniItem::IniItem(), IniLoadFile::LoadFromDisk(), LoadFromHighScore(), TextfileWindow::LoadText(), StringSettingDesc::MakeValueValid(), ReadDWordAsString(), Packet::Recv_string(), ShowNewGrfVehicleError(), StrMakeValid(), and StrMakeValidInPlace().
void StrMakeValidInPlace | ( | char * | str, |
const char * | last, | ||
StringValidationSettings | settings | ||
) |
Scans the string for invalid characters and replaces then with a question mark '?' (if not ignored).
str | The string to validate. |
last | The last valid character of str. |
settings | The settings for the string validation. |
Definition at line 178 of file string.cpp.
References settings, and StrMakeValid().
Referenced by Textbuf::Assign(), and StrMakeValidInPlace().
void StrMakeValidInPlace | ( | char * | str, |
StringValidationSettings | settings | ||
) |
Scans the string for invalid characters and replaces then with a question mark '?' (if not ignored).
Only use this function when you are sure the string ends with a '\0'; otherwise use StrMakeValidInPlace(str, last, settings) variant.
str | The string (of which you are sure ends with '\0') to validate. |
Definition at line 192 of file string.cpp.
References settings, and StrMakeValidInPlace().
int StrNaturalCompare | ( | std::string_view | s1, |
std::string_view | s2, | ||
bool | ignore_garbage_at_front | ||
) |
Compares two strings using case insensitive natural sort.
s1 | First string to compare. |
s2 | Second string to compare. |
ignore_garbage_at_front | Skip punctuation characters in the front |
Definition at line 589 of file string.cpp.
References _current_collator, and SkipGarbage().
Referenced by CargoSpecNameSorter(), GRFSorter(), IndustryDirectoryWindow::IndustryNameSorter(), IndustryTypeNameSorter(), NetworkContentListWindow::NameSorter(), NewGRFWindow::NameSorter(), DropDownString< TBase, TFs, TEnd >::NatSortFunc(), NetworkGameWindow::NGameNameSorter(), SignList::SignNameSorter(), CompanyStationsWindow::StationNameSorter(), TownDirectoryWindow::TownNameSorter(), and NetworkContentListWindow::TypeSorter().
bool StrNaturalContains | ( | const std::string_view | str, |
const std::string_view | value | ||
) |
Checks if a string is contained in another string with a locale-aware comparison that is case sensitive.
str | The string to search in. |
value | The string to search for. |
Definition at line 660 of file string.cpp.
References ICUStringContains(), MacOSStringContains(), and Win32StringContains().
Referenced by StringFilter::AddLine().
bool StrNaturalContainsIgnoreCase | ( | const std::string_view | str, |
const std::string_view | value | ||
) |
Checks if a string is contained in another string with a locale-aware comparison that is case insensitive.
str | The string to search in. |
value | The string to search for. |
Definition at line 687 of file string.cpp.
References ICUStringContains(), MacOSStringContains(), and Win32StringContains().
Referenced by StringFilter::AddLine().
bool StrStartsWithIgnoreCase | ( | std::string_view | str, |
const std::string_view | prefix | ||
) |
Check whether the given string starts with the given prefix, ignoring case.
str | The string to look at. |
prefix | The prefix to look for. |
Definition at line 281 of file string.cpp.
References StrEqualsIgnoreCase().
void StrTrimInPlace | ( | std::string & | str | ) |
Trim the spaces from given string in place, i.e.
the string buffer that is passed will be modified whenever spaces exist in the given string. When there are spaces at the begin, the whole string is moved forward and when there are spaces at the back the '\0' termination is moved.
str | The string to perform the in place trimming on. |
Definition at line 260 of file string.cpp.
Referenced by CheckClientAndServerName(), NetworkValidateClientName(), and NetworkValidateServerName().
bool StrValid | ( | std::span< const char > | str | ) |
Checks whether the given string is valid, i.e.
contains only valid (printable) characters and is properly terminated.
str | Span of chars to validate. |
Definition at line 227 of file string.cpp.
References Utf8Decode(), and Utf8EncodedCharLen().
size_t Utf8Decode | ( | char32_t * | c, |
const char * | s | ||
) |
Decode and consume the next UTF-8 encoded character.
c | Buffer to place decoded character. |
s | Character stream to retrieve character from. |
Definition at line 419 of file string.cpp.
References GB(), and HasBit().
Referenced by FixSCCEncoded(), SkipGarbage(), StrValid(), and TranslateTTDPatchCodes().
|
inline |
Encode a unicode character and place it in the buffer.
T | Type of the buffer. |
buf | Buffer to place character. |
c | Unicode character to encode. |
Definition at line 460 of file string.cpp.
Referenced by FixSCCEncoded().
size_t Utf8StringLength | ( | const char * | s | ) |
Get the length of an UTF-8 encoded string in number of characters and thus not the number of bytes that the encoded string contains.
s | The string to get the length for. |
Definition at line 359 of file string.cpp.
Referenced by Textbuf::Assign(), CmdAlterGroup(), CmdPlaceSign(), CmdRenameCompany(), CmdRenameDepot(), CmdRenameEngine(), CmdRenamePresident(), CmdRenameSign(), CmdRenameStation(), CmdRenameTown(), CmdRenameVehicle(), CmdRenameWaypoint(), Utf8StringLength(), and VerifyTownName().
size_t Utf8StringLength | ( | const std::string & | str | ) |
Get the length of an UTF-8 encoded string in number of characters and thus not the number of bytes that the encoded string contains.
s | The string to get the length for. |
Definition at line 373 of file string.cpp.
References Utf8StringLength().
size_t Utf8TrimString | ( | char * | s, |
size_t | maxlen | ||
) |
Properly terminate an UTF8 string to some maximum length.
s | string to check if it needs additional trimming |
maxlen | the maximum length the buffer can have. |
Definition at line 508 of file string.cpp.
References Utf8EncodedCharLen().